Canonical Form
<Publisher>Harper & Row</Publisher>
<Publisher>Harper & Row</Publisher>
<Publisher><![CDATA[Harper & Row]]></Publisher>
<Publisher>Harper & Row</Publisher>
Canonicalization algorithm:
- Replace all entity references with their replacement text.
- Replace all character entity references with their replacement text.
- Replace CDATA sections with their content.
- Then, replace all illegal characters (e.g., &) with an entity reference
Yes, all three forms are the same!
<Publisher>Harper & Row</Publisher>