|
Posted by Jukka K. Korpela on 04/09/07 11:29
Scripsit J.O. Aho:
> (HTML entities still depends on the fonts installed
> on the client machine supports the characters that you want to
> display with the HTML entities).
Please don't confuse people in this already confusing issue. Confusing
entities with character references is understandable (W3C does it too), but
character display does _not_ depend on the way the character was represented
in HTML source.
> Not sure what you mean with Unicode, but I suspect you mean HTML
> entities ( &#XXX; ),
Character references (as defined in SGML and XML, not really in HTML
separately).
UTF-8 is Unicode and you don't need HTML
> entities as you can represent all characters in UTF-8 and you still
> can get trash if you mix Unicode with HTML entities in UTF-8,
> depending on how you insert the HTML entities into the text.
>
>
>> (ps- looking for simple "what is best answers",
>> 10-words-or-less-kind-of-thing, no long treatises or court-cases
>> please :)
>
> Use UTF-8, don't use HTML entities.
No. Use UTF-8 or a Chinese encoding, based on criteria that cannot be
discussed without further information. Using ASCII or ISO-8859-1 with
character references would make sense for pages in Chinese only in rare
cases, e.g. when the pages must be editable (somehow) using simple editors
that don't let you input and view Chinese characters.
Use character references or predefined entity references (like —)
whenever you find them comfortable. You probably won't find them comfortable
at all for Chinese characters, though.
--
Jukka K. Korpela ("Yucca")
http://www.cs.tut.fi/~jkorpela/
Navigation:
[Reply to this message]
|