Reply to Re: content-type and unicode — HTML

Posted by J.O. Aho on 04/09/07 06:40

Simply Confusing! wrote:
> Let me see if I have this any where near *understanding* ...
>
> If I use the iso-8859-1 charset definition, I can only use roman
> alpha-numeric sequences in my code (ie, like regular english/french/german,
> or the unicode &#__; sequence), no problem, correct?

Yes, thats true (HTML entities still depends on the fonts installed on the
client machine supports the characters that you want to display with the HTML
entities).

> If I use BIG5 or GB2312 ... I (should/can/must?) insert the original chinese
> figures into the raw code?

You should type the Chinese in BIG5/GB2312 directly (in the same way as you
would type German in ISO-8859-1), you still can type English (a-zA-Z) in the
CJK encodings.

> If I use UTF-8, I can use either the unicode sequence or the original
> chinese figures.

Not sure what you mean with Unicode, but I suspect you mean HTML entities (
&#XXX; ), UTF-8 is Unicode and you don't need HTML entities as you can
represent all characters in UTF-8 and you still can get trash if you mix
Unicode with HTML entities in UTF-8, depending on how you insert the HTML
entities into the text.

> (ps- looking for simple "what is best answers",
> 10-words-or-less-kind-of-thing, no long treatises or court-cases please :)

Use UTF-8, don't use HTML entities.

--

//Aho

[Back to original message]

Удаленная работа для программистов • Как заработать на Google AdSense • England, UK • статьи на английском • PHP MySQL CMS Apache Oscommerce • Online Business Knowledge Base • DVD MP3 AVI MP4 players codecs conversion help

Home • Search • Site Map • Set as Homepage • Add to Favourites

Сайт изготовлен в Студии Валентина Петручека —
изготовление и поддержка веб-сайтов, разработка программного обеспечения, поисковая оптимизация