Posted by Jukka K. Korpela on 04/09/07 05:07
Simply Confusing! wrote:
> If I use the iso-8859-1 charset definition, I can only use roman
> alpha-numeric sequences in my code (ie, like regular
> english/french/german, or the unicode &#__; sequence), no problem,
> correct?
What you are probably trying to say is correct, but what you are actually
saying is obscure; I mean the "alpha-numeric sequences in my code" part. You
can type almost any Latin letter used in Western European languages
directly, and for characters that have no iso-8859-1 code, you can use a
&#__; sequence (a numeric character reference). (Regular English, French,
and German contain punctuation marks that don't exist in iso-8859-1, such as
curly quotation marks, and French even has a letter, the oe ligature, that
isn't there.)
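As a rough illustration (a sketch in Python, which is not part of the original question; the sample string is made up), the standard "xmlcharrefreplace" error handler produces exactly this mix: characters that exist in iso-8859-1 are written directly, and everything else becomes a &#number; reference.

```python
# Made-up sample: the e-acute exists in iso-8859-1; the curly quotes
# (U+201C, U+201D) and the oe ligature (U+0153) do not.
text = "“cœur” café"

# Characters without an iso-8859-1 code become decimal character references.
encoded = text.encode("iso-8859-1", errors="xmlcharrefreplace")

print(encoded)  # b'&#8220;c&#339;ur&#8221; caf\xe9'
```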
> If I use BIG5 or GB2312 ... I (should/can/must?) insert the original
> chinese figures into the raw code?
You can. You probably should, since it doesn't really make much sense to use
a Chinese encoding and yet use &#__; for Chinese characters, but you can, so
"must" would be wrong.
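To make the trade-off concrete, here is a minimal Python sketch (the sample text and the helper name as_references are made up for illustration). It compares the size of Chinese characters written directly in a Chinese encoding with the same characters written as &#number; references.

```python
def as_references(s):
    """Spell every character as a decimal character reference (hypothetical helper)."""
    return "".join(f"&#{ord(ch)};" for ch in s)

text = "中文"  # made-up sample text

for charset in ("big5", "gb2312"):
    direct = text.encode(charset)                   # the characters themselves
    escaped = as_references(text).encode(charset)   # pure ASCII references
    print(charset, len(direct), "bytes direct,", len(escaped), "bytes as references")
```

Both forms are allowed, but the direct form is shorter and readable in the source, which is why it is the sensible default.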
> If I use UTF-8, I can use either the unicode sequence or the original
> chinese figures.
Yes. Here, too, there's normally little reason not to insert the characters
as such.
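A small Python sketch (sample data made up) of why the two forms are interchangeable in UTF-8: once the bytes are decoded and the references are resolved, the result is the same character either way. Here html.unescape from the standard library stands in for what a browser does with character references.

```python
import html

# One character typed directly, the same character as a reference (U+4E2D = 20013).
page_bytes = "中 &#20013;".encode("utf-8")

decoded = html.unescape(page_bytes.decode("utf-8"))
print(decoded)  # 中 中
```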
> (ps- looking for simple "what is best answers",
> 10-words-or-less-kind-of-thing, no long treatises or court-cases
> please :)
Nobody expects the Spanish inquisition!
--
Jukka K. Korpela ("Yucca")
http://www.cs.tut.fi/~jkorpela/