Posted by Toby Inkster on 05/27/06 14:17
ironcorona wrote:
> How come you're using UTF-16? Russian and Chinese can both be encoded
> in UTF-8.
A typical Chinese character takes up 16 bits in a UTF-16 file, but 24
bits in a UTF-8 file, so a UTF-8 file may be up to 50% bigger than its
UTF-16 equivalent. Most Western characters (the ASCII range) use only
8 bits in UTF-8, but 16 in UTF-16, so for Western languages, UTF-8 can
be up to 50% smaller than UTF-16.
So if a page uses primarily non-Western characters, UTF-16 is often a
better choice.
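If you want to check the arithmetic yourself, here is a rough Python
sketch. The function name and the sample strings are my own choices for
illustration, and I encode as "utf-16-le" so that the 2-byte byte-order
mark doesn't skew such short samples.

```python
def encoded_sizes(text):
    """Return (utf8_bytes, utf16_bytes) for text, not counting a BOM."""
    # Plain "utf-16" would prepend a 2-byte byte-order mark; "utf-16-le" does not.
    return len(text.encode("utf-8")), len(text.encode("utf-16-le"))

# Hypothetical sample strings, chosen only for illustration.
samples = {
    "Chinese": "中文字符编码",
    "Russian": "Привет",
    "English": "Hello",
}

for name, text in samples.items():
    utf8, utf16 = encoded_sizes(text)
    print(f"{name}: UTF-8 {utf8} bytes, UTF-16 {utf16} bytes")
```

With these samples the Chinese string is 18 bytes in UTF-8 against 12 in
UTF-16, the English one is 5 against 10, and the Russian one comes out
the same size (12 bytes) either way, since Cyrillic letters need 16 bits
in both encodings.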
--
Toby A Inkster BSc (Hons) ARCS
Contact Me ~ http://tobyinkster.co.uk/contact