|
Posted by Jukka K. Korpela on 03/16/07 20:46
Scripsit groups2@reenie.org:
> Byte-Order Mark found in UTF-8 File.
- -
> Should I be worried about this ?
Not much. The document
http://www.w3.org/International/questions/qa-utf8-bom
is a bit vague and doesn't list down the software that doesn't grok the BOM,
but the symptoms it mentions (an extra line or the funny characters 
aren't really catastrophic.
> It seems that only way to avoid this problem is to leave the file in
> Dos and tidy the file in ascii. Is this correct ?
I don't know about the specific software, but leaving the file in ASCII,
presumably with the software presenting any non-ASCII characters as
character or entity references like – or &ndash:, is a good option, if
you have relatively few non-ASCII characters, so that it's not significant
in terms of amount of data.
> I am about to to edit quit a few pages so I want to do whatever will
> be most common and most recommended in the future. Am I safe in
> assuming that will be utf-8 ?
UTF-8 is clearly favored in Internet protocol development, and there's no
reason to expect this to change. But of course ASCII is still slightly
better supported.
--
Jukka K. Korpela ("Yucca")
http://www.cs.tut.fi/~jkorpela/
Navigation:
[Reply to this message]
|