|  | Posted by Jukka K. Korpela on 03/16/07 20:46 
Scripsit groups2@reenie.org:
 > Byte-Order Mark found in UTF-8 File.
 - -
 > Should I be worried about this ?
 
 Not much. The document
 http://www.w3.org/International/questions/qa-utf8-bom
 is a bit vague and doesn't list down the software that doesn't grok the BOM,
 but the symptoms it mentions (an extra line or the funny characters 
 aren't really catastrophic.
 
 > It seems that only way to avoid this problem is to leave the file in
 > Dos and tidy the file in ascii. Is this correct ?
 
 I don't know about the specific software, but leaving the file in ASCII,
 presumably with the software presenting any non-ASCII characters as
 character or entity references like – or &ndash:, is a good option, if
 you have relatively few non-ASCII characters, so that it's not significant
 in terms of amount of data.
 
 > I am about to to edit quit a few pages so I  want to do whatever will
 > be most common and most recommended in the future. Am I safe in
 > assuming that will be utf-8 ?
 
 UTF-8 is clearly favored in Internet protocol development, and there's no
 reason to expect this to change. But of course ASCII is still slightly
 better supported.
 
 --
 Jukka K. Korpela ("Yucca")
 http://www.cs.tut.fi/~jkorpela/
  Navigation: [Reply to this message] |