You are here: Re: Tidy using unicode does not validate « HTML « IT news, forums, messages
Re: Tidy using unicode does not validate

Posted by Jukka K. Korpela on 03/16/07 20:46

Scripsit groups2@reenie.org:

> Byte-Order Mark found in UTF-8 File.
- -
> Should I be worried about this ?

Not much. The document
http://www.w3.org/International/questions/qa-utf8-bom
is a bit vague and doesn't list down the software that doesn't grok the BOM,
but the symptoms it mentions (an extra line or the funny characters 
aren't really catastrophic.

> It seems that only way to avoid this problem is to leave the file in
> Dos and tidy the file in ascii. Is this correct ?

I don't know about the specific software, but leaving the file in ASCII,
presumably with the software presenting any non-ASCII characters as
character or entity references like – or &ndash:, is a good option, if
you have relatively few non-ASCII characters, so that it's not significant
in terms of amount of data.

> I am about to to edit quit a few pages so I want to do whatever will
> be most common and most recommended in the future. Am I safe in
> assuming that will be utf-8 ?

UTF-8 is clearly favored in Internet protocol development, and there's no
reason to expect this to change. But of course ASCII is still slightly
better supported.

--
Jukka K. Korpela ("Yucca")
http://www.cs.tut.fi/~jkorpela/

 

Navigation:

[Reply to this message]


Удаленная работа для программистов  •  Как заработать на Google AdSense  •  England, UK  •  статьи на английском  •  PHP MySQL CMS Apache Oscommerce  •  Online Business Knowledge Base  •  DVD MP3 AVI MP4 players codecs conversion help
Home  •  Search  •  Site Map  •  Set as Homepage  •  Add to Favourites

Copyright © 2005-2006 Powered by Custom PHP Programming

Сайт изготовлен в Студии Валентина Петручека
изготовление и поддержка веб-сайтов, разработка программного обеспечения, поисковая оптимизация