Re: Tidy using unicode does not validate — HTML

You are here: Re: Tidy using unicode does not validate « HTML « IT news, forums, messages

Posted by Jukka K. Korpela on 03/16/07 20:46

Scripsit groups2@reenie.org:

> Byte-Order Mark found in UTF-8 File.
- -
> Should I be worried about this ?

Not much. The document
http://www.w3.org/International/questions/qa-utf8-bom
is a bit vague and doesn't list down the software that doesn't grok the BOM,
but the symptoms it mentions (an extra line or the funny characters ï»¿
aren't really catastrophic.

> It seems that only way to avoid this problem is to leave the file in
> Dos and tidy the file in ascii. Is this correct ?

I don't know about the specific software, but leaving the file in ASCII,
presumably with the software presenting any non-ASCII characters as
character or entity references like – or &ndash:, is a good option, if
you have relatively few non-ASCII characters, so that it's not significant
in terms of amount of data.

> I am about to to edit quit a few pages so I want to do whatever will
> be most common and most recommended in the future. Am I safe in
> assuming that will be utf-8 ?

UTF-8 is clearly favored in Internet protocol development, and there's no
reason to expect this to change. But of course ASCII is still slightly
better supported.

--
Jukka K. Korpela ("Yucca")
http://www.cs.tut.fi/~jkorpela/

Navigation:

Next in forum: Re: Web Design and Web Development
Prev in forum: Re: Curious Kink
Thread view: Re: Tidy using unicode does not validate

[Reply to this message]

Удаленная работа для программистов • Как заработать на Google AdSense • England, UK • статьи на английском • PHP MySQL CMS Apache Oscommerce • Online Business Knowledge Base • DVD MP3 AVI MP4 players codecs conversion help

Home • Search • Site Map • Set as Homepage • Add to Favourites

Сайт изготовлен в Студии Валентина Петручека —
изготовление и поддержка веб-сайтов, разработка программного обеспечения, поисковая оптимизация