|
Posted by groups2 on 03/16/07 22:21
On Mar 16, 4:46 pm, "Jukka K. Korpela" <jkorp...@cs.tut.fi> wrote:
> Scripsit grou...@reenie.org:
>
>
>
> > Byte-Order Mark found in UTF-8 File.
> - -
> > Should I be worried about this ?
>
> Not much. The documenthttp://www.w3.org/International/questions/qa-utf8-bom
> is a bit vague and doesn't list down the software that doesn't grok the BOM,
> but the symptoms it mentions (an extra line or the funny characters
> aren't really catastrophic.
You don't work for the people I work for !
With every little non catastrophic "mistake" I see my job slipping
away to India where everyone works for 8 dollars an hour and these
things never happen.
>
> > It seems that only way to avoid this problem is to leave the file in
> > Dos and tidy the file in ascii. Is this correct ?
>
> I don't know about the specific software, but leaving the file in ASCII,
> presumably with the software presenting any non-ASCII characters as
> character or entity references like – or &ndash:, is a good option, if
> you have relatively few non-ASCII characters, so that it's not significant
> in terms of amount of data.
OK thanks.
My problem right now is that any when I clean up a page in tidy using
utf encoding
it takes out all the $mdash; characters and ads long dashes. That's
fine but when I
validate the page with the w3 validator, it says the long dashes
cannot be interpreted as utf-8. It just doesn't make any sense. I'm
guessing tidy is wrong to replace — but I'm only guessing. .
>
> > I am about to to edit quit a few pages so I want to do whatever will
> > be most common and most recommended in the future. Am I safe in
> > assuming that will be utf-8 ?
>
> UTF-8 is clearly favored in Internet protocol development, and there's no
> reason to expect this to change. But of course ASCII is still slightly
> better supported.
>
> --
> Jukka K. Korpela ("Yucca")http://www.cs.tut.fi/~jkorpela/
Navigation:
[Reply to this message]
|