|
Posted by Jukka K. Korpela on 12/20/05 01:40
"Tony Vella" <tony.vella@rogers.com> wrote:
> I am preparing a series of philatelic html pages (lots of text and a few
> scans of stamps) which will include alpha-characters (accents) in Italian,
> French, Spanish, Portuguese and Danish.
They are all covered by the ISO-8859-1 encoding, except for some punctuation
marks and letters like the oe ligature. If you use windows-1252, you get the
punctuation marks and the ligature, too.
>The pages I have finished in draft
> form so far I have encoded UTF-8 but I have just been told that 99% of the
> world will not be able to read them
Nonsense. More probably, 99 % of the WWW users _are_ able to read them. Well,
let's say 97.6 %. After all, 96,3 % of all percentages have just been made
up, and the remaining 4,7 % have been miscalculated.
> and that I should go through all the
> pages and re-encode them "western european - windows (1252)".
I wouldn't do that at this point, unless you have good tools that do such
things for you with minimal effort.
> I guess what
> I would like to know is what encoding would be most effective for these
> particular languages.
If you were just about to start the project, I would recommend ISO-8859-1 (or
windows-1252 if you need those extras) - not because of wider browser
coverage (though there is a _small_ improvement to be gained there) but
because those encodings are somewhat more efficient (one byte per character,
whereas UTF-8 uses two bytes for some of the characters you'd use).
UTF-8 is certainly simpler in the future if you'll ever need to add
characters in other languages.
--
Yucca, http://www.cs.tut.fi/~jkorpela/
Pages about Web authoring: http://www.cs.tut.fi/~jkorpela/www.html
Navigation:
[Reply to this message]
|