| 
	
 | 
 Posted by Jukka K. Korpela on 12/20/05 01:40 
"Tony Vella" <tony.vella@rogers.com> wrote: 
 
> I am preparing a series of philatelic html pages (lots of text and a few 
> scans of stamps) which will include alpha-characters (accents) in Italian, 
> French, Spanish, Portuguese and Danish.  
 
They are all covered by the ISO-8859-1 encoding, except for some punctuation  
marks and letters like the oe ligature. If you use windows-1252, you get the  
punctuation marks and the ligature, too. 
 
>The pages I have finished in draft 
> form so far I have encoded UTF-8 but I have just been told that 99% of the 
> world will not be able to read them  
 
Nonsense. More probably, 99 % of the WWW users _are_ able to read them. Well,  
let's say 97.6 %. After all, 96,3 % of all percentages have just been made  
up, and the remaining 4,7 % have been miscalculated. 
 
> and that I should go through all the 
> pages and re-encode them "western european - windows (1252)". 
 
I wouldn't do that at this point, unless you have good tools that do such  
things for you with minimal effort. 
 
> I guess what 
> I would like to know is what encoding would be most effective for these 
> particular languages. 
 
If you were just about to start the project, I would recommend ISO-8859-1 (or  
windows-1252 if you need those extras) - not because of wider browser  
coverage (though there is a _small_ improvement to be gained there) but  
because those encodings are somewhat more efficient (one byte per character,  
whereas UTF-8 uses two bytes for some of the characters you'd use). 
 
UTF-8 is certainly simpler in the future if you'll ever need to add  
characters in other languages. 
 
--  
Yucca, http://www.cs.tut.fi/~jkorpela/ 
Pages about Web authoring: http://www.cs.tut.fi/~jkorpela/www.html
 
  
Navigation:
[Reply to this message] 
 |