You are here: Re: URL encoding of non ASCII characters « HTML « IT news, forums, messages
Re: URL encoding of non ASCII characters

Posted by Jukka K. Korpela on 08/27/07 16:29

Scripsit Hugo:

> how do I have to encode non-ASCII characters like German Umlaute? I
> know how to encode "normal problematic" characters like space and &.
> But what do I have to do with these non-ASCII characters?

Some browsers may support a URL encoding that is based on ISO-8859-1 or some
other assumed default, so that you would represent an Umlaut letter as an
octet (byte) by ISO-8859-1 and then encode the result as %xx where xx is the
code in hexadecimal.

However, the modern and official method is based on UTF-8. You first
represent an Umlaut letter as two octets by UTF-8, then encode both as %xx.

References:
http://www.apps.ietf.org/rfc/rfc3986.html#sec-2.5
http://www.w3.org/International/O-URL-and-ident.html

--
Jukka K. Korpela ("Yucca")
http://www.cs.tut.fi/~jkorpela/

 

Navigation:

[Reply to this message]


Удаленная работа для программистов  •  Как заработать на Google AdSense  •  England, UK  •  статьи на английском  •  PHP MySQL CMS Apache Oscommerce  •  Online Business Knowledge Base  •  DVD MP3 AVI MP4 players codecs conversion help
Home  •  Search  •  Site Map  •  Set as Homepage  •  Add to Favourites

Copyright © 2005-2006 Powered by Custom PHP Programming

Сайт изготовлен в Студии Валентина Петручека
изготовление и поддержка веб-сайтов, разработка программного обеспечения, поисковая оптимизация