Posted by Willem Bogaerts on 06/15/07 07:34
> I'm trying to let PHP write a 'sitemap.xml' sitemap for Google and other
> search engines. It's working, except that the content in the XML file doesn't
> seem to be UTF-8 (which it should be, judging by the information given on
> Google's webmaster help center).
How can you tell? YOU tell the system which encoding is used. The system
rarely tells you, because the same bytes can be perfectly valid text in many
encodings and look very different in each of them.
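
To make that concrete, here is a small Python sketch (my own illustration, not
something from the original question; the helper name decode_with is made up).
It decodes the same two bytes under several encodings. None of the decodes
fails, which is exactly why the bytes alone cannot tell you which one was meant.

```python
def decode_with(raw, encodings):
    """Decode the same bytes under each named encoding."""
    return {name: raw.decode(name) for name in encodings}


# U+00E9 (e with acute accent) becomes two bytes in UTF-8: 0xC3 0xA9.
raw = "\u00e9".encode("utf-8")

views = decode_with(raw, ["utf-8", "latin-1", "cp1252"])
# views["utf-8"]   is one character:  U+00E9
# views["latin-1"] is two characters: U+00C3 U+00A9 (typical mojibake)
# views["cp1252"]  happens to match the latin-1 result for these bytes
```

All three results are legal text; only the person who wrote the bytes knows
which reading is the intended one.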
Even when the system does tell you, it usually does so separately from the
text itself. That makes sense, because you need to know the encoding before
you can read the text! In web pages and e-mail, for example, headers declare
the encoding of the data.
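
As a rough sketch of how a receiver uses such a header, the Python below reads
the charset out of a Content-Type value and only then decodes the body. The
function names, the sample header and the sample body are assumptions made up
for illustration; the parsing itself relies on the standard library's
email.message.Message.

```python
from email.message import Message


def charset_from_content_type(header_value, default="utf-8"):
    """Return the charset named in a Content-Type header value.

    Falls back to `default` when the header names no charset.
    """
    msg = Message()
    msg["Content-Type"] = header_value
    return msg.get_content_charset(default)


def decode_body(body, header_value):
    """Decode raw bytes using the encoding announced in the header."""
    return body.decode(charset_from_content_type(header_value))


# Hypothetical response: the header travels next to the bytes, not inside them.
header = "application/xml; charset=UTF-8"
body = b"<loc>http://example.com/caf\xc3\xa9</loc>"
text = decode_body(body, header)
```

If the sender declares one encoding and writes the bytes in another, this
decode either fails or quietly produces garbage, so the declaration and the
actual output have to agree.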
I suggest you search the net for encodings and how to work with them.
This is a good start:
http://www.joelonsoftware.com/articles/Unicode.html
Good luck with the onions,
--
Willem Bogaerts
Application smith
Kratz B.V.
http://www.kratz.nl/