Re: locale fr_FR.utf8 and str_word_count() — PHP Programming Language

You are here: Re: locale fr_FR.utf8 and str_word_count() « PHP Programming Language « IT news, forums, messages

Posted by Kimmo Laine on 08/30/06 10:04

"Peter M�nster" <look@signature.invalid> wrote in message
news:Pine.LNX.4.64.0608300806240.28934@gaston.deltadore.bzh...
> Hello,
>
> str_word_count() does not seem to work with locale "fr_FR.utf8".
> The output of the following script is
> string(10) "fr_FR.utf8" Array ( [0] => bi [1] => re )
>
> I think, that "bi�re" should be recognized as word.
>
> Here is the test-script:
>
> <?
> echo '<html><head>
> <meta http-equiv="content-type" content="text/html; charset=utf-8" />
> </head><body>';
> var_dump(setlocale(LC_ALL, 'fr_FR.utf8'));
> print_r(str_word_count('bi�re', 1));
> echo '</body></html>';
> ?>
>
> Could someone help please?
> My PHP version is 5.1.2.

That might be a multibyte-string related problem. If the string is encoded
using multibyte charset, such as utf-8, it could be the reason
str_word_count is confused. PHP has a library for multibyte-functionality
designed to overcome the problems created by multibyte-encoded strings.
See:
http://fi2.php.net/manual/en/ref.mbstring.php

Once you've installed multibyte library, you could try writing a regular
expression for counting the words and use it with the mb_ereg* functions.

It's very sad that handling multibyte strings is not as easy as it would be
with simple english charset, but on the bright side, at least there is some
sort of support for it with the multibyte function library.

--
"Ohjelmoija on organismi joka muuttaa kofeiinia koodiksi" - lpk
http://outolempi.net/ahdistus/ - Satunnaisesti p�ivittyv� nettisarjis
spam@outolempi.net || Gedoon-S @ IRCnet || rot13(xvzzb@bhgbyrzcv.arg)

Navigation:

Next in forum: Re: Is PHP.net web server down?
Prev in forum: Re: Video file Uploading
Thread view: Re: locale fr_FR.utf8 and str_word_count()

[Reply to this message]

Удаленная работа для программистов • Как заработать на Google AdSense • England, UK • статьи на английском • PHP MySQL CMS Apache Oscommerce • Online Business Knowledge Base • DVD MP3 AVI MP4 players codecs conversion help

Home • Search • Site Map • Set as Homepage • Add to Favourites

Сайт изготовлен в Студии Валентина Петручека —
изготовление и поддержка веб-сайтов, разработка программного обеспечения, поисковая оптимизация