Posted by Schraalhans Keukenmeester on 06/08/07 12:03
At Fri, 08 Jun 2007 20:52:39 +1000, Pavel Kalinov let h(is|er) monkeys
type:
> Hi all,
>
> I am trying to build an application to classify texts from a number of
> sources. I am programming it in PHP and I go "by the book" - i.e.
> calculating probabilities according to the formula etc.
> It works, but it's very slow (due to slow PHP mathematical
> implementation, I guess).
> Is there some variation of the Naive Bayes classifier which is not so
> demanding in the way of computing power used?
>
> Best
> Pavel
You may like http://xhtml.net/php/PHPNaiveBayesianFilter
I am a bit surprised it runs that slowly for you; the typical algorithms
are not especially taxing.
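For what it's worth, here is a minimal sketch of the usual cheap formulation,
assuming a plain multinomial Naive Bayes with add-one smoothing. It is in
Python rather than PHP just to keep it short, and the class and method names
(NaiveBayes, train, classify) are made up for illustration; this is not the
code from the link above. The counts are gathered once at training time, and
scoring sums logarithms instead of multiplying many small probabilities, so
classifying a text costs one lookup and one log() per word per class.

```python
import math
from collections import Counter, defaultdict


class NaiveBayes:
    """Hypothetical multinomial Naive Bayes sketch with add-one smoothing."""

    def __init__(self):
        self.word_counts = defaultdict(Counter)  # label -> word -> count
        self.doc_counts = Counter()              # label -> number of training texts
        self.vocab = set()                       # every word seen in training

    def train(self, label, text):
        # Tokenising on whitespace is an assumption; use whatever you have.
        words = text.lower().split()
        self.word_counts[label].update(words)
        self.doc_counts[label] += 1
        self.vocab.update(words)

    def classify(self, text):
        words = text.lower().split()
        total_docs = sum(self.doc_counts.values())
        vocab_size = len(self.vocab)
        best_label, best_score = None, -math.inf
        for label, counts in self.word_counts.items():
            total_words = sum(counts.values())
            # Log prior plus the sum of log likelihoods; no products needed.
            score = math.log(self.doc_counts[label] / total_docs)
            for w in words:
                score += math.log((counts[w] + 1) / (total_words + vocab_size))
            if score > best_score:
                best_label, best_score = label, score
        return best_label
```

The same structure should map onto PHP associative arrays and log() without
much trouble. If yours is much slower than this suggests, it may be worth
checking whether the counts are recomputed (or re-read from the database) for
every text you classify.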
As part of an author authenticity scoring app, Naive Bayesian filtering
proved quite useful. For spam filtering, its use *by itself* proves rather
limited: quite a few spam creators (scripts) are well equipped these days
to lower their scores substantially, allowing their messages to leak through.
hth
--
Schraalhans Keukenmeester - schraalhans@the.Spamtrapexample.nl
[Remove the lowercase part of Spamtrap to send me a message]
"strcmp('apples','oranges') < 0"