Hi.

Is anyone aware of an anti-spam plugin for Nutch? Or even a decent Java
spam/ham classifier?

I was thinking of creating a plugin that uses jASEN (http://www.jasen.org/)
and adds a field "spamValue" to each indexed document. Alternatively, I will
compute this outside of Nutch. Then one can search in Nutch and get guaranteed
spam-free results, i.e. only documents where spamValue <= 0.1.
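
Roughly, the idea could be sketched as follows in plain Java. This is only an
illustration under assumptions: it uses neither the Nutch plugin API nor the
jASEN API. The SpamScorer interface, the toy word-list scorer and the
map-based "document" are hypothetical stand-ins; only the "spamValue" field
name and the 0.1 threshold come from the description above.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;

public class SpamValueSketch {
    // Threshold from the description: keep documents with spamValue <= 0.1.
    static final double SPAM_THRESHOLD = 0.1;

    // Hypothetical stand-in for a real classifier (e.g. a wrapper around jASEN).
    interface SpamScorer {
        double score(String text); // 0.0 = ham, 1.0 = spam
    }

    // Toy word list, an assumption for illustration only.
    static final Set<String> SPAM_WORDS =
            new HashSet<>(Arrays.asList("viagra", "cheap", "casino"));

    // Toy scorer: fraction of whitespace-separated tokens found in SPAM_WORDS.
    static double toyScore(String text) {
        String trimmed = text.trim().toLowerCase();
        if (trimmed.isEmpty()) {
            return 0.0;
        }
        String[] tokens = trimmed.split("\\s+");
        int hits = 0;
        for (String token : tokens) {
            if (SPAM_WORDS.contains(token)) {
                hits++;
            }
        }
        return (double) hits / tokens.length;
    }

    // "Index time": attach the score to the document as the field "spamValue".
    static Map<String, Object> indexDocument(String url, String content, SpamScorer scorer) {
        Map<String, Object> doc = new LinkedHashMap<>();
        doc.put("url", url);
        doc.put("content", content);
        doc.put("spamValue", scorer.score(content));
        return doc;
    }

    // "Search time": keep only documents at or below the threshold.
    static List<Map<String, Object>> spamFree(List<Map<String, Object>> docs) {
        List<Map<String, Object>> kept = new ArrayList<>();
        for (Map<String, Object> doc : docs) {
            double spamValue = (Double) doc.get("spamValue");
            if (spamValue <= SPAM_THRESHOLD) {
                kept.add(doc);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        List<Map<String, Object>> docs = new ArrayList<>();
        docs.add(indexDocument("http://example.com/a", "buy cheap viagra now",
                SpamValueSketch::toyScore));
        docs.add(indexDocument("http://example.com/b", "nutch plugin indexing notes",
                SpamValueSketch::toyScore));
        for (Map<String, Object> doc : spamFree(docs)) {
            System.out.println(doc.get("url") + " spamValue=" + doc.get("spamValue"));
        }
    }
}
```

Storing the score as a field at index time keeps the classifier out of the
query path, so the threshold can be tuned later without re-running the
classifier.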

Kindly

//Marcus

-- 
Marcus Herou CTO and co-founder Tailsweep AB
+46702561312
[EMAIL PROTECTED]
http://www.tailsweep.com/
http://blogg.tailsweep.com/
