Hi. Anyone aware of an anti-spam plugin for nutch ? Or even a decent java spam/ham classifier?
I was thinking of creating a plugin which uses jASEN http://www.jasen.org/and adds a field "spamValue" to the indexed document or I will create this outside of Nutch. The one can search in nutch and get guaranteed spam-free results i.e. when spamValue <= 0.1 Kindly //Marcus -- Marcus Herou CTO and co-founder Tailsweep AB +46702561312 [EMAIL PROTECTED] http://www.tailsweep.com/ http://blogg.tailsweep.com/
