Euan Clark wrote:
Adult/Non-adult: Is there an existing content clustering plugin (e.g.
carrot) that can do this?

Unfortunately, no. However, the idea itself is very simple, and it's not complicated to implement a Nutch plugin that performs this tagging.


The scenario I'm thinking about  is when an adult-related search query
allows adult-related results and a non-adult query filters them out.

E.g. Query 'toys' - you don't want adult-related results turning up ....

Right, this wouldn't do. The trick is to add the "adult" flag as a prohibited term by default, and only remove it when users specifically request this type of results. Of course you can also remove the offending pages by not indexing them at all, if you never plan to show adult pages in results.

--
Best regards,
Andrzej Bialecki     <><
 ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com

Reply via email to