Hello I'm new to Nutch, and this mailing list. From what I've seen in my short 3 hours of work with Nutch, I'm very impressed. Excellent work!! I was able to get an index built and serve it up with Tomcat in short order. So, now that I've had that success, I'm on to creating my site with Nutch.
I have a number of boolean type tags that I want to attach to each url (from a domain) that I crawl and add to the index. Of course the various urls will get some of the tags and others will not. This breakdown of domain to tags is stored in a flat file at the moment. To the user, the tags will be shown as checkboxes in a search UI, such that, if you put a check in the box, you will get all urls that have that tag associated, or set to true. How do I do this Nutch? I'm guessing I could use a plugin, but I'm not sure which one. Or, do I need to create my own plugin? Are there example anywhere of implementing something like this? thanks!!
