Hi

> I wanted to do some sort of faceted search with Nutch, but am not able to
> figure out a clean and elegant solution for this. Could anyone give me any
> sort of pointers on how to achieve this?
Hope this answers your question: every field that Lucene indexes is a sort of facet. You can search within one specific field by passing "fieldname:value" as the query. One such field, by default, is title: searching for title:test returns results that contain "test" in the "title" field, which is derived from the <title> tag.

You can extend the set of fields that Lucene indexes by writing plugins. For example, this

http://office.labforculture.org:8180/search/search.jsp?query=dc_subject:aboriginal

returns all the URLs we have that contain "aboriginal" in the <meta name="DC:subject"> field (and some variations on it). If you define your own metadata and write your own plugin to parse it, you can search on it in the same way. See http://wiki.apache.org/nutch/WritingPluginExample

$2c,
*pike
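To make the "fieldname:value" idea concrete, here is a minimal sketch of building such a query and the matching search URL. It only does string handling and does not call Nutch or Lucene. The base URL and the helper names (SEARCH_PAGE, field_query, search_url) are assumptions for illustration; the "query" parameter name is taken from the example URL above.

```python
from urllib.parse import quote

# Hypothetical location of a Nutch search page; substitute your own.
SEARCH_PAGE = "http://localhost:8080/search.jsp"


def field_query(field, value):
    """Return a fielded query string of the form fieldname:value."""
    return field + ":" + value


def search_url(field, value, base=SEARCH_PAGE):
    """Return a search URL that restricts the query to one indexed field."""
    # Keep ':' readable in the URL; percent-encode everything else unsafe.
    return base + "?query=" + quote(field_query(field, value), safe=":")


if __name__ == "__main__":
    print(search_url("title", "test"))
    print(search_url("dc_subject", "aboriginal"))
```

A field such as dc_subject only returns results if an indexing plugin has actually added it to the index; title should work out of the box.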
