Pike, Thanks for your quick response. However I was looking for something sightly different. I understand the concept of query filtering, but what I really need is some sort of "category counting" to refine searches.
For e.g. my documents can have a fieldname - location, which could be any city in a country. I want to display the documents (and count) that match the search query for each city, so that the user can then search within the search results. The name of cities is not known in advance. An example of something similar is - http://reviews.cnet.com/4566-6501_7-0.html I just wanted to know whether anyone has tried doing this using Nutch , and if so then I would be glad if he could give me some pointers for the same. Regards, Puneet On 6/2/07, Pike <[EMAIL PROTECTED]> wrote:
Hi > I wanted to do some sort of faceted search with Nutch, but am not able to > figure out a clean and elegant solution for this. Could anyone give me any > sort of pointers on how to achieve this? hope this answers your question: every field that lucene indexes is a sort of facet. you can search within one specific field by passing "fieldname:value" as the query. one such field by default is title. searching for title:test return results that contain "test" in the "title" field, which was derived from the <title> tag. you could extend the fields that lucene indexes by writing plugins. this http://office.labforculture.org:8180/search/search.jsp?query=dc_subject:aboriginal returns all the urls we have that contain "aboriginal" in the <meta name="DC:subject" field (and some variations on it). if you'd define your own metadata, and write your own plugin to parse that .. see http://wiki.apache.org/nutch/WritingPluginExample $2c, *pike
