Hi, I am at the start of my university dissertation on interactive query expansion. At the moment I am using an Ajax framework and WordNet to suggest alternative or additional search terms based on the user's original query. The web page is updated as the user types. I now wish to integrate my system into a search engine, and Nutch seems suitable. I have successfully completed the whole-web crawl tutorial. I have two questions:
1. I wish to formulate a boolean query using the OR operator to search on all of the alternative search terms WordNet has suggested. I have found no documentation on this in either the Wiki or the mailing list archive. Are boolean queries possible in Nutch?

2. How do I extract all index terms from Nutch, and possibly their tf/idf scores too? I intend to use this information to build a function similar to Google Suggest, in that as you type, suggested terms appear based on terms actually in the index. I would want to put the terms and their associated scores into a database such as PostgreSQL.

Any pointers would be much appreciated!

Regards,
Nick.
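
P.S. To make question 1 concrete, here is a rough sketch of the kind of query string I have in mind. It only illustrates the idea: the function name build_or_query is made up, and the Lucene-style "OR" keyword and phrase quoting are assumptions on my part. Whether Nutch's query parser accepts anything like this is exactly what I am asking.

```python
def build_or_query(original, alternatives):
    """Join the original term and its suggested alternatives with OR.

    Hypothetical helper: the 'OR' syntax is assumed, not confirmed for Nutch.
    """
    seen = []
    for term in [original, *alternatives]:
        term = term.strip()
        # Skip empty strings and case-insensitive duplicates
        if term and term.lower() not in (t.lower() for t in seen):
            seen.append(term)
    # Quote multi-word terms so they would be treated as phrases
    quoted = [f'"{t}"' if " " in t else t for t in seen]
    return " OR ".join(quoted)


print(build_or_query("car", ["automobile", "motor vehicle", "Car"]))
# prints: car OR automobile OR "motor vehicle"
```

The alternatives list would come from the WordNet lookup I already have; the open question is what Nutch would do with the resulting string.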
