Hi there, I'm fairly new to Nutch and need a few pointers in the right direction.
I am using Wordnet as part of an interactive query expansion system. Wordnet is stored in a Postgresql database. I would like to show only the terms that actually appear in the Nutch's index rather than every possible hyponym. My solution for this requires that whilst Nutch indexes the documents it has retrieved it, for every indexed word it flags the word in the wordnet database to say that it appears in the index. Understand me so far? Good! Where would I insert the code to carry out this action? Many thanks for any suggestions, Nick.
