Hello, this is what I want to do: given a document, find all of its terms and their frequencies.
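To make the goal concrete, here is a minimal sketch in plain Java of the result I am after. It does not use any Lucene or Nutch classes; the class TermFrequencySketch and its methods are hypothetical names made up for illustration, and the tokenizer is a naive split on non-alphanumeric characters. It builds a toy in-memory inverted index and then walks it to list every term of one document with its frequency.

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical illustration only: a toy inverted index, not Lucene's or Nutch's API.
public class TermFrequencySketch {
    // term -> (document id -> number of times the term occurs in that document)
    private final Map<String, Map<Integer, Integer>> postings = new TreeMap<>();

    /** Lowercases the text, splits on non-alphanumeric characters, and records each term. */
    public void addDocument(int docId, String text) {
        for (String token : text.toLowerCase(Locale.ROOT).split("[^a-z0-9]+")) {
            if (token.isEmpty()) {
                continue; // split() yields an empty token when the text starts with a delimiter
            }
            postings.computeIfAbsent(token, t -> new HashMap<>())
                    .merge(docId, 1, Integer::sum);
        }
    }

    /** Walks every term in the index and keeps those that occur in the given document. */
    public Map<String, Integer> termFrequencies(int docId) {
        Map<String, Integer> result = new TreeMap<>();
        for (Map.Entry<String, Map<Integer, Integer>> entry : postings.entrySet()) {
            Integer freq = entry.getValue().get(docId);
            if (freq != null) {
                result.put(entry.getKey(), freq);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        TermFrequencySketch index = new TermFrequencySketch();
        index.addDocument(0, "Nutch is built on top of Lucene");
        index.addDocument(1, "Lucene stores an inverted index; Nutch uses Lucene");
        // Terms of document 1, sorted alphabetically, each with its frequency
        System.out.println(index.termFrequencies(1));
    }
}
```

For document 1 this lists each distinct term once, with "lucene" counted twice. What I am looking for is the equivalent lookup against the index that Nutch builds.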
I understand that Nutch is built on top of Lucene. In Lucene, I can access a document's terms and their frequencies through an IndexReader, but I am not sure whether Nutch has an equivalent. In Lucene, the IndexReader needs to be told where the inverted index is stored. In Nutch, I do not know how or where to locate the inverted index. Is it possible to access the inverted index from Nutch?

Thank you very much for your help.
