Re: Extracting data from Lucene index files

2006-12-25 Thread Venkateshprasanna
Thanks a lot Doron, it worked fine and thanks for your tip as well! Prasanna Using term vectors means passing on the terms too many times - i.e - loop on terms - - loop on docs of a term - - - loop on terms of a doc Would something like this be better: do { System.out.println(tenum.

Re: Extracting data from Lucene index files

2006-12-20 Thread Doron Cohen
Using term vectors means passing on the terms too many times - i.e - loop on terms - - loop on docs of a term - - - loop on terms of a doc Would something like this be better: do { System.out.println(tenum.term()+" appears in "+tenum.docFreq()+" docs!"); TermDocs td = reader.termDo

Re: Extracting data from Lucene index files

2006-12-19 Thread Venkateshprasanna
> Take a look at TermDocs and TermEnum. I need to get the frequency of each word in each of the documents I have indexed. This is what I could do with TermEnums and TermDocs. For each Term from TermEnum, I have instantiated a TermsDoc and for each doc, I am trying to get the frequency of the Ter

Re: Extracting data from Lucene index files

2006-12-13 Thread Grant Ingersoll
Take a look at TermDocs and TermEnum. -Grant On Dec 13, 2006, at 6:02 AM, Venkateshprasanna wrote: I would like to use the data stored in the Lucene indexes, like the words and their frequencies and store them in a database. Can anyone suggest a way of going about it or is it possible at