Re: Extract terms not by reader, but by documents

2007-09-06 Thread Mike Klaas
On 6-Sep-07, at 11:48 AM, Grant Ingersoll wrote: On Sep 6, 2007, at 1:32 PM, Rafael Rossini wrote: Karl, I´m aware of IndexReader.getTermFreqVector, with this I can get all terms of a document, but I want all terms of a document that matched a query. Grant, Yes, I think I understand.

Re: Extract terms not by reader, but by documents

2007-09-06 Thread Grant Ingersoll
On Sep 6, 2007, at 1:32 PM, Rafael Rossini wrote: Karl, I´m aware of IndexReader.getTermFreqVector, with this I can get all terms of a document, but I want all terms of a document that matched a query. Grant, Yes, I think I understand. You want to know what terms from your query matched i

Re: Extract terms not by reader, but by documents

2007-09-06 Thread Rafael Rossini
Karl, I´m aware of IndexReader.getTermFreqVector, with this I can get all terms of a document, but I want all terms of a document that matched a query. Grant, >Yes, I think I understand. You want to know what terms from your >query matched in a given document. Yep, that´s what I want. In the co

Re: Extract terms not by reader, but by documents

2007-09-05 Thread Grant Ingersoll
On Sep 5, 2007, at 10:48 AM, Rafael Rossini wrote: Thank´s for the reply Grant, let me try to explain exactly what I´d like to do. Take the 2 docs: Doc1: "Microsoft is a nice software company, and Xbox seems to be a nice product too." Doc2: "Nintendo and Sony have been in the game industr

Re: Extract terms not by reader, but by documents

2007-09-05 Thread Karl Wettin
Rafael, are you looking for IndexReader.getTermFreqVector? -- karl 5 sep 2007 kl. 16.48 skrev Rafael Rossini: Thank´s for the reply Grant, let me try to explain exactly what I´d like to do. Take the 2 docs: Doc1: "Microsoft is a nice software company, and Xbox seems to be a nice product

Re: Extract terms not by reader, but by documents

2007-09-05 Thread Rafael Rossini
Thank´s for the reply Grant, let me try to explain exactly what I´d like to do. Take the 2 docs: Doc1: "Microsoft is a nice software company, and Xbox seems to be a nice product too." Doc2: "Nintendo and Sony have been in the game industry for a long time, but now, Microsoft is trying to enter wit

Re: Extract terms not by reader, but by documents

2007-09-04 Thread Grant Ingersoll
Not sure if I am understanding what you are trying to do. I think you are trying to find out which terms occurred in a particular document, correct? I also am not sure about your first example. My understanding of extractTerms is that it just gives you back the set of all terms that occ

Extract terms not by reader, but by documents

2007-09-04 Thread Rafael Rossini
Hi all, In some custom highlighting, I often write a code like this: Set matchedTerms = new HashSet(); query.rewrite(reader).extractTerms(matchedTerms); With this code the Term Set gets populated by the matched query in your whole index. Is it possible to this with a docume