Re: Retrieving Top Terms for a subset of the index (or for all results of a query)

Grant Ingersoll Sun, 12 Oct 2008 08:39:56 -0700

How large of a subset are you talking?

You might look at the FitleredTermEnum class, but you will probablyhave to do some work on it to extend it to what you want

If you are talking a smallish subset (say, at most a couple hundreddocs), then you could store Term Vectors and use the TermVectorMapper,I suspect.



HTH,
Grant


On Oct 11, 2008, at 6:36 AM, Aleksander M. Stensby wrote:

Hello everyone. I've been fiddeling with the idea of retrieving thetop terms from a subset of the index (i.e. top terms from thedocuments retrieved by a given search). This could for instance beuseful to identify top ranking terms in a given datespan etc.
It would be something like getting the top 50 terms (like you can dowith luke) but instead of doing it for the full index, I would liketo do the same procedure after applying a filter or a query. Don'tknow if this is a bad explaination or wheter it makes any sense atall...
So, I really want to avoid iterating over all results (obviously),so my question is really if there is a prefered approach for doingsuch analysis / has this been done in a good way before?
Thanks for any help!

Best regards,
Aleksander

--
Aleksander M. Stensby
Senior Software Developer
Integrasco A/S
+47 41 22 82 72
[EMAIL PROTECTED]

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


--------------------------
Grant Ingersoll
Lucene Boot Camp Training Nov. 3-4, 2008, ApacheCon US New Orleans.
http://www.lucenebootcamp.com


Lucene Helpful Hints:
http://wiki.apache.org/lucene-java/BasicsOfPerformance
http://wiki.apache.org/lucene-java/LuceneFAQ










---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: Retrieving Top Terms for a subset of the index (or for all results of a query)

Reply via email to