Re: topdocs per facet

2014-10-09 Thread Shai Erera
The facets translation should be done at the application level. So if you index the dimension A w/ two facets A/A1 and A/A2, where A1 should also be translated to B1 and A2 translated to B2, there are several options: Index the dimensions A and B with their respective facets, and count the relevan

Re: TopDocs - Get all docs?

2008-12-17 Thread Michael McCandless
It might be faster to use FieldCache.DEFAULT.getStrings(reader, "empid"), assuming empid is indexed but is not analyzed (or always analyzes to one token). Though, that then persists the resulting array in the FieldCache. We are wanting to create "column stride fields" (LUCENE-1231) to make

Re: TopDocs - Get all docs?

2008-12-17 Thread Donna L Gresh
Thanks- Yes in my use-case there are never any deleted documents when the search is run- (deletion takes place in a pre-processing stage) Toke Eskildsen wrote on 12/17/2008 08:16:31 AM: > On Mon, 2008-12-08 at 15:17 +0100, Donna L Gresh wrote: > > public Vector getIndexIds() throws Exce

Re: TopDocs - Get all docs?

2008-12-17 Thread Toke Eskildsen
On Mon, 2008-12-08 at 15:17 +0100, Donna L Gresh wrote: > public Vector getIndexIds() throws Exception { > > Vector vec = new Vector(); > IndexReader ireader = IndexReader.open(directoryName); > int numdocs = ireader.numDocs(); >

Re: TopDocs - Get all docs?

2008-12-08 Thread Erick Erickson
I'm a great fan of not changing working code for a "might be better sometime in the far future if lots of things change" ... Erick On Mon, Dec 8, 2008 at 10:54 AM, Donna L Gresh <[EMAIL PROTECTED]> wrote: > Erick- > Thanks for the pointer; in my app the difference is between 30 > milliseconds an

Re: TopDocs - Get all docs?

2008-12-08 Thread Donna L Gresh
Erick- Thanks for the pointer; in my app the difference is between 30 milliseconds and 45 milliseconds (and this is a once-a-day kind of thing), but hey it's always worth doing something the better way in case my index ever gets a whole lot bigger or the use case changes-- thanks. Donna L. Gres

Re: TopDocs - Get all docs?

2008-12-08 Thread Erick Erickson
is empid indexed? If it is this should run *much* faster if you used TermEnum/TermDocs to fetch all the empids.. FWIW Erick On Mon, Dec 8, 2008 at 9:17 AM, Donna L Gresh <[EMAIL PROTECTED]> wrote: > I have a need to get the list of all "empid"s (defined by me) in the index > so that I can re

Re: TopDocs - Get all docs?

2008-12-08 Thread Donna L Gresh
I have a need to get the list of all "empid"s (defined by me) in the index so that I can remove the ones that are "stale" by my definition; in this snippet I'm returning all the "empids" for later processing, but the core is very simple. public Vector getIndexIds() throws Exception {

Re: TopDocs - Get all docs?

2008-12-05 Thread Grant Ingersoll
See the MatchAllDocsQuery in the search package. -Grant On Dec 5, 2008, at 5:57 PM, Ian Vink wrote: Is there an easy way to get all the documents in the index? Kinda like this: TopDocs everything = ???.GetAllDocuments(); -- Grant Ingersoll Lucene Helpful Hints: http

Re: TopDocs

2008-12-04 Thread John Wang
searcher.doc(scoreDoc.doc); On Thu, Dec 4, 2008 at 6:59 PM, Ian Vink <[EMAIL PROTECTED]> wrote: > I have this search which returns TopDocs > TopDocs topDocs = searcher.Search(query, bookFilter, maxDocsToFind); > > > How do I get the document object for the ScoreDoc? > > foreach (ScoreDoc scoreDo

Re: TopDocs question

2008-09-15 Thread Cam Bazz
Otis > -- > Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch > > > > - Original Message >> From: Cam Bazz <[EMAIL PROTECTED]> >> To: java-user@lucene.apache.org >> Sent: Monday, September 15, 2008 11:25:39 PM >> Subject: Re: TopDocs

Re: TopDocs question

2008-09-15 Thread Otis Gospodnetic
ent: Monday, September 15, 2008 11:25:39 PM > Subject: Re: TopDocs question > > Yes, I looked into implementing a custom collector that would return > number of hits, but - I could not. > > collect() can not access anything that is final, and final can not be > incremented.

Re: TopDocs question

2008-09-15 Thread Cam Bazz
Yes, I looked into implementing a custom collector that would return number of hits, but - I could not. collect() can not access anything that is final, and final can not be incremented. Any ideas? Best. On Tue, Sep 16, 2008 at 6:05 AM, Daniel Noll <[EMAIL PROTECTED]> wrote: > Cam Bazz wrote: >

Re: TopDocs question

2008-09-15 Thread Daniel Noll
Cam Bazz wrote: Hello, Could it harm if I make a searcher.search(query, Integer.MAX_VALUE) ? I just need to make a query to get the number of hits in this case, but I dont know what the max hits will be. PriorityQueue will attempt to allocate an array of that size. But if you only need to k