Re: multiple collections indexing

2003-03-21 Thread Morus Walter
Hi, > > Are lots of different combinations of collections used frequently? > Probably not. If only a handful of different subsets of collections are > frequently searched, then QueryFilter could be very useful. > I did some test and thought the results might be interesting for others also. I

Re: multiple collections indexing

2003-03-20 Thread Morus Walter
Tatu Saloranta writes: > On Wednesday 19 March 2003 01:44, Morus Walter wrote: > > This might still be a feasible thing to do, except if number of collections > changes very frequently (as you need to reindex all docs, not just > incremental). > Well the number is slowly growing. > Another po

Re: multiple collections indexing

2003-03-20 Thread Morus Walter
Hi, thanks for all your answers, I think I collect some of the hints and ideas rather than commenting all of them apart. Doug Cutting writes: > Morus Walter wrote: > > Searches must be able on any combination of collections. > > A typical search includes ~ 40 collections. > > > > Now the questio

Re: multiple collections indexing

2003-03-19 Thread Doug Cutting
Morus Walter wrote: Searches must be able on any combination of collections. A typical search includes ~ 40 collections. Now the question is, how to implement this in lucene best. Currently I see basically three possibilities: - create a data field containing the collection name for each document

Re: multiple collections indexing

2003-03-19 Thread Ype Kingma
Morus, On Wednesday 19 March 2003 00:44, Morus Walter wrote: > Hi, > > we are currently evaluating lucene. > > The data we'd like to index consists of ~ 80 collections of documents > (a few hundred up to 20 documents per collection, ~ 1.5 million > documents total; medium document size is in t

Re: multiple collections indexing

2003-03-19 Thread Tatu Saloranta
On Wednesday 19 March 2003 01:44, Morus Walter wrote: ... > Searches must be able on any combination of collections. > A typical search includes ~ 40 collections. > > Now the question is, how to implement this in lucene best. > > Currently I see basically three possibilities: > - create a data fiel

Re: multiple collections indexing

2003-03-19 Thread Vladimir Lukin
Hello Morus, I'd tell, how wildcard query works: 1. First, it runs over the lexcon and collects a list of terms that satisfy the specified pattern. 2. Then it makes a boolean query joining the collected terms with "or". 3. Then the constructed boolean query is used for searching. So is seems

Re: multiple collections indexing

2003-03-19 Thread John L Cwikla
ings I'd like to see addressed -- how lucene can handle partitioned data in a more scalable manner. - Original Message - From: "Morus Walter" <[EMAIL PROTECTED]> To: <[EMAIL PROTECTED]> Sent: Wednesday, March 19, 2003 12:44 AM Subject: multiple collections indexing >

multiple collections indexing

2003-03-19 Thread Morus Walter
Hi, we are currently evaluating lucene. The data we'd like to index consists of ~ 80 collections of documents (a few hundred up to 20 documents per collection, ~ 1.5 million documents total; medium document size is in the order of 1 kB). Searches must be able on any combination of collection