Re: performance of million documents search

2010-04-25 Thread weiqi wang
Hi Erick, It's very useful.Thank you very much 2010/4/26 Erick Erickson > NGrams might help here, search the SOLR list for "NGram" > and I think you'll find that this subject has been discussed > several times... > > HTH > Erick > > On Sat, Apr 24, 2010 at 9:26 PM, weiqi wang wrote: > > > Hi,

Re: hybrid approach to using cloud servers for Solr/Lucene

2010-04-25 Thread findbestopensource
Hello Dennis >>If the load goes up, then queries are sent to the cloud at a certain point. My advice is to do load balancing between local and cloud. Your local system seems to be capable as it is a dedicated host. Another option is to do indexing in local and sync it with cloud. Cloud will be on

Re: DIH: inner select fails when outter entity is null/empty

2010-04-25 Thread Otis Gospodnetic
Hi, Thanks for this tip, Paul. But what if this is not an error. Is this what transformers should be used for somehow? Thanks, Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ - Original Message > From: Noble P

Re: hybrid approach to using cloud servers for Solr/Lucene

2010-04-25 Thread Otis Gospodnetic
Hi, Hm. Everything is doable, but this sounds a bit undefined and possibly messy. If flexibility is of such importance, why have the "local" part at all? Why not have everything in an elastic cloud environment? Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosys

RE: Howto build a function query using the 'query' function

2010-04-25 Thread Villemos, Gert
If the 'query' returned a count, yes. But my problem is exactly that as far as I can see from the description of the 'query' function, it does NOT return the count but the score of the search. So my quetion is; How can I write a 'query' function that returns a count, not a score? Cheers, Ge

hybrid approach to using cloud servers for Solr/Lucene

2010-04-25 Thread Dennis Gearon
I'm working on an app that could grow much faster and bigger than I could scale local resources, at least on certain dates and for other reasons. So I'd like to run a local machine in a dedicated host or even virtual machine at a host. If the load goes up, then queries are sent to the cloud at

Re: performance of million documents search

2010-04-25 Thread Erick Erickson
NGrams might help here, search the SOLR list for "NGram" and I think you'll find that this subject has been discussed several times... HTH Erick On Sat, Apr 24, 2010 at 9:26 PM, weiqi wang wrote: > Hi, > > I have about 2 million documents in my index. I want to search them by a > string field.

Re: [spAm] Solr does not honor facet.mincount and field.facet.mincount

2010-04-25 Thread Chris Hostetter
: REQUEST: : http://localhost:8983/solr/select/?q=*%3A*&version=2.2&rows=0&start=0&indent=on&facet=true&facet.field=Instrument&facet.field=Location&facet.mincount=9 : : RESPONSE: ... : ... : 9 ...the REQUST url you listed says facet.mincount, but the response from Solr disa

local vs cloud

2010-04-25 Thread Dennis Gearon
I'm working on an app that could grow much faster and bigger than I could scale local resources, at least on certain dates and for other reasons. So I'd like to run a local machine in a dedicated host or even virtual machine at a host. If the load goes up, then queries are sent to the cloud at

Re: DIH: inner select fails when outter entity is null/empty

2010-04-25 Thread Noble Paul നോബിള്‍ नोब्ळ्
do an onError="skip" on the inner entity On Fri, Apr 23, 2010 at 3:56 AM, Otis Gospodnetic wrote: > Hello, > > Here is a newbie DataImportHandler question: > > Currently, I have entities with entities.  There are some > situations where a column value from the outer entity is null, and when I try

Re: How to setup search engine for B2B web app

2010-04-25 Thread Shalin Shekhar Mangar
Hi Bill, On Sun, Apr 25, 2010 at 12:23 PM, Bill Paetzke wrote: > *Given:* > > - 1 database per client (business customer) > - 5000 clients > - Clients have between 2 to 2000 users (avg is ~100 users/client) > - 100k to 10 million records per database > - Users need to search those recor