How to setup search engine for B2B web app

2010-04-25 Thread Bill Paetzke
*Given:* - 1 database per client (business customer) - 5000 clients - Clients have between 2 to 2000 users (avg is ~100 users/client) - 100k to 10 million records per database - Users need to search those records often (it's the best way to navigate their data) *The Question:*

Re: How to setup search engine for B2B web app

2010-04-25 Thread Shalin Shekhar Mangar
Hi Bill, On Sun, Apr 25, 2010 at 12:23 PM, Bill Paetzke billpaet...@gmail.comwrote: *Given:* - 1 database per client (business customer) - 5000 clients - Clients have between 2 to 2000 users (avg is ~100 users/client) - 100k to 10 million records per database - Users need to

Re: DIH: inner select fails when outter entity is null/empty

2010-04-25 Thread Noble Paul നോബിള്‍ नोब्ळ्
do an onError=skip on the inner entity On Fri, Apr 23, 2010 at 3:56 AM, Otis Gospodnetic otis_gospodne...@yahoo.com wrote: Hello, Here is a newbie DataImportHandler question: Currently, I have entities with entities.  There are some situations where a column value from the outer entity is

local vs cloud

2010-04-25 Thread Dennis Gearon
I'm working on an app that could grow much faster and bigger than I could scale local resources, at least on certain dates and for other reasons. So I'd like to run a local machine in a dedicated host or even virtual machine at a host. If the load goes up, then queries are sent to the cloud at

Re: [spAm] Solr does not honor facet.mincount and field.facet.mincount

2010-04-25 Thread Chris Hostetter
: REQUEST: : http://localhost:8983/solr/select/?q=*%3A*version=2.2rows=0start=0indent=onfacet=truefacet.field=Instrumentfacet.field=Locationfacet.mincount=9 : : RESPONSE: ... : lst name=params ... : str name=facet.minCount9/str ...the REQUST url you listed says

Re: performance of million documents search

2010-04-25 Thread Erick Erickson
NGrams might help here, search the SOLR list for NGram and I think you'll find that this subject has been discussed several times... HTH Erick On Sat, Apr 24, 2010 at 9:26 PM, weiqi wang weiqi...@gmail.com wrote: Hi, I have about 2 million documents in my index. I want to search them by a

hybrid approach to using cloud servers for Solr/Lucene

2010-04-25 Thread Dennis Gearon
I'm working on an app that could grow much faster and bigger than I could scale local resources, at least on certain dates and for other reasons. So I'd like to run a local machine in a dedicated host or even virtual machine at a host. If the load goes up, then queries are sent to the cloud at

RE: Howto build a function query using the 'query' function

2010-04-25 Thread Villemos, Gert
If the 'query' returned a count, yes. But my problem is exactly that as far as I can see from the description of the 'query' function, it does NOT return the count but the score of the search. So my quetion is; How can I write a 'query' function that returns a count, not a score? Cheers,

Re: DIH: inner select fails when outter entity is null/empty

2010-04-25 Thread Otis Gospodnetic
Hi, Thanks for this tip, Paul. But what if this is not an error. Is this what transformers should be used for somehow? Thanks, Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ - Original Message From: Noble

Re: hybrid approach to using cloud servers for Solr/Lucene

2010-04-25 Thread findbestopensource
Hello Dennis If the load goes up, then queries are sent to the cloud at a certain point. My advice is to do load balancing between local and cloud. Your local system seems to be capable as it is a dedicated host. Another option is to do indexing in local and sync it with cloud. Cloud will be

Re: performance of million documents search

2010-04-25 Thread weiqi wang
Hi Erick, It's very useful.Thank you very much 2010/4/26 Erick Erickson erickerick...@gmail.com NGrams might help here, search the SOLR list for NGram and I think you'll find that this subject has been discussed several times... HTH Erick On Sat, Apr 24, 2010 at 9:26 PM, weiqi wang