Re: Speeding up indexing

2012-02-28 Thread Erik Hatcher
30 million - that's feasible on a single (beefy) Solr server but whether it's advisable to go distributed or not depends on other factors, like query speed issues you may have with that many docs in a single server, expected collection growth, and so on. As for your questions further below

Re: Speeding up indexing

2012-02-27 Thread Memory Makers
A quick add on to this -- we have over 30 million documents. I take it that we should be looking @ Distributed Solr? as in http://www.lucidimagination.com/content/scaling-lucene-and-solr#d0e344 Thanks. On Mon, Feb 27, 2012 at 2:33 PM, Memory Makers wrote: > Many thanks for the response. > > H

Re: Speeding up indexing

2012-02-27 Thread Memory Makers
Many thanks for the response. Here is the revised questions: For example if I have N processes that are producing documents to index: 1. Should I have them simultaneously submit documents to Solr (will this improve the indexing throughput)? 2. Is there anything I can do Solr configuration wise th

Re: Speeding up indexing

2012-02-27 Thread Mikhail Khludnev
My two cents: - pulling is better than pushing - http://wiki.apache.org/solr/Solrj#Streaming_documents_for_an_update - DIH is not thread safe https://issues.apache.org/jira/browse/SOLR-3011 But there are few patches for trunk which fix it. Regards On Mon, Feb 27, 2012 at 10:46 PM, Erik Hatcher

Re: Speeding up indexing

2012-02-27 Thread Erik Hatcher
Yes, absolutely. Parallelizing indexing can make a huge difference. How you do so will depend on your indexing environment. Most crudely, running multiple indexing scripts on different subsets of data up to the the limitations of your operating system and hardware is how many do it. SolrJ h

Re: Tips on speeding up indexing needed...

2009-10-13 Thread William Pierce
sday, October 13, 2009 11:07 AM To: Subject: Re: Tips on speeding up indexing needed... : References: <4acb30d2.2010...@umich.edu> : <69de18140910070109m27e50d2sc82a7c7bdd683...@mail.gmail.com> : <4acc95a3.5000...@umich.edu> : : <4acfc943.4040...@umich.e

Re: Tips on speeding up indexing needed...

2009-10-13 Thread Chris Hostetter
: References: <4acb30d2.2010...@umich.edu> : <69de18140910070109m27e50d2sc82a7c7bdd683...@mail.gmail.com> : <4acc95a3.5000...@umich.edu> : : <4acfc943.4040...@umich.edu> : In-Reply-To: <4acfc943.4040...@umich.edu> : Subject: Tips on speedi

Re: Tips on speeding up indexing needed...

2009-10-11 Thread William Pierce
Thanks, Lance. I already commit at the end. I will take a look at the data import handler. Thanks again! -- Bill -- From: "Lance Norskog" Sent: Saturday, October 10, 2009 7:58 PM To: Subject: Re: Tips on speeding up indexing nee

Re: Tips on speeding up indexing needed...

2009-10-10 Thread Lance Norskog
ur help! > > -- Bill > > -- > From: "William Pierce" > Sent: Saturday, October 10, 2009 5:47 PM > To: > Subject: Tips on speeding up indexing needed... > >> Folks: >> >> I have a corpus of approx 6 M

Re: Tips on speeding up indexing needed...

2009-10-10 Thread William Pierce
y, October 10, 2009 5:47 PM To: Subject: Tips on speeding up indexing needed... Folks: I have a corpus of approx 6 M documents each of approx 4K bytes. Currently, the way indexing is set up I read documents from a database and issue solr post requests in batches (batches are set up s

Tips on speeding up indexing needed...

2009-10-10 Thread William Pierce
Folks: I have a corpus of approx 6 M documents each of approx 4K bytes. Currently, the way indexing is set up I read documents from a database and issue solr post requests in batches (batches are set up so that the maxPostSize of tomcat which is set to 2MB is adhered to). This means that in

Re: speeding up indexing with a LOT of indexed fields

2009-03-25 Thread Britske
-- >> From: Britske >> To: solr-user@lucene.apache.org >> Sent: Wednesday, March 25, 2009 10:05:17 AM >> Subject: speeding up indexing with a LOT of indexed fields >> >> >> hi, >> >> I'm having difficulty indexing a collection of documents

Re: speeding up indexing with a LOT of indexed fields

2009-03-25 Thread Otis Gospodnetic
- Original Message > From: Britske > To: solr-user@lucene.apache.org > Sent: Wednesday, March 25, 2009 10:05:17 AM > Subject: speeding up indexing with a LOT of indexed fields > > > hi, > > I'm having difficulty indexing a collection of documents in a reasonable

speeding up indexing with a LOT of indexed fields

2009-03-25 Thread Britske
- lastly: should I be able to get more out of this box or am I just complaining ;-) Thanks for making it to here, and hoping to receive some valuable info, Cheers, Britske -- View this message in context: http://www.nabble.com/speeding-up-indexing-with-a-LOT-of-indexed-fields-tp22702364p22702364.html Sent from the Solr - User mailing list archive at Nabble.com.