Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Solr Wiki" for change 
notification.

The following page has been changed by paulsundling:
http://wiki.apache.org/solr/SolrPerformanceFactors

------------------------------------------------------------------------------
  
  Using an [EmbeddedSolr] for indexing can be over 50% faster than one using 
XML messages that are posted.  
  
- For example it took 2:10:23 to index 3 million records and optimize, while it 
took 3:21:36 on the same machine to index using HTTP Post.  If you consider 
that optimize is only one call, then the difference is slightly bigger.  The 
machine for these sample numbers was a 3Ghz Pentium 4 desktop machine.
+ For example it took 2:10:23 to index 3 million records and optimize, while it 
took 3:21:36 on the same machine to index using HTTP Post with 10 records/post 
or 2:37:17 with 200 records/post.  If you consider that optimize is only one 
call, then the difference is slightly bigger.  The machine for these sample 
numbers was a 3Ghz Pentium 4 desktop machine.
  
+ However the tradeoff is larger records/post requires greater memory 
footprint.  As the records/post becomes higher it makes more sense to have 
separate threads for getting records from database/files and another for 
posting the XML messages to Solr (could also double buffer).  
+ 
+ See 
[http://java.sun.com/j2se/1.5.0/docs/api/java/util/concurrent/package-summary.html
 java.util.concurrency javadoc] for more information on threading.
+ 

Reply via email to