The following page has been changed by paulsundling:
http://wiki.apache.org/solr/SolrPerformanceFactors

------------------------------------------------------------------------------
  Indexing with an [EmbeddedSolr] can be over 50% faster than indexing with XML messages posted over HTTP.

- For example it took 2:10:23 to index 3 million records and optimize, while it took 3:21:36 on the same machine to index using HTTP Post. If you consider that optimize is only one call, then the difference is slightly bigger. The machine for these sample numbers was a 3 GHz Pentium 4 desktop machine.
+ For example, it took 2:10:23 to index 3 million records and optimize, while on the same machine it took 3:21:36 to index using HTTP POST with 10 records/post, or 2:37:17 with 200 records/post. If you consider that optimize is only one call, then the difference is slightly bigger. The machine for these sample numbers was a 3 GHz Pentium 4 desktop machine.
+ However, the tradeoff is that more records per post require a larger memory footprint. As the number of records per post grows, it makes more sense to use one thread for reading records from the database or files and another for posting the XML messages to Solr (this could also be double buffered).
+
+ See [http://java.sun.com/j2se/1.5.0/docs/api/java/util/concurrent/package-summary.html java.util.concurrent javadoc] for more information on threading.
+
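
The two-thread arrangement suggested in the change could look roughly like the sketch below. It is only an illustration and is not taken from the wiki page: the class `BatchedPoster`, the method `run`, and the `poster` callback are hypothetical names, and `poster` merely stands in for whatever code builds the XML add message and sends it to Solr over HTTP. A bounded `ArrayBlockingQueue` from java.util.concurrent hands batches from the reading thread to the posting thread, so memory use stays limited to a few batches. The sketch uses lambdas, so it assumes Java 8 or later rather than the Java 5 of the linked javadoc.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.atomic.AtomicInteger;
import java.util.concurrent.atomic.AtomicReference;
import java.util.function.Consumer;

// Hypothetical helper: one thread reads records, another posts them in batches.
final class BatchedPoster {
    // Sentinel batch (compared by identity) that marks the end of the input.
    private static final List<String> END = new ArrayList<>();

    /**
     * Groups records into batches of batchSize on the calling thread and passes
     * each batch to poster on a second thread. The poster is an assumption that
     * stands in for the HTTP POST to Solr. Returns the number of batches posted.
     */
    static int run(Iterable<String> records, int batchSize, Consumer<List<String>> poster) {
        // Capacity 1: at most one batch is being posted, one is waiting in the
        // queue, and one is being filled by the reader.
        BlockingQueue<List<String>> queue = new ArrayBlockingQueue<>(1);
        AtomicInteger posted = new AtomicInteger();
        AtomicReference<RuntimeException> failure = new AtomicReference<>();

        Thread consumer = new Thread(() -> {
            try {
                for (List<String> b = queue.take(); b != END; b = queue.take()) {
                    if (failure.get() != null) {
                        continue; // keep draining so the reader never blocks forever
                    }
                    try {
                        poster.accept(b);
                        posted.incrementAndGet();
                    } catch (RuntimeException e) {
                        failure.set(e);
                    }
                }
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        });
        consumer.start();

        try {
            List<String> batch = new ArrayList<>(batchSize);
            for (String record : records) {
                batch.add(record);
                if (batch.size() == batchSize) {
                    queue.put(batch); // blocks while the poster is behind
                    batch = new ArrayList<>(batchSize);
                }
            }
            if (!batch.isEmpty()) {
                queue.put(batch); // final partial batch
            }
            queue.put(END);
            consumer.join();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("interrupted while indexing", e);
        }
        if (failure.get() != null) {
            throw failure.get();
        }
        return posted.get();
    }
}
```

With 450 records and a batch size of 200, the poster would be called three times, with batches of 200, 200, and 50 records. Raising the batch size reduces the number of posts but makes each of the in-flight batches larger, which is the memory tradeoff described in the change.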
