Hi Joel, With approx. 100K doc size, on dual-quad core machine, (3.0Ghz) - Windows platform, we have an average 1000 docs/sec. This includes text extraction from PDF docs.
Hope this helps. Sincerely, Sithu D Sudarsan -----Original Message----- From: Joel Halbert [mailto:[email protected]] Sent: Thursday, September 24, 2009 11:17 AM To: Lucene Users Subject: metrics for index ~100M docs Hi, Does anyone know of any recent metrics & stats on building out an index of ~100mm documents (each doc approx 5k). I'm looking for approx stats on time to build, time to query and infrastructure requirements (number of machines & spec) to reasonably support an index of such a size. Thanks, Joel --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
