All the source code for the nightly Lucene perf tests I run ( http://people.apache.org/~mikemccand/lucenebench/ ) are here: https://code.google.com/a/apache-extras.org/p/luceneutil/
These are also the scripts I use for A/B performance tests for a new patch. It's somewhat tricky getting those Python scripts set up to run ... but I think it'd be a good way to smoke test your new Directory. The queries are "synthetic"; it's a real problem, not having a real world, biggish corpus plus real queries, for better performance testing... Mike McCandless http://blog.mikemccandless.com On Mon, Jan 20, 2014 at 11:22 PM, Scott Schneider <scott_schnei...@symantec.com> wrote: > Hello, > > Would you folks mind giving me a few tips on performance testing Lucene? I > want to test the performance impact of a Directory subclass. > > What is a good testing tool to use? I don't see a great way to get SolrMeter > to run the max # updates/minute and measure throughput that way. When I set > the # updates/minute to a large #, SolrMeter logs NullPointerExceptions. (I > assume these are within SolrMeter, as I don't see errors in Solr.) Mike > McCandless's nightly Lucene performance tests look good, though I've only > just started looking at it. > > Are there any particularly standard or good test sets? I'd like to test 3 > scenarios: indexing only, querying only, and indexing plus querying. > McCandless's indexing test uses wikipedia, which seems great, but he has a > slew of tests that are each specific to some querying feature. I'd like a > single, general query test. It's not hard to come up with a decent set of > queries, but I'd really like something representative of real world queries. > If there some standard set of commonly used queries, that would be ideal. > > Thanks! > > Scott > --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org