All the source code for the nightly Lucene perf tests I run (
http://people.apache.org/~mikemccand/lucenebench/ ) are here:
https://code.google.com/a/apache-extras.org/p/luceneutil/

These are also the scripts I use for A/B performance tests for a new patch.

It's somewhat tricky getting those Python scripts set up to run ...
but I think it'd be a good way to smoke test your new Directory.

The queries are "synthetic"; it's a real problem, not having a real
world, biggish corpus plus real queries, for better performance
testing...

Mike McCandless

http://blog.mikemccandless.com


On Mon, Jan 20, 2014 at 11:22 PM, Scott Schneider
<scott_schnei...@symantec.com> wrote:
> Hello,
>
> Would you folks mind giving me a few tips on performance testing Lucene?  I 
> want to test the performance impact of a Directory subclass.
>
> What is a good testing tool to use?  I don't see a great way to get SolrMeter 
> to run the max # updates/minute and measure throughput that way.  When I set 
> the # updates/minute to a large #, SolrMeter logs NullPointerExceptions.  (I 
> assume these are within SolrMeter, as I don't see errors in Solr.)  Mike 
> McCandless's nightly Lucene performance tests look good, though I've only 
> just started looking at it.
>
> Are there any particularly standard or good test sets?  I'd like to test 3 
> scenarios:  indexing only, querying only, and indexing plus querying.  
> McCandless's indexing test uses wikipedia, which seems great, but he has a 
> slew of tests that are each specific to some querying feature.  I'd like a 
> single, general query test.  It's not hard to come up with a decent set of 
> queries, but I'd really like something representative of real world queries.  
> If there some standard set of commonly used queries, that would be ideal.
>
> Thanks!
>
> Scott
>

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org

Reply via email to