Re: quick jruby + solr benchmarks

Erik Hatcher Wed, 26 Nov 2008 07:11:19 -0800

On Nov 26, 2008, at 9:54 AM, Matt Mitchell wrote:

Yeah I overlooked all of that. Thanks Erik. So could a better querytest be
an incremental one based on id like:
100.times do |id|
 q = "id:#{id}"
 # query request here...
end

?

Testing is an art form. Depends on what you are testing. Issuingentirely unique queries is not very real-world either, but at least itwill cause the bypassing of query and HTTP caching shortcuts.

Many organizations mine their query logs to get a set ofrepresentative queries to test with, for example.

I think your point is proven - EmbeddedSolrServer itself is fasterthan CommonsHttpSolrServer. But would you deploy that way? Is yourfront-end going to be merged with Solr itself? That may or may not bevery viable, depending on the resources the front-end and Solr needsand how much system resources you have. What about doing loadbalancing? You're then stuck with load balancing your front-end intandem with Solr itself.

Again, it all boils down to what you're after with the benchmarks.And I'm not a benchmarking performance savvy person myself, so I'm notsure where to take it from here. It's an interesting test, for sure,and I'd like to have it reviewed by others that really know theirstuff in this realm and with Solr itself that can elaborate on whythere is such a huge difference in speed. Is it just HTTP andserialize/unserialize overhead? (I tend to doubt that, but don't know)

Would you happen to know why the solr home and data dir never reallychange?Anytime I use commons http or embedded, a "solr" directory iscreated in thesame directory as my script. Even though I'm setting the home anddata dir
in my code?


I don't know at the moment, I'd have to dig deeper.

        Erik

Re: quick jruby + solr benchmarks

Reply via email to