Re: CPU utilization and query time high on Solr slave when snapshot install

Mark Miller Mon, 02 Nov 2009 15:27:22 -0800

Hmm...I think you have to setup warming queries yourself and thatautowarm just copies entries from the old cache to the new cache,rather than issuing queries - the value is how many entries it willcopy. Though that's still going to take CPU and time.


- Mark


http://www.lucidimagination.com (mobile)

On Nov 2, 2009, at 12:47 PM, Walter Underwood <wun...@wunderwood.org>wrote:

If you are going to pull a new index every 10 minutes, try turningoff cache autowarming.
Your caches are never more than 10 minutes old, so spending a minutewarming each new cache is a waste of CPU. Autowarm submits queriesto the new Searcher before putting it in service. This will create aburst of query load on the new Searcher, often keeping one CPUpretty busy for several seconds.
In solrconfig.xml, set autowarmCount to 0.
Also, if you want the slaves to always have an optimized index,create the snapshot only in post-optimize. If you create snapshotsin both post-commit and post-optimize, you are creating a non-optimized index (post-commit), then replacing it with an optimizedone a few minutes later. A slave might get a non-optimized index onetime, then an optimized one the next.
wunder

On Nov 2, 2009, at 1:45 AM, biku...@sapient.com wrote:
Hi Solr Gurus,
We have solr in 1 master, 2 slave configuration. Snapshot iscreated post commit, post optimization. We have autocommit after 50documents or 5 minutes. Snapshot puller runs as a cron every 10minutes. What we have observed is that whenever snapshot isinstalled on the slave, we see solrj client used to query slavesolr, gets timedout and there is high CPU usage/load avg. on slaveserver. If we stop snapshot puller, then slaves work with noissues. The system has been running since 2 months and this issuehas started to occur only now when load on website is increasing.
Following are some details:

Solr Details:
apache-solr Version: 1.3.0
Lucene - 2.4-dev

Master/Slave configurations:

Master:
- for indexing data HTTPRequests are made on Solr server.
- autocommit feature is enabled for 50 docs and 5 minutes
- caching params are disable for this server
- mergeFactor of 10 is set
- we were running optimize script after every 2 hours, but now havereduced the duration to twice a day but issue still persists
Slave1/Slave2:
- standard requestHandler is being used
- default values of caching are set
Machine Specifications:

Master:
- 4GB RAM
- 1GB JVM Heap memory is allocated to Solr

Slave1/Slave2:
- 4GB RAM
- 2GB JVM Heap memory is allocated to Solr
Master and Slave1 (solr1)are on single box and Slave2(solr2) ondifferent box. We use HAProxy to load balance query requestsbetween 2 slaves. Master is only used for indexing.Please let us know if somebody has ever faced similar kind of issueor has some insight into it as we guys are literally struck at themoment with a very unstable production environment.
As a workaround, we have started running optimize on master every 7minutes. This seems to have reduced the severity of the problem butstill issue occurs every 2days now. please suggest what could bethe root cause of this.
Thanks,
Bipul

Re: CPU utilization and query time high on Solr slave when snapshot install

Reply via email to