Re: solr4 optimization

2014-06-10 Thread Vineet Mishra
As Otis mentioned, it's obviously good to run an optimize once in a while,
or once you are done with most of your heavy indexing. The concern is not
disk capacity but I/O and seeking across segments: with fewer segments to
query, there is less I/O, and so your query response will be quicker.
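
For anyone who wants to try this, here is a minimal sketch of how an optimize request could be issued over HTTP. It assumes a Solr 4 node at http://localhost:8983/solr and a collection named collection1; both are placeholders, not details from this thread. The helper names build_optimize_url and run_optimize are hypothetical. The request parameters optimize, maxSegments and waitSearcher are those accepted by Solr's update handler.

```python
from urllib.parse import urlencode
from urllib.request import urlopen


def build_optimize_url(base_url, collection, max_segments=1, wait_searcher=True):
    """Build the update-handler URL that asks Solr to optimize an index.

    base_url and collection are placeholders (assumptions), e.g.
    "http://localhost:8983/solr" and "collection1".
    """
    params = urlencode({
        "optimize": "true",
        "maxSegments": max_segments,  # merge down to at most this many segments
        "waitSearcher": "true" if wait_searcher else "false",
    })
    return f"{base_url.rstrip('/')}/{collection}/update?{params}"


def run_optimize(base_url, collection, max_segments=1, timeout=3600):
    """Send the optimize request and return the HTTP status code.

    An optimize can run for a long time on a large index, hence the
    generous (and arbitrary) default timeout.
    """
    url = build_optimize_url(base_url, collection, max_segments)
    with urlopen(url, timeout=timeout) as resp:
        return resp.status


if __name__ == "__main__":
    # Only prints the URL; call run_optimize() against a real node to execute it.
    print(build_optimize_url("http://localhost:8983/solr", "collection1"))
```

Because an optimize rewrites the index, it is probably best tried in a quiet period, with query latency measured before and after.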

Give it a go and report back with the stats.

Cheers!


On Tue, Jun 10, 2014 at 1:54 AM, Otis Gospodnetic 
otis.gospodne...@gmail.com wrote:

 Hi,

 I don't remember the last time I ran optimize.  Sure, yes, things will work
 faster if you optimize an index and reduce the number of segments, but if
 you are regularly writing to that index and performance is OK, leave it to
 Lucene segment merges to purge deletes.

 Otis
 --
 Performance Monitoring * Log Analytics * Search Analytics
 Solr & Elasticsearch Support * http://sematext.com/


 On Mon, Jun 9, 2014 at 4:15 PM, Joshi, Shital shital.jo...@gs.com wrote:

   Hi,
 
  We have a SolrCloud cluster (5 shards and 2 replicas) on 10 boxes. On some
  of the boxes we have about 5 million deleted docs, and we have never run an
  optimize since the beginning. Does the number of deleted docs have anything
  to do with query performance? Should we consider optimizing at all if
  we're not worried about disk space?
 
  Thanks!
 
 
 



Re: solr4 optimization

2014-06-09 Thread Otis Gospodnetic
Hi,

I don't remember the last time I ran optimize.  Sure, yes, things will work
faster if you optimize an index and reduce the number of segments, but if
you are regularly writing to that index and performance is OK, leave it to
Lucene segment merges to purge deletes.
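
To judge whether normal merging is keeping up, one rough check is the share of deleted docs still sitting in the index. The sketch below is only an illustration: it derives that share from the numDocs and maxDoc values in the "index" section of Solr's Luke request handler response. The core URL and the helper names (deleted_doc_stats, fetch_index_info) are assumptions, and the 25 million maxDoc figure in the example is made up; only the 5 million deleted docs comes from this thread.

```python
import json
from urllib.request import urlopen


def deleted_doc_stats(index_info):
    """Return (deleted_docs, deleted_ratio) from a Luke "index" section.

    maxDoc counts every doc slot, including deleted docs that have not
    been merged away yet; numDocs counts only live docs.
    """
    max_doc = index_info["maxDoc"]
    num_docs = index_info["numDocs"]
    deleted = max_doc - num_docs
    ratio = deleted / max_doc if max_doc else 0.0
    return deleted, ratio


def fetch_index_info(core_url, timeout=30):
    """Fetch the "index" section from a core's Luke handler.

    core_url is a placeholder (assumption), e.g.
    "http://localhost:8983/solr/collection1".
    """
    url = f"{core_url.rstrip('/')}/admin/luke?numTerms=0&wt=json"
    with urlopen(url, timeout=timeout) as resp:
        return json.load(resp)["index"]


if __name__ == "__main__":
    # Hypothetical numbers: 5 million deleted docs out of 25 million slots.
    deleted, ratio = deleted_doc_stats({"maxDoc": 25_000_000, "numDocs": 20_000_000})
    print(f"{deleted} deleted docs, {ratio:.0%} of the index")
```

If the share stays small and stable while indexing continues, segment merges are already purging deletes and an explicit optimize is unlikely to buy much.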

Otis
--
Performance Monitoring * Log Analytics * Search Analytics
Solr & Elasticsearch Support * http://sematext.com/


On Mon, Jun 9, 2014 at 4:15 PM, Joshi, Shital shital.jo...@gs.com wrote:

  Hi,

 We have a SolrCloud cluster (5 shards and 2 replicas) on 10 boxes. On some
 of the boxes we have about 5 million deleted docs, and we have never run an
 optimize since the beginning. Does the number of deleted docs have anything
 to do with query performance? Should we consider optimizing at all if
 we're not worried about disk space?

 Thanks!