Thanks for showing I wasn't completely crazy to think this made sense, Mike.

I added:
https://issues.apache.org/jira/browse/LUCENE-5134
https://issues.apache.org/jira/browse/LUCENE-5135

Otis



On Mon, Jul 15, 2013 at 1:28 PM, Michael McCandless
<luc...@mikemccandless.com> wrote:
> Lookback is a good idea: you could at least gather statistics and
> assess, later, whether good merges had been selected, and maybe play
> "what if" games to explore if different merge selections would have
> resulted in less copying.
>
> A time-based MergeScheduler would make sense: e.g., it would allow
> small merges to run any time, but big ones must wait until "after
> hours".
>
> Also, RateLimitedDirWrapper can be used to limit IO impact of ongoing
> merges.  It's like a naive ionice, for merging.
>
> Mike McCandless
>
> http://blog.mikemccandless.com
>
>
> On Mon, Jul 8, 2013 at 10:41 PM, Otis Gospodnetic
> <otis.gospodne...@gmail.com> wrote:
>> Hi,
>>
>> I was (re-re-re-re)-reading Mike's post about Lucene segment merges -
>> http://blog.mikemccandless.com/2011/02/visualizing-lucenes-segment-merges.html
>>
>> Mike mentioned lookhead as something that could possibly yield more
>> optimal merges.
>>
>> But what about lookback? :)
>>
>> What if some sort of stats were kept about about which segments were
>> picked for merges?  With some sort of stats in hand, could one look
>> back and, knowing what happened after those merges, evaluate if more
>> optimal merge choices could have been made and then use that "next
>> time"?
>>
>> Also, what about time of day and query rates?  Very often search
>> traffic follows the wave pattern, which could mean that more
>> aggressive merging could be done during periods with lower query
>> rates... or maybe during that time more segments could be allowed to
>> live in the index, assuming that after allowing that for some time,
>> the subsequent merge could be bigger/more thorough, so to speak.
>>
>> Thoughts?
>>
>> Otis
>> --
>> Solr & ElasticSearch Support -- http://sematext.com/
>> Performance Monitoring -- http://sematext.com/spm

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to