[ 
https://issues.apache.org/jira/browse/LUCENE-8162?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16355374#comment-16355374
 ] 

Tommaso Teofili edited comment on LUCENE-8162 at 5/9/18 11:49 AM:
------------------------------------------------------------------

the class in Oak is a fork of TMP, but the one in Lucene would extend TMP (see 
[https://gist.github.com/tteofili/f60bd633557b93be106dc8e806d2b8fa]).

the logic uses docs/sec and MB/sec, so you're right that the number of 
_commits_ is not measured.
{quote}So if I index at a high rate but don't commit, the throttling logic can 
still kick in?
{quote}
yes


> Make it possible to throttle (Tiered)MergePolicy when commit rate is high
> -------------------------------------------------------------------------
>
>                 Key: LUCENE-8162
>                 URL: https://issues.apache.org/jira/browse/LUCENE-8162
>             Project: Lucene - Core
>          Issue Type: Improvement
>          Components: core/index
>            Reporter: Tommaso Teofili
>            Priority: Major
>             Fix For: trunk
>
>
> As discussed in a recent mailing list thread [1] and observed in a project 
> using Lucene (see OAK-5192 and OAK-6710), it is sometimes helpful to throttle 
> the aggressiveness of (Tiered)MergePolicy when commit rate is high.
> In the case of Apache Jackrabbit Oak a dedicated {{MergePolicy}} was 
> implemented [2].
> That MP skips merging when the number of segments is below a certain 
> threshold (e.g. 30) and the commit rate (docs per sec and MB per sec) is 
> high (e.g. above 1000 docs/sec, 5 MB/sec).
> In that implementation, the commit rate thresholds adapt to the average 
> commit rate by means of single exponential smoothing.
> The results in that specific case looked encouraging: a 5% perf improvement 
> in querying and ~10% less IO. However, Oak has some specifics which might 
> not fit other scenarios, so it would be interesting to see how this behaves 
> in a plain Lucene scenario.
> [1] : [http://markmail.org/message/re3ifmq2664bqfjk]
> [2] : 
> [https://github.com/apache/jackrabbit-oak/blob/trunk/oak-lucene/src/main/java/org/apache/jackrabbit/oak/plugins/index/lucene/writer/CommitMitigatingTieredMergePolicy.java]
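
The throttling decision described in the issue can be sketched roughly as below. This is a minimal illustration only, not the Oak class or the proposed Lucene patch: the class name {{MergeThrottleSketch}}, its method names, and the smoothing factor {{alpha}} are hypothetical, and treating the rate as "high" when either docs/sec or MB/sec exceeds its threshold is an assumption. The real policy would make this decision inside a {{MergePolicy}} and fall back to the normal tiered merging when it does not skip.

```java
/**
 * Hypothetical sketch (not the Oak or Lucene implementation): decides whether
 * a merge should be skipped, based on segment count and indexing rate.
 */
final class MergeThrottleSketch {
    private final int maxSegmentsForThrottling; // e.g. 30
    private final double alpha;                 // smoothing factor in (0, 1], assumed
    private double docsPerSecThreshold;         // e.g. 1000 docs/sec
    private double mbPerSecThreshold;           // e.g. 5 MB/sec

    MergeThrottleSketch(int maxSegmentsForThrottling, double docsPerSecThreshold,
                        double mbPerSecThreshold, double alpha) {
        this.maxSegmentsForThrottling = maxSegmentsForThrottling;
        this.docsPerSecThreshold = docsPerSecThreshold;
        this.mbPerSecThreshold = mbPerSecThreshold;
        this.alpha = alpha;
    }

    /** Single exponential smoothing: s_t = alpha * x_t + (1 - alpha) * s_(t-1). */
    static double smooth(double previous, double observed, double alpha) {
        return alpha * observed + (1 - alpha) * previous;
    }

    /**
     * Returns true when merging should be skipped: few segments and a high rate.
     * The rates are measured from indexed docs and bytes over time, so no
     * commit is needed for throttling to apply.
     */
    boolean shouldSkipMerge(int segmentCount, double docsPerSec, double mbPerSec) {
        // Assumption: either measure exceeding its threshold counts as "high".
        boolean rateIsHigh = docsPerSec > docsPerSecThreshold
                || mbPerSec > mbPerSecThreshold;
        boolean skip = segmentCount < maxSegmentsForThrottling && rateIsHigh;

        // Adapt both thresholds towards the observed rates.
        docsPerSecThreshold = smooth(docsPerSecThreshold, docsPerSec, alpha);
        mbPerSecThreshold = smooth(mbPerSecThreshold, mbPerSec, alpha);
        return skip;
    }

    double docsPerSecThreshold() { return docsPerSecThreshold; }
    double mbPerSecThreshold() { return mbPerSecThreshold; }
}
```

With thresholds of 30 segments, 1000 docs/sec and 5 MB/sec, a call with 10 segments at 5000 docs/sec would skip the merge, while the same rate with 40 segments would not. Because the thresholds drift towards the observed rates, a sustained high rate eventually stops counting as "high".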


