[ 
https://issues.apache.org/jira/browse/LUCENE-1539?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12718907#action_12718907
 ] 

Michael McCandless commented on LUCENE-1539:
--------------------------------------------

bq. When existing deletes are over 50%, we loop through termdocs instead.

OK good, except it deletes too aggressively when > 50% deletions are already 
present (using nextBoolean()).  Can you change that to target a certain 
deletion rate?  I.e., if you need to delete 20%, then do random.nextDouble() < 
0.20 to decide each delete?  But then I guess put a floor on that rate so that 
it doesn't get too slow on the "tail"?  It won't be perfectly random when it 
hits that tail, but I think that's OK.
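
To make the suggestion concrete, here is a rough sketch of rate-targeted deletion with a floor.  It is not the patch's code: the class name TargetRateDeleter, the method deleteToTarget, the MIN_RATE value of 0.05, and the use of a plain java.util.BitSet in place of the reader's deleted docs are all assumptions for illustration.

```java
import java.util.BitSet;
import java.util.Random;

public class TargetRateDeleter {
    // Hypothetical floor: the per-document delete probability never drops
    // below this, so the last few deletes do not need many passes.
    static final double MIN_RATE = 0.05;

    /**
     * Marks documents as deleted until targetFraction of maxDoc is deleted.
     * Assumes every bit already set in 'deleted' is below maxDoc.
     * Returns the number of newly deleted documents.
     */
    static int deleteToTarget(BitSet deleted, int maxDoc,
                              double targetFraction, Random random) {
        if (targetFraction < 0.0 || targetFraction > 1.0) {
            throw new IllegalArgumentException("targetFraction must be in [0, 1]");
        }
        int targetCount = (int) Math.round(targetFraction * maxDoc);
        int deletedCount = deleted.cardinality();
        int newlyDeleted = 0;

        // Each pass recomputes the rate still required among live documents.
        while (deletedCount < targetCount) {
            int live = maxDoc - deletedCount;
            int needed = targetCount - deletedCount;
            double rate = Math.max(MIN_RATE, (double) needed / live);

            // Visit only live documents; stop as soon as the target is met.
            for (int doc = deleted.nextClearBit(0);
                 doc < maxDoc && deletedCount < targetCount;
                 doc = deleted.nextClearBit(doc + 1)) {
                if (random.nextDouble() < rate) {
                    deleted.set(doc);
                    deletedCount++;
                    newlyDeleted++;
                }
            }
        }
        return newlyDeleted;
    }
}
```

Because the floor kicks in near the end and each pass stops as soon as the target is reached, lower-numbered documents are slightly favored on the tail, which is the "not perfectly random" tradeoff mentioned above.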


> Improve Benchmark
> -----------------
>
>                 Key: LUCENE-1539
>                 URL: https://issues.apache.org/jira/browse/LUCENE-1539
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: contrib/benchmark
>    Affects Versions: 2.4
>            Reporter: Jason Rutherglen
>            Assignee: Michael McCandless
>            Priority: Minor
>             Fix For: 2.9
>
>         Attachments: LUCENE-1539.patch, LUCENE-1539.patch, LUCENE-1539.patch, 
> LUCENE-1539.patch, LUCENE-1539.patch, LUCENE-1539.patch, sortBench2.py, 
> sortCollate2.py
>
>   Original Estimate: 336h
>  Remaining Estimate: 336h
>
> Benchmark can be improved by incorporating recent suggestions posted
> on java-dev. M. McCandless' Python scripts that execute multiple
> rounds of tests can either be incorporated into the codebase or
> converted to Java.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


