[ 
https://issues.apache.org/jira/browse/LUCENE-2662?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Simon Willnauer updated LUCENE-2662:
------------------------------------

    Attachment: LUCENE-2662.patch

Next iteration - seems to be very close!

I have applied the following changes:

* introduces a AtomicLong to track bytesUsed in DocumetnsWriter, 
TermsHashPerField, ByteRefHash and RecyclingByteBlockAllocator
* Factored out  a BytesStartArray class from BytesRefHash that manages the 
int[] holding the bytesStart offsets. TermsHashPerField subclasses and manages 
the ParallelPostingsArray through it. 
* remove remaining no-commits
* made RecyclingbyteBlockAllocator synced by default (we use synchronized 
methods for it now)

I run a quick Wikipedia 100k docs benchmark against trunk vs. LUCENE-2662 and 
the results are promising.
|version|rec/sec|elapsed sec|avgUsedMem|
|LUCENE-2662|717.30|139.41|536,682,592|
|trunk| 682.66|146.49|546,065,344|

I will run the 10M benchmark once I get back to this.


> BytesHash
> ---------
>
>                 Key: LUCENE-2662
>                 URL: https://issues.apache.org/jira/browse/LUCENE-2662
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: Index
>    Affects Versions: Realtime Branch, 4.0
>            Reporter: Jason Rutherglen
>            Assignee: Simon Willnauer
>            Priority: Minor
>             Fix For: Realtime Branch, 4.0
>
>         Attachments: LUCENE-2662.patch, LUCENE-2662.patch, LUCENE-2662.patch, 
> LUCENE-2662.patch, LUCENE-2662.patch
>
>
> This issue will have the BytesHash separated out from LUCENE-2186

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to