[
https://issues.apache.org/jira/browse/LUCENE-2321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Michael McCandless updated LUCENE-2321:
---------------------------------------
Attachment: LUCENE-2321.patch
First cut -- it has some limitations though:
* Cannot handle termsIndexInterval yet (forces this to 1)
* Loads all term data as a single byte[] -- this may be a problem, eg it
means your total bytes for indexed terms cannot exceed 2.1B; it also may fail
to allocate due to memory fragmentation on 32 bit JREs. So I think we have to
make this paged.
Also found & fixed some small bugs in packed ints.
> use packed ints for the terms dict index
> ----------------------------------------
>
> Key: LUCENE-2321
> URL: https://issues.apache.org/jira/browse/LUCENE-2321
> Project: Lucene - Java
> Issue Type: Improvement
> Components: Index
> Affects Versions: Flex Branch
> Reporter: Michael McCandless
> Assignee: Michael McCandless
> Fix For: Flex Branch
>
> Attachments: LUCENE-2321.patch
>
>
> Terms dict index needs to store large RAM resident arrays of ints, but,
> because their size is bound & variable (depending on the segment/docs), we
> should used packed ints for them.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]