[ 
https://issues.apache.org/jira/browse/LUCENE-2321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Michael McCandless updated LUCENE-2321:
---------------------------------------

    Attachment: LUCENE-2321.patch

First cut -- it has some limitations though:

  * Cannot handle termsIndexInterval yet (forces this to 1)

  * Loads all term data as a single byte[] -- this may be a problem, e.g. it 
means your total bytes for indexed terms cannot exceed ~2.1 billion (the 
maximum length of a Java array); the allocation may also fail due to memory 
fragmentation on 32-bit JREs.  So I think we have to make this paged.
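
To illustrate what "paged" could mean here, below is a minimal sketch, not the
code from the attached patch. The class name PagedBytesSketch, its methods,
and the pageBits parameter are all hypothetical. It assumes the term bytes are
appended once and then addressed by a long offset, so no single array has to
hold everything and each allocation stays small.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

/** Hypothetical sketch of paged byte storage; not the class used in the patch. */
final class PagedBytesSketch {
    private final int pageBits;   // log2 of the page size
    private final int pageSize;   // bytes per page (a power of two)
    private final int pageMask;   // pageSize - 1, extracts the in-page offset
    private final List<byte[]> pages = new ArrayList<>();
    private byte[] current;       // page currently being filled
    private int upto;             // next free offset within the current page

    PagedBytesSketch(int pageBits) {
        this.pageBits = pageBits;
        this.pageSize = 1 << pageBits;
        this.pageMask = pageSize - 1;
        this.upto = pageSize;     // forces a page allocation on the first append
    }

    /** Appends src, spilling across pages as needed; returns its global start offset. */
    long append(byte[] src) {
        long start = length();
        int srcPos = 0;
        while (srcPos < src.length) {
            if (upto == pageSize) {           // current page is full (or none exists yet)
                current = new byte[pageSize];
                pages.add(current);
                upto = 0;
            }
            int chunk = Math.min(src.length - srcPos, pageSize - upto);
            System.arraycopy(src, srcPos, current, upto, chunk);
            upto += chunk;
            srcPos += chunk;
        }
        return start;
    }

    /** Total number of bytes stored; a long, so it is not capped by the int array limit. */
    long length() {
        return pages.isEmpty() ? 0 : ((long) (pages.size() - 1) << pageBits) + upto;
    }

    /** Reads one byte at a global offset: high bits pick the page, low bits the offset. */
    byte get(long pos) {
        return pages.get((int) (pos >>> pageBits))[(int) (pos & pageMask)];
    }

    /** Copies len bytes starting at a global offset into a new array. */
    byte[] read(long start, int len) {
        byte[] out = new byte[len];
        for (int i = 0; i < len; i++) {
            out[i] = get(start + i);
        }
        return out;
    }

    public static void main(String[] args) {
        PagedBytesSketch bytes = new PagedBytesSketch(15); // 32 KB pages, an arbitrary choice
        long offset = bytes.append("lucene".getBytes(StandardCharsets.UTF_8));
        System.out.println(new String(bytes.read(offset, 6), StandardCharsets.UTF_8));
    }
}
```

The trade-off in this sketch is an extra shift and mask per byte read in
exchange for allocations that are small and independent of the total size.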

Also found & fixed some small bugs in packed ints.

> use packed ints for the terms dict index
> ----------------------------------------
>
>                 Key: LUCENE-2321
>                 URL: https://issues.apache.org/jira/browse/LUCENE-2321
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: Index
>    Affects Versions: Flex Branch
>            Reporter: Michael McCandless
>            Assignee: Michael McCandless
>             Fix For: Flex Branch
>
>         Attachments: LUCENE-2321.patch
>
>
> Terms dict index needs to store large RAM-resident arrays of ints, but, 
> because their size is bounded & variable (depending on the segment/docs), we 
> should use packed ints for them.
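
For readers unfamiliar with the idea, here is a minimal sketch of packed ints,
assuming non-negative values whose maximum is known up front. The class
PackedIntsSketch and its methods are hypothetical and are not Lucene's packed
ints implementation. Each value is stored in exactly bitsPerValue bits inside
a long[] instead of a full 32 or 64 bits.

```java
/** Hypothetical sketch of a fixed-width packed int array; not Lucene's implementation. */
final class PackedIntsSketch {
    private final long[] blocks;     // backing storage, 64 bits per block
    private final int bitsPerValue;  // width of every stored value
    private final long mask;         // low bitsPerValue bits set
    private final int size;          // number of values

    PackedIntsSketch(int size, int bitsPerValue) {
        if (size < 0 || bitsPerValue < 1 || bitsPerValue > 64) {
            throw new IllegalArgumentException("bad size or bitsPerValue");
        }
        this.size = size;
        this.bitsPerValue = bitsPerValue;
        this.mask = bitsPerValue == 64 ? -1L : (1L << bitsPerValue) - 1;
        long totalBits = (long) size * bitsPerValue;
        this.blocks = new long[(int) ((totalBits + 63) >>> 6)];
    }

    /** Smallest width that can hold every value in [0, maxValue]. */
    static int bitsRequired(long maxValue) {
        return Math.max(1, 64 - Long.numberOfLeadingZeros(maxValue));
    }

    void set(int index, long value) {
        if (index < 0 || index >= size) throw new IndexOutOfBoundsException();
        value &= mask;
        long bitPos = (long) index * bitsPerValue;
        int block = (int) (bitPos >>> 6);
        int shift = (int) (bitPos & 63);
        // Clear the target bits in the first block, then write the low part of the value.
        blocks[block] = (blocks[block] & ~(mask << shift)) | (value << shift);
        int spill = shift + bitsPerValue - 64;  // bits that overflow into the next block
        if (spill > 0) {
            int lowBits = bitsPerValue - spill; // bits that fit in the first block
            blocks[block + 1] = (blocks[block + 1] & ~(mask >>> lowBits)) | (value >>> lowBits);
        }
    }

    long get(int index) {
        if (index < 0 || index >= size) throw new IndexOutOfBoundsException();
        long bitPos = (long) index * bitsPerValue;
        int block = (int) (bitPos >>> 6);
        int shift = (int) (bitPos & 63);
        long value = blocks[block] >>> shift;
        if (shift + bitsPerValue > 64) {        // value straddles two blocks
            value |= blocks[block + 1] << (64 - shift);
        }
        return value & mask;
    }

    /** Bytes used by the backing array, ignoring object overhead. */
    long ramBytes() {
        return 8L * blocks.length;
    }

    public static void main(String[] args) {
        int bits = bitsRequired(1000);          // values in [0, 1000]
        PackedIntsSketch ints = new PackedIntsSketch(1_000_000, bits);
        ints.set(42, 1000);
        System.out.println(bits + " bits/value, " + ints.ramBytes() + " bytes, get(42)=" + ints.get(42));
    }
}
```

In this sketch the saving comes from choosing bitsPerValue per array, which
matches the point above that the bound varies with the segment.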

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-dev-h...@lucene.apache.org
