[
https://issues.apache.org/jira/browse/LUCENE-4677?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Michael McCandless updated LUCENE-4677:
---------------------------------------
Attachment: LUCENE-4677.patch
Initial patch ... not committable until I add a back-compat layer somehow ...
(how come TestBackCompat isn't failing...).
I tested Kuromoji's TokenInfo FST, temporarily turning off packing: vInt
encoding made the non-packed FST ~12% smaller (good!). The packed FST is
unchanged in size.
Then I tested on a bigger FST (AnalyzingSuggester build of FreeDB's song
titles) and the resulting FST is nearly the same size (1.0463 GB for trunk and
1.0458 with patch).
> Use vInt to encode node addresses inside FST
> --------------------------------------------
>
> Key: LUCENE-4677
> URL: https://issues.apache.org/jira/browse/LUCENE-4677
> Project: Lucene - Core
> Issue Type: Improvement
> Reporter: Michael McCandless
> Assignee: Michael McCandless
> Fix For: 4.2, 5.0
>
> Attachments: LUCENE-4677.patch
>
>
> Today we use int, but towards enabling > 2.1G sized FSTs, I'd like to make
> this vInt instead.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]