[
https://issues.apache.org/jira/browse/LUCENE-5819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Michael McCandless updated LUCENE-5819:
---------------------------------------
Attachment: LUCENE-5819.patch
Patch.
I put the new terms dict in lucene/codecs, tied it into MockRandomPF,
and improved BasePostingsFormatTestCase to test terms ords when they
are supported. Tests seem to be passing, even test-core with
-Dtests.pf=OrdsLucene41.
I also made a change to CheckIndex, which may be controversial, to
have a "fail fast" option so it just throws the first exception it
hits. I found this *really* useful when debugging because I could
immediately see the exception causing a failure vs. the default
behavior of CheckIndex to keep trying on the next segment. If people
disagree w/ this I can separate it out / revert it..
Also I realized no classes in the new IDVPF are in fact public
(woops!); I'll commit that separately.
> Add block tree postings format that supports term ords
> ------------------------------------------------------
>
> Key: LUCENE-5819
> URL: https://issues.apache.org/jira/browse/LUCENE-5819
> Project: Lucene - Core
> Issue Type: Improvement
> Components: modules/other
> Reporter: Michael McCandless
> Assignee: Michael McCandless
> Fix For: 5.0, 4.10
>
> Attachments: LUCENE-5819.patch
>
>
> BlockTree is our default terms dictionary today, but it doesn't
> support term ords, which is an optional API in the postings format to
> retrieve the ordinal for the currently seek'd term, and also later
> seek by that ordinal e.g. to lookup the term.
> This can possibly be useful for e.g. faceting, and maybe at some point
> we can share the postings terms dict with the one used by sorted/set
> DV for cases when app wants to invert and facet on a given field.
> The older (3.x) block terms dict can easily support ords, and we have
> a Lucene41OrdsPF in test-framework, but it's not as fast / compact as
> block-tree, and doesn't (can't easily) implement an optimized
> intersect, but it could be for fields we'd want to facet on, these
> tradeoffs don't matter. It's nice to have options...
--
This message was sent by Atlassian JIRA
(v6.2#6252)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]