[
https://issues.apache.org/jira/browse/LUCENE-10122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17443965#comment-17443965
]
Greg Miller commented on LUCENE-10122:
--------------------------------------
Yeah, +1 to moving to doc values. Even if we see a minor taxonomy size growth,
it's a more sensible data structure for this use-case. Taxonomy indices are
generally quite small anyway (compared to the main index), so I'd rather align
the use-case with an appropriate data structure then see if we can optimize it
over time.
> Explore using NumericDocValue to store taxonomy parent array
> ------------------------------------------------------------
>
> Key: LUCENE-10122
> URL: https://issues.apache.org/jira/browse/LUCENE-10122
> Project: Lucene - Core
> Issue Type: Improvement
> Components: modules/facet
> Affects Versions: main (10.0)
> Reporter: Haoyu Zhai
> Priority: Minor
> Time Spent: 2h
> Remaining Estimate: 0h
>
> We currently use term position of a hardcoded term in a hardcoded field to
> represent the parent ordinal of each taxonomy label. That is an old way and
> perhaps could be dated back to the time where doc values didn't exist.
> We probably would want to use NumericDocValues instead given we have spent
> quite a lot of effort optimizing them.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]