[
https://issues.apache.org/jira/browse/LUCENE-7590?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15748192#comment-15748192
]
Shai Erera commented on LUCENE-7590:
------------------------------------
There are now few tasks left:
* Add more statistics, such as {{sum}} and {{stdev}} (for numeric fields).
Should we care about overflow, or only document it?
* We can also compute more stats like what Solr gives in [Stats
Component|https://cwiki.apache.org/confluence/display/solr/The+Stats+Component#TheStatsComponent-StatisticsSupported].
What do you think?
* Add stats for {{SortedDocValues}}. This should be fairly straightforward by
comparing the {{BytesRef}} of all matching documents. But I don't think we
should have a {{mean}} stat for it? Likewise for {{SortedSetDocValues}}.
* What should we do with {{SortedNumericDocValues}}? {{min}} and {{max}} are
well defined, but what about {{mean}}? Should it be across all values?
I intend to close this issue and handle the rest in follow-on issues, unless
you think otherwise. Also, would appreciate your feedback on the above points.
> Add DocValues statistics helpers
> --------------------------------
>
> Key: LUCENE-7590
> URL: https://issues.apache.org/jira/browse/LUCENE-7590
> Project: Lucene - Core
> Issue Type: New Feature
> Components: modules/misc
> Reporter: Shai Erera
> Assignee: Shai Erera
> Attachments: LUCENE-7590.patch, LUCENE-7590.patch, LUCENE-7590.patch,
> LUCENE-7590.patch, LUCENE-7590.patch, LUCENE-7590.patch, LUCENE-7590.patch
>
>
> I think it can be useful to have DocValues statistics helpers, that can allow
> users to query for the min/max/avg etc. stats of a DV field. In this issue
> I'd like to cover numeric DV, but there's no reason not to add it to other DV
> types too.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]