[
https://issues.apache.org/jira/browse/LUCENE-4622?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13529988#comment-13529988
]
Gilad Barkai commented on LUCENE-4622:
--------------------------------------
I'm not fond of the post processing...
Today the sort is consistent. Lucene breaks even on doc ids, which order may
not be consistent due to out of order merges. This is not the case with
category ordinals.
If one wishes to post process they should be able to do so quite easy? But as
pointed out, it might not produce the results as intended due to a lot of
categories which scored the same and were left out.
> TopKFacetsResultHandler should tie break sort by label not ord?
> ---------------------------------------------------------------
>
> Key: LUCENE-4622
> URL: https://issues.apache.org/jira/browse/LUCENE-4622
> Project: Lucene - Core
> Issue Type: Bug
> Components: modules/facet
> Reporter: Michael McCandless
>
> EG I now get these facets:
> {noformat}
> Author (5)
> Lisa (2)
> Frank (1)
> Susan (1)
> Bob (1)
> {noformat}
> The primary sort is by count, but secondary is by ord (= order in which they
> were indexed), which is not really understandable/transparent to the end
> user. I think it'd be best if we could do tie-break sort by label ...
> But talking to Shai, this seems hard/costly to fix, because when visiting the
> facet ords to collect the top K, we don't currently resolve to label, and in
> the worst case (say my example had a million labels with count 1) that's a
> lot of extra label lookups ...
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]