Michael McCandless created LUCENE-4602:
------------------------------------------

             Summary: Use DocValues to store per-doc facet ord
                 Key: LUCENE-4602
                 URL: https://issues.apache.org/jira/browse/LUCENE-4602
             Project: Lucene - Core
          Issue Type: Improvement
            Reporter: Michael McCandless


Spinoff from LUCENE-4600

DocValues can be used to hold the byte[] encoding all facet ords for
the document, instead of payloads.  I made a hacked up approximation
of in-RAM DV (see CachedCountingFacetsCollector in the patch) and the
gains were somewhat surprisingly large:

{noformat}
                    Task    QPS base      StdDev    QPS comp      StdDev        
        Pct diff
                HighTerm        0.53      (0.9%)        1.00      (2.5%)   
87.3% (  83% -   91%)
                 LowTerm        7.59      (0.6%)       26.75     (12.9%)  
252.6% ( 237% -  267%)
                 MedTerm        3.35      (0.7%)       12.71      (9.0%)  
279.8% ( 268% -  291%)
{noformat}

I didn't think payloads were THAT slow; I think it must be the advance
implementation?

We need to separately test on-disk DV to make sure it's at least
on-par with payloads (but hopefully faster) and if so ... we should
cutover facets to using DV.


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to