[ https://issues.apache.org/jira/browse/LUCENE-4602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13527958#comment-13527958 ]
Shai Erera commented on LUCENE-4602:
------------------------------------

I reviewed DocValuesCountingFacetsCollector, nice work!

See my last comment on LUCENE-4565 about taxoReader.getParent, vs. using the parents[] directly. Specifically, I wonder if we'll see any gain if we move to use the parents[] array directly, instead of getParent (in getFacetResults):

{code}
+      if (count != 0) {
+        int ordUp = taxoReader.getParent(ord); // HERE
+        while(ordUp != 0) {
+          //System.out.println(" parent=" + ordUp + " cp=" + taxoReader.getPath(ordUp));
+          counts[ordUp] += count;
+          ordUp = taxoReader.getParent(ordUp); // AND HERE
+        }
+      }
{code}

> Use DocValues to store per-doc facet ord
> ----------------------------------------
>
>                 Key: LUCENE-4602
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4602
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Michael McCandless
>         Attachments: LUCENE-4602.patch
>
>
> Spinoff from LUCENE-4600
> DocValues can be used to hold the byte[] encoding all facet ords for
> the document, instead of payloads. I made a hacked up approximation
> of in-RAM DV (see CachedCountingFacetsCollector in the patch) and the
> gains were somewhat surprisingly large:
> {noformat}
>     Task    QPS base   StdDev    QPS comp   StdDev    Pct diff
> HighTerm        0.53   (0.9%)        1.00   (2.5%)     87.3% (  83% -  91%)
>  LowTerm        7.59   (0.6%)       26.75  (12.9%)    252.6% ( 237% - 267%)
>  MedTerm        3.35   (0.7%)       12.71   (9.0%)    279.8% ( 268% - 291%)
> {noformat}
> I didn't think payloads were THAT slow; I think it must be the advance
> implementation?
> We need to separately test on-disk DV to make sure it's at least
> on-par with payloads (but hopefully faster) and if so ... we should
> cutover facets to using DV.

--
This message is automatically generated by JIRA.
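For reference, the suggestion in the comment (walking the parents[] array directly instead of calling taxoReader.getParent for every step) could look roughly like the sketch below. It is only an illustration, not code from the patch: the class name ParentRollup, the method rollUp, and the sample taxonomy are hypothetical, and it assumes that parents[ord] holds the parent ordinal of ord and that the root is ordinal 0. How the array is obtained from the taxonomy reader is left out, since that depends on the Lucene version. Whether this is actually faster is exactly the open question in the comment and would need to be benchmarked.

```java
import java.util.Arrays;

/** Hypothetical stand-in for the roll-up loop; not part of the LUCENE-4602 patch. */
final class ParentRollup {

    /**
     * Adds each ordinal's direct count to every ancestor below the root,
     * mirroring the loop in the patch but reading the parents[] array directly.
     * Assumption: parents[ord] is the parent ordinal of ord, and ordinal 0 is the root.
     * The input counts are left untouched; a new array is returned.
     */
    static int[] rollUp(int[] parents, int[] directCounts) {
        int[] counts = directCounts.clone();
        // Ordinal 0 is the root, so start at 1.
        for (int ord = 1; ord < directCounts.length; ord++) {
            int count = directCounts[ord];
            if (count != 0) {
                int ordUp = parents[ord];          // array lookup instead of getParent(ord)
                while (ordUp != 0) {
                    counts[ordUp] += count;
                    ordUp = parents[ordUp];        // and again here
                }
            }
        }
        return counts;
    }

    public static void main(String[] args) {
        // Made-up taxonomy: 0=root, 1=Author, 2=Author/Bob, 3=Author/Lisa,
        // 4=Date, 5=Date/2012, 6=Date/2012/12
        int[] parents = {-1, 0, 1, 1, 0, 4, 5};
        int[] direct  = { 0, 0, 3, 2, 0, 0, 4};
        System.out.println(Arrays.toString(rollUp(parents, direct)));
        // prints [0, 5, 3, 2, 4, 4, 4]
    }
}
```

The sketch reads from the original per-ordinal counts and writes into a copy, so a parent's accumulated total is never propagated a second time; like the loop in the patch, it does not add anything to the root ordinal.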