[ 
https://issues.apache.org/jira/browse/LUCENE-4610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13528662#comment-13528662
 ] 

Shai Erera commented on LUCENE-4610:
------------------------------------

Borrowing a comment I made on LUCENE-4602 - we should pull the {{parents[]}} 
from TaxoReader, rather than call getParent for every ordinal. Though I didn't 
benchmark it, when I called tr.getParent() in 
TestDirTaxoWriter.testConcurrency, which generates a large taxonomy, I saw 
nearly 3x speedups after moving to pulling the array. 100 iterations dropped 
from 150s to 60s.
                
> Implement a NoParentsAccumulator
> --------------------------------
>
>                 Key: LUCENE-4610
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4610
>             Project: Lucene - Core
>          Issue Type: New Feature
>          Components: modules/facet
>            Reporter: Shai Erera
>
> Mike experimented with encoding just the exact categories ordinals on 
> LUCENE-4602, and I added OrdinalPolicy.NO_PARENTS, with a comment saying that 
> this requires a special FacetsAccumulator.
> The idea is to write the exact categories only for each document, and then at 
> search time count up the parents chain to compute requested facets (I say 
> count, but it can be any weight).
> One limitation of such accumulator is that it cannot be used when e.g. a 
> document is associated with two categories who share the same parent, because 
> that may result in incorrect weights computed (e.g. a document might have 
> several Authors, and so counting the Author facet may yield wrong counts). So 
> it can be used only when the app knows it doesn't add such facets, or that it 
> always asks to aggregate a 'root' that in its path this criteria doesn't hold 
> (no categories share the same parent).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to