[jira] [Commented] (SOLR-12878) FacetFieldProcessorByHashDV is reconstructing FieldInfos on every instantiation

Tim Underwood (JIRA) Fri, 02 Nov 2018 09:06:39 -0700


    [ 
https://issues.apache.org/jira/browse/SOLR-12878?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16673334#comment-16673334
 ]


Tim Underwood commented on SOLR-12878:
--------------------------------------

There are 2 other LeafReader implementations not caching their FieldInfos:
 * org.apache.solr.search.CollapsingQParserPlugin.ReaderWrapper
 * org.apache.solr.handler.component.ExpandComponent.ReaderWrapper

Both have code that is basically this:
{code:java}
@SuppressWarnings("resource") LeafReader uninvertingReader = 
UninvertingReader.wrap(
    new ReaderWrapper(searcher.getSlowAtomicReader(), field),
    Collections.singletonMap(field, UninvertingReader.Type.SORTED)::get);
{code}
Note the searcher.getSlowAtomicReader() call which means moving the caching of 
FieldInfos into SlowCompositeReaderWrapper will help in this case!

Here is part of the code from UninvertingReader.wrap:
{code:java}
public static LeafReader wrap(LeafReader in, Function<String, Type> mapping) {
  boolean wrap = false;

  // Calculate a new FieldInfos that has DocValuesType where we didn't before
  ArrayList<FieldInfo> newFieldInfos = new 
ArrayList<>(in.getFieldInfos().size());
  for (FieldInfo fi : in.getFieldInfos()) {
    DocValuesType type = fi.getDocValuesType();
{code}
Note the 2 immediate calls to in.getFieldInfos().  So... I'll add the FieldInfo 
caching to both of those ReaderWrapper classes since it would help.

 

> FacetFieldProcessorByHashDV is reconstructing FieldInfos on every 
> instantiation
> -------------------------------------------------------------------------------
>
>                 Key: SOLR-12878
>                 URL: https://issues.apache.org/jira/browse/SOLR-12878
>             Project: Solr
>          Issue Type: Bug
>      Security Level: Public(Default Security Level. Issues are Public) 
>          Components: Facet Module
>    Affects Versions: 7.5
>            Reporter: Tim Underwood
>            Priority: Major
>              Labels: performance
>             Fix For: 7.6, master (8.0)
>
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> The FacetFieldProcessorByHashDV constructor is currently calling:
> {noformat}
> FieldInfo fieldInfo = 
> fcontext.searcher.getSlowAtomicReader().getFieldInfos().fieldInfo(sf.getName());
> {noformat}
> Which is reconstructing FieldInfos each time.  Simply switching it to:
> {noformat}
> FieldInfo fieldInfo = 
> fcontext.searcher.getFieldInfos().fieldInfo(sf.getName());
> {noformat}
>  
> causes it to use the cached version of FieldInfos in the SolrIndexSearcher.
> On my index the FacetFieldProcessorByHashDV is 2-3 times slower than the 
> legacy facets without this fix.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Commented] (SOLR-12878) FacetFieldProcessorByHashDV is reconstructing FieldInfos on every instantiation

Reply via email to