David Smiley created SOLR-11595: ----------------------------------- Summary: optimize SolrIndexSearcher.localCollectionStatistics to use cached MultiFields Key: SOLR-11595 URL: https://issues.apache.org/jira/browse/SOLR-11595 Project: Solr Issue Type: Improvement Security Level: Public (Default Security Level. Issues are Public) Components: search Reporter: David Smiley Assignee: David Smiley Priority: Minor Fix For: 7.2
{{SolrIndexSearcher.localCollectionStatistics(field)}} simply calls Lucene's {{IndexSearcher.collectionStatistics(field)}} which in turn calls {{MultiFields.getTerms(reader, field)}}. Profiling in an app with many 150 fields in the query shows that building the MultiTerms here is expensive. Fortunately it turns out that Solr already has a cached instance via {{SlowCompositeReaderWrapper}} (using MultiFields which has a ConcurrentHashMap to the MultiTerms keyed by field String. Perhaps this should be improved on the Lucene side... not sure. But here on the Solr side, the solution is straight-forward. -- This message was sent by Atlassian JIRA (v6.4.14#64029) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org