Gotcha - that's disheartening.

One idea: when I run termfreq, I get all of the termfreqs for each document
one-by-one.

Is there a way to have solr sum it up before creating the request, so I
only receive one number in the response?


On Sat, Oct 24, 2015 at 11:05 AM, Upayavira <u...@odoko.co.uk> wrote:

> If you mean using the term frequency function query, then I'm not sure
> there's a huge amount you can do to improve performance.
>
> The term frequency is a number that is used often, so it is stored in
> the index pre-calculated. Perhaps, if your data is not changing,
> optimising your index would reduce it to one segment, and thus might
> ever so slightly speed the aggregation of term frequencies, but I doubt
> it'd make enough difference to make it worth doing.
>
> Upayavira
>
> On Sat, Oct 24, 2015, at 03:37 PM, Aki Balogh wrote:
> > Thanks, Jack. I did some more research and found similar results.
> >
> > In our application, we are making multiple (think: 50) concurrent
> > requests
> > to calculate term frequency on a set of documents in "real-time". The
> > faster that results return, the better.
> >
> > Most of these requests are unique, so cache only helps slightly.
> >
> > This analysis is happening on a single solr instance.
> >
> > Other than moving to solr cloud and splitting out the processing onto
> > multiple servers, do you have any suggestions for what might speed up
> > termfreq at query time?
> >
> > Thanks,
> > Aki
> >
> >
> > On Fri, Oct 23, 2015 at 7:21 PM, Jack Krupansky
> > <jack.krupan...@gmail.com>
> > wrote:
> >
> > > Term frequency applies only to the indexed terms of a tokenized field.
> > > DocValues is really just a copy of the original source text and is not
> > > tokenized into terms.
> > >
> > > Maybe you could explain how exactly you are using term frequency in
> > > function queries. More importantly, what is so "heavy" about your
> usage?
> > > Generally, moderate use of a feature is much more advisable to heavy
> usage,
> > > unless you don't care about performance.
> > >
> > > -- Jack Krupansky
> > >
> > > On Fri, Oct 23, 2015 at 8:19 AM, Aki Balogh <a...@marketmuse.com>
> wrote:
> > >
> > > > Hello,
> > > >
> > > > In our solr application, we use a Function Query (termfreq) very
> heavily.
> > > >
> > > > Index time and disk space are not important, but we're looking to
> improve
> > > > performance on termfreq at query time.
> > > > I've been reading up on docValues. Would this be a way to improve
> > > > performance?
> > > >
> > > > I had read that Lucene uses Field Cache for Function Queries, so
> > > > performance may not be affected.
> > > >
> > > >
> > > > And, any general suggestions for improving query performance on
> Function
> > > > Queries?
> > > >
> > > > Thanks,
> > > > Aki
> > > >
> > >
>

Reply via email to