Hi Nitin

We rerank the top-n documents, based on the original Solr score, from
within a Solr plugin. This gives very good speed.

We use this code: https://github.com/pegesund/clojureranker.

Clojure code, but it should not be hard to make a Java version :)
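
As a rough illustration of the general idea (this is not the clojureranker
code, only a minimal Python sketch; the `popularity` field, the scores and
the `example_rescore` formula are made up for the example), reranking only
the top n hits while leaving the tail untouched could look like this:

```python
from typing import Callable, Dict, List

Doc = Dict[str, object]


def rerank_top_n(docs: List[Doc], n: int,
                 rescore: Callable[[Doc], float]) -> List[Doc]:
    """Rerank only the first n docs; keep the rest in their original order."""
    head, tail = docs[:n], docs[n:]
    # sorted() is stable, so ties keep their original (Solr) order.
    reranked = sorted(head, key=rescore, reverse=True)
    return reranked + tail


def example_rescore(doc: Doc) -> float:
    # Hypothetical formula: original score boosted by an assumed
    # "popularity" field. A real reranker would use its own model here.
    return float(doc["score"]) * (1.0 + float(doc.get("popularity", 0.0)))


# Made-up results, already sorted by the original score.
results = [
    {"id": "a", "score": 3.0, "popularity": 0.0},
    {"id": "b", "score": 2.5, "popularity": 1.0},
    {"id": "c", "score": 2.0, "popularity": 0.1},
    {"id": "d", "score": 1.0, "popularity": 9.0},
]

top = rerank_top_n(results, 3, example_rescore)
print([d["id"] for d in top])
```

Only the first three documents are rescored, so "d" stays last even though
its hypothetical popularity is high. Keeping n small is what keeps the
cost of the second pass bounded.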

Petter

On Wed, 5 May 2021 at 17:12, Umesh Prasad <umesh.i...@gmail.com> wrote:

> Hi Nitin,
> At Flipkart, we used an external re-ranker (we called it the L2
> re-ranker). In the first version we tried building an auxiliary store for
> ephemeral fields and plugging them into Solr scoring. It didn't scale:
> retrieving features from the Solr index was one challenge, and redundant
> matching in the first phase was also wasteful.
> Running the L2 re-ranker outside Solr worked really well. We tried a
> bunch of other optimizations as well: a sorted index and a single-segment
> index (built nightly through MR). You can watch our talks from slashN
> (the Flipkart tech conference).
>
> 1. Near Real-Time Indexing - Umesh Prasad & Thejus V M, Flipkart
>
> https://www.youtube.com/watch?v=05rX0mJ2N4U&list=PLf85w1fkhA5EW7KvZULMKm97REOCxsUZz&index=2
>
> 2. Resource optimisation for Search at Scale in Flipkart - Monish Gandhi
>
> https://www.youtube.com/watch?v=PCFJ7iZ1Uvs&list=PLf85w1fkhA5EW7KvZULMKm97REOCxsUZz&index=8
>
> These are lessons learnt in the trenches while managing a large cluster
> with continuously growing traffic. I would be happy to answer any
> questions you have about the talks.
>
> Thanks & Regards
> Umesh Prasad
>
> https://www.linkedin.com/in/umesh-prasad-iitk/
>
> On Wed, 5 May 2021 at 20:22, Alessandro Benedetti <a.benede...@sease.io>
> wrote:
>
> > Hi Nitin,
> > based on my experience, if you have document-level features and
> > query-dependent features (query-document level), using the internal
> > Solr re-ranker would be beneficial in terms of performance.
> > The way Solr extracts feature values from the index data structures is
> > expensive, but it should be much cheaper than fetching the top-K from
> > Solr and then extracting all the feature vectors and re-ranking outside.
> > I never did an explicit benchmark comparison, though; it could be an
> > interesting idea for a blog post.
> >
> > Cheers
> >
> > --------------------------
> > Alessandro Benedetti
> > Apache Lucene/Solr Committer
> > Director, R&D Software Engineer, Search Consultant
> >
> > www.sease.io
> >
> >
> > On Thu, 29 Apr 2021 at 19:24, Nitin Arora <nitinaror...@gmail.com>
> > wrote:
> >
> > > Hello, can someone share the pros and cons of using Solr's learning
> > > to rank vs. having an external reranker after fetching Solr's top
> > > results? Which option would you recommend?
> > >
> > > Thanks in advance,
> > >
> >
>
