Hi Nitin, In Flipkart, we used an external re-ranker (called it L2 re-ranker). In the first version we tried building an auxiliary store for ephemeral fields and plugged them inside solr scoring. It didn't scale. Retrieval of features from solr index is one challenge but redundant matching in 1st phase was also wasteful. L2 re-ranker outside worked really well. We tried a bunch of other optimizations as well. sorted index, single segment index (built through MR and nightly). You can watch our talks from slashN ( Flipakrt tech conference).
1. Near Real-Time Indexing - Umesh Prasad & Thejus V M, Flipkart https://www.youtube.com/watch?v=05rX0mJ2N4U&list=PLf85w1fkhA5EW7KvZULMKm97REOCxsUZz&index=2 2..Resource optimisation for Search at Scale in Flipkart - Monish Gandhi https://www.youtube.com/watch?v=PCFJ7iZ1Uvs&list=PLf85w1fkhA5EW7KvZULMKm97REOCxsUZz&index=8 These are lessons learnt from trenches in managing a large cluster and continuously growing traffic. Would be happy to answer any questions you have from talks. Thanks & Regards Umesh Prasad https://www.linkedin.com/in/umesh-prasad-iitk/ On Wed, 5 May 2021 at 20:22, Alessandro Benedetti <a.benede...@sease.io> wrote: > Hi Nitin, > based on my experience, if you have document-level features and query > dependent features (query-document level) using the internal Solr re-ranker > would be beneficial in terms of performance. > The way Solr extracts features values from the index data structures is > expensive but it should be much cheaper than just fetching the top-K from > Solr and then extracting all the feature vectors and re-ranking outside. > I never did an explicit benchmark comparison though, it can be an > interesting idea for a blog. > > Cheers > > -------------------------- > Alessandro Benedetti > Apache Lucene/Solr Committer > Director, R&D Software Engineer, Search Consultant > > www.sease.io > > > On Thu, 29 Apr 2021 at 19:24, Nitin Arora <nitinaror...@gmail.com> wrote: > > > Hello, Can someone share the pros and cons of using SOLR's learning to > rank > > vs having an external reranker after fetching SOLR's top results. Which > > option would you recommend? > > > > Thanks in advance, > > >