Re: E-Commerce Search: tf-idf, tie-break and boolean model

2017-10-20 Thread Walter Underwood
Setting mm to 100% means that any misspelled word in a query means zero results. That is not a good experience. Usually, 10% of queries contain a misspelling. Set mm to 1. The F-measure is not a good choice for this because recall is not very important in e-commerce. Use precision-oriented

Re: E-Commerce Search: tf-idf, tie-break and boolean model

2017-10-20 Thread Vincenzo D'Amore
Thanks for all the info, I really appreciate your help. I'm working on the configuration and following your suggestions. We already had a golden set of query-results pairs (~1000) used to tune and check how my application (and Solr configuration) performs. But I've to entirely double check if

Re: E-Commerce Search: tf-idf, tie-break and boolean model

2017-10-17 Thread Walter Underwood
That page from Stanford is not about e-commerce search. Westlaw is professional librarian search. I agree with Emir’s advice. Start with edismax. Use a small value for the tie-breaker. It is one of the least important configuration values. I use the default from the sample configs: 0.1

Re: E-Commerce Search: tf-idf, tie-break and boolean model

2017-10-17 Thread Erick Erickson
ation to concepts/classes), synonyms and relations between > search terms. > > /JZ > > -Original Message- > From: Charlie Hull [mailto:char...@flax.co.uk] > Sent: Tuesday, October 17, 2017 10:10 AM > To: solr-user@lucene.apache.org > Subject: Re: E-Commerce Sea

RE: E-Commerce Search: tf-idf, tie-break and boolean model

2017-10-17 Thread Junte Zhang
between search terms. /JZ -Original Message- From: Charlie Hull [mailto:char...@flax.co.uk] Sent: Tuesday, October 17, 2017 10:10 AM To: solr-user@lucene.apache.org Subject: Re: E-Commerce Search: tf-idf, tie-break and boolean model For our e-commerce customers we've been recommending

Re: E-Commerce Search: tf-idf, tie-break and boolean model

2017-10-17 Thread Charlie Hull
For our e-commerce customers we've been recommending a test-based relevance tuning strategy: here's a series of blogs written for us by someone who ran search for the world's largest electronic component distributor:

Re: E-Commerce Search: tf-idf, tie-break and boolean model

2017-10-16 Thread alessandro.benedetti
I was having the discussion with a colleague of mine recently, about E-commerce search. Of course there are tons of things you can do to improve relevancy: Custom similarity - edismax tuning - basic user events processing - machine learning integrations - semantic search ect ect more you do,

Re: E-Commerce Search: tf-idf, tie-break and boolean model

2017-10-16 Thread Emir Arnautović
Hi Vincenzo, Unless you have really specific ranking requirements, I would not suggest you to start with you proprietary similarity implementation. In most cases edismax will be good enough to cover your requirements. It is not easy task to tune edismax since it has a log knobs that you can