Yes, that is roughly how MLT works as well. You can also do a full OR-search on 
the terms using LuceneQParser.

Markus

 
 
-----Original message-----
> From:Junte Zhang <junte.zh...@localsearch.ch>
> Sent: Friday 25th August 2017 18:38
> To: solr-user@lucene.apache.org
> Subject: RE: Search by similarity?
> 
> If you already have the title of the document, then you could run that title 
> as a new query against the whole index and exclude the source document from 
> the results as a filter.
> 
> You could use the DisMax query parser: 
> https://cwiki.apache.org/confluence/display/solr/The+DisMax+Query+Parser
> 
> And then set the minimum match ratio of the OR clauses to 90%.
> 
> /JZ
> 
> -----Original Message-----
> From: Darko Todoric [mailto:todo...@mdpi.com] 
> Sent: Friday, August 25, 2017 5:49 PM
> To: solr-user@lucene.apache.org
> Subject: Search by similarity?
> 
> Hi,
> 
> 
> I have 90.000.000 documents in Solr and I need to compare "title" of this 
> document and get all documents with more than 80% similarity. PHP have 
> "similar_text" but it's not so smart inserting 90m documents in the array...
> Can I do some query in Solr which will give me the more the 80% similarity?
> 
> 
> Kind regards,
> Darko Todoric
> 
> --
> Darko Todoric
> Web Engineer, MDPI DOO
> Veljka Dugosevica 54, 11060 Belgrade, Serbia
> +381 65 43 90 620
> www.mdpi.com
> 
> Disclaimer: The information and files contained in this message are 
> confidential and intended solely for the use of the individual or entity to 
> whom they are addressed.
> f you have received this message in error, please notify me and delete this 
> message from your system.
> You may not copy this message in its entirety or in part, or disclose its 
> contents to anyone.
> 
> 

Reply via email to