Hi Veda,

Lucene doesn't provide such functionality out of the box, but you could use
MoreLikeThis (
https://lucene.apache.org/core/7_4_0/queries/org/apache/lucene/queries/mlt/MoreLikeThis.html)
to search for similar documents and then compute a finer-grained similarity
score on client-side. This would avoid having to compute a similarity score
with every document of your collection.

Le mer. 19 sept. 2018 à 15:28, Veda G M <ved...@gmail.com> a écrit :

> Hello,
>
> Is it possible to compare large chunks of text and get the similarity
> score/percentage using Lucene?
>
> Say for e.g., we have 2-3 paragraphs of text and need to search if there is
> any document that matches this semantically and the similarity that the
> returned hit and the search string share in terms of percentage.
>
> Could you please let me know if this is possible with Lucene?
>
> Thanks.
>
> Regards,
> Veda
>

Reply via email to