Hi All,

Following on from https://issues.apache.org/jira/browse/LUCENE-6968 (I know it’s been a while…), I have a QParser plugin that can generate the appropriate banded queries for Jaccard similarity.
It covers the same functionality that was proposed in the original issue, but wrapped up as a query parser. There are two analysis cases and two query cases. For analysis, the hashes are generated either by tokenisation or by pre-analysis. For queries, the input is either text or provided hash values.

If there is interest, I will create the issue and put up the patch.

Regards,
Andy
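
P.S. For anyone not familiar with banding, here is a rough Python sketch of the general MinHash banding idea. It is not the plugin code: the function names, the SHA-1 based hashing and the band term encoding are all made up for illustration, and the actual patch may differ on every one of these points.

```python
import hashlib


def minhash_signature(tokens, num_hashes=8):
    """Toy MinHash: for each seed, keep the smallest hash seen over all tokens.

    Assumption: seeded SHA-1 stands in for whatever hash family is really used.
    """
    signature = []
    for seed in range(num_hashes):
        signature.append(min(
            int.from_bytes(hashlib.sha1(f"{seed}:{t}".encode()).digest()[:8], "big")
            for t in tokens
        ))
    return signature


def band_terms(signature, rows_per_band):
    """Group consecutive hashes into bands; each band becomes a single term.

    The "<band index>_<hex>_<hex>..." encoding is hypothetical.
    """
    terms = []
    for start in range(0, len(signature), rows_per_band):
        band = signature[start:start + rows_per_band]
        prefix = start // rows_per_band
        terms.append(f"{prefix}_" + "_".join(format(h, "x") for h in band))
    return terms


def candidate_probability(jaccard, rows_per_band, num_bands):
    """Chance that two sets with this Jaccard similarity share at least one band."""
    return 1.0 - (1.0 - jaccard ** rows_per_band) ** num_bands


doc_a = {"the", "quick", "brown", "fox"}
doc_b = {"the", "quick", "brown", "dog"}
terms_a = band_terms(minhash_signature(doc_a), rows_per_band=2)
terms_b = band_terms(minhash_signature(doc_b), rows_per_band=2)

# A banded query is then a disjunction over the band terms: any document
# sharing at least one whole band with the query is a candidate match.
shared = set(terms_a) & set(terms_b)
print(len(terms_a), "bands,", len(shared), "shared")
```

More rows per band makes a match on any single band stricter, while more bands gives more chances to match, so the two together set the similarity threshold at which documents start to be returned.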
