Tommaso Teofili created OAK-7961:
------------------------------------
Summary: LSH should truncate to first decimal place
Key: OAK-7961
URL: https://issues.apache.org/jira/browse/OAK-7961
Project: Jackrabbit Oak
Issue Type: Task
Components: lucene
Reporter: Tommaso Teofili
Fix For: 1.9.14
Feature vector shingles currently keep three decimal places, e.g. _[0.101,
0.202, 0.303]_.
Min hash treats each shingle as an opaque token, so it is not aware that the
second and third decimal places are less important than the first. Two vectors
that differ only in those places yield different shingles, which is likely to
hurt search precision.
We should truncate values to the first decimal place.
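A minimal sketch of the proposed truncation, for illustration only. The class and method names (ShingleTruncation, truncate, toShingles) are hypothetical and are not taken from the Oak code base. It assumes each vector component becomes one textual shingle, and it truncates rather than rounds, using RoundingMode.DOWN.

```java
import java.math.BigDecimal;
import java.math.RoundingMode;
import java.util.ArrayList;
import java.util.List;

public class ShingleTruncation {

    // Hypothetical helper: truncates (does not round) a value to one decimal place.
    // BigDecimal.valueOf uses the canonical decimal string of the double, which
    // avoids binary floating point artifacts when cutting off digits.
    public static String truncate(double value) {
        return BigDecimal.valueOf(value)
                .setScale(1, RoundingMode.DOWN)
                .toPlainString();
    }

    // Assumption: one shingle per feature vector component.
    public static List<String> toShingles(double[] vector) {
        List<String> shingles = new ArrayList<>();
        for (double v : vector) {
            shingles.add(truncate(v));
        }
        return shingles;
    }

    public static void main(String[] args) {
        // Both vectors differ only in the second and third decimal places,
        // so they produce the same shingles after truncation.
        System.out.println(toShingles(new double[] {0.101, 0.202, 0.303}));
        System.out.println(toShingles(new double[] {0.149, 0.251, 0.399}));
    }
}
```

With truncation, both example vectors map to the shingles 0.1, 0.2 and 0.3, so min hash sees them as identical token sets instead of as unrelated ones.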