[ 
https://issues.apache.org/jira/browse/OAK-9339?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17286296#comment-17286296
 ] 

Amrit Verma commented on OAK-9339:
----------------------------------

[~mreutegg] Thanks for pointing that out. I will add the license header to 
elasticstartscript.sh. Do we also need a license header in the imagedata.txt 
file? If so, I would have to modify the test class to skip those license 
lines. I didn't add a license header to the text file because I didn't see 
one in another similar file:

[https://github.com/oak-indexing/jackrabbit-oak/blob/trunk/oak-lucene/src/test/resources/org/apache/jackrabbit/oak/query/fvs.csv]
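
A minimal sketch of what "skipping the license lines" in the test class could look like. It assumes the license header in the data file would be written as lines starting with '#', which is only an assumption about how imagedata.txt might be formatted. The class and method names are hypothetical and not taken from the Oak code base.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.Reader;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Oak: reads a text resource and drops header lines.
class LicenseAwareReader {

    // Assumption: license header lines start with '#'. Blank lines are skipped too.
    static List<String> readDataLines(Reader source) {
        List<String> dataLines = new ArrayList<>();
        try (BufferedReader reader = new BufferedReader(source)) {
            String line;
            while ((line = reader.readLine()) != null) {
                String trimmed = line.trim();
                if (trimmed.isEmpty() || trimmed.startsWith("#")) {
                    continue; // license header or blank line, not feature data
                }
                dataLines.add(line);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return dataLines;
    }
}
```

With a helper along these lines, the test would only ever see the data rows, so adding a header to the file would not change the parsed feature vectors.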

Regarding size, we have already reduced it considerably by truncating the 
decimal places of the double values. Do we need more compression? If so, I 
think we need to save it as a zip and unzip it during the tests.
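
A rough sketch of the zip-and-unzip idea, using only java.util.zip from the JDK. It assumes the archive holds a single text entry. The class name, the method names, and the entry name used in the usage note are hypothetical. The zip(...) method is included only so the round trip can be demonstrated in memory; in a real test the archive would be a checked-in resource.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import java.util.zip.ZipOutputStream;

// Hypothetical helper, not part of Oak: keeps test data compressed on disk.
class ZippedTestData {

    // Builds an in-memory zip archive with one UTF-8 text entry.
    static byte[] zip(String entryName, String content) {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (ZipOutputStream out = new ZipOutputStream(bytes)) {
            out.putNextEntry(new ZipEntry(entryName));
            out.write(content.getBytes(StandardCharsets.UTF_8));
            out.closeEntry();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return bytes.toByteArray();
    }

    // Reads the first entry of a zip stream back into a string.
    static String unzipFirstEntry(InputStream zipped) {
        try (ZipInputStream in = new ZipInputStream(zipped)) {
            if (in.getNextEntry() == null) {
                throw new IllegalArgumentException("archive has no entries");
            }
            ByteArrayOutputStream content = new ByteArrayOutputStream();
            byte[] buffer = new byte[8192];
            int read;
            while ((read = in.read(buffer)) != -1) {
                content.write(buffer, 0, read);
            }
            return new String(content.toByteArray(), StandardCharsets.UTF_8);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

In a test, the input stream would typically come from getClass().getResourceAsStream(...) on the zipped resource, and the returned text could then be split into lines as before.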

Regarding whether we are allowed to use it, I think we are, but let me 
check again.

> Image Similarity: LSH based search
> ----------------------------------
>
>                 Key: OAK-9339
>                 URL: https://issues.apache.org/jira/browse/OAK-9339
>             Project: Jackrabbit Oak
>          Issue Type: New Feature
>          Components: indexing
>            Reporter: Amrit Verma
>            Assignee: Amrit Verma
>            Priority: Major
>             Fix For: 1.40.0
>
>
> Image Similarity introduced in OAK-9213 won't scale well with millions of 
> images because it does brute force comparisons. We need to explore 
> alternative approaches such as using LSH for similarity search. 
> One of the plugins which provides such functionality is elastiknn - 
> [https://elastiknn.com/api/]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
