Hi Everyone,

We have found the Jaccard similarity utility to be a useful way to measure the 
similarity or dissimilarity between theta sketches. 

My use case involves using summaries from tuple sketches for both mean and 
distinct summary counts.

The Java library, however, does not provide an implementation that can 
compare:
- two tuple sketches
- a theta and a tuple sketch

I would like to know whether it makes sense to implement, for tuple sketches, 
a utility similar to the one in the theta package.  Clearly this would apply 
only to the hashed values and not to the summaries.
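
To make the idea concrete, here is a minimal sketch of what comparing only 
the hashed values could look like. It is plain Java and does not call the 
library; the class name, the method name and the long[] inputs are 
hypothetical stand-ins for the retained hashes of two sketches. It also 
ignores theta and any error bounds, so it only illustrates the principle and 
is not a proposed implementation.

```java
import java.util.HashSet;
import java.util.Set;

// Hypothetical illustration only: not part of the library.
public class HashJaccardSketch {

    // Jaccard index |A intersect B| / |A union B| over two collections of
    // retained hash values. Summaries are never consulted, so the inputs
    // could come from a theta sketch or a tuple sketch alike.
    static double jaccard(long[] hashesA, long[] hashesB) {
        Set<Long> a = new HashSet<>();
        for (long h : hashesA) {
            a.add(h);
        }
        Set<Long> b = new HashSet<>();
        for (long h : hashesB) {
            b.add(h);
        }

        if (a.isEmpty() && b.isEmpty()) {
            return 1.0; // assumed convention: two empty sets count as identical
        }

        int intersection = 0;
        for (Long h : a) {
            if (b.contains(h)) {
                intersection++;
            }
        }
        int union = a.size() + b.size() - intersection;
        return (double) intersection / union;
    }

    public static void main(String[] args) {
        long[] x = {1L, 2L, 3L};
        long[] y = {2L, 3L, 4L};
        // 2 shared hashes out of 4 distinct hashes
        System.out.println(jaccard(x, y));
    }
}
```

A real utility would work on the sketches themselves and would have to 
account for their sampling, which this sketch leaves out.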

I would be happy to submit a pull request if adding this makes sense and would 
benefit other users.

Thanks in advance for any feedback,
David