Hi everyone,

We have found the Jaccard similarity utility to be a useful way to measure how similar or dissimilar two theta sketches are.
My use case uses summaries from tuple sketches for both mean and distinct summary counts. The Java library, however, does not provide an implementation that can compare:

- two tuple sketches
- a theta sketch and a tuple sketch

I would like to know whether it makes sense to implement a utility for tuple sketches similar to the one in the theta package. Clearly this would apply only to the hashed values and not to the summaries.

I would be happy to submit a pull request if adding this makes sense and would benefit other users.

Thanks in advance for any feedback,
David
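
P.S. To make the idea concrete, here is a rough, library-independent sketch of what "Jaccard on the hashed values only" could look like. It is not the DataSketches API: the class name `TupleJaccardSketch`, the `jaccard` signature, and the use of plain `Set<Long>` and `long` values as stand-ins for a sketch's retained hashes and theta are all assumptions made for illustration. It also returns only a point estimate, whereas the theta utility reports lower and upper bounds as well.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

// Hypothetical illustration only; none of these names exist in the library.
final class TupleJaccardSketch {

    // Estimates |A intersect B| / |A union B| from retained hash values alone.
    // Summaries never enter the computation, so a tuple sketch (hash -> summary)
    // and a theta sketch (hashes only) can be treated the same way.
    static double jaccard(Set<Long> hashesA, long thetaA,
                          Set<Long> hashesB, long thetaB) {
        // Both sides must be compared under the same sampling threshold.
        long theta = Math.min(thetaA, thetaB);
        Set<Long> a = retainBelow(hashesA, theta);
        Set<Long> b = retainBelow(hashesB, theta);

        Set<Long> union = new HashSet<>(a);
        union.addAll(b);
        if (union.isEmpty()) {
            return 1.0; // assumed convention: two empty inputs count as identical
        }
        Set<Long> intersection = new HashSet<>(a);
        intersection.retainAll(b);
        return (double) intersection.size() / union.size();
    }

    // Keeps only the hashes strictly below the common theta.
    private static Set<Long> retainBelow(Set<Long> hashes, long theta) {
        Set<Long> kept = new HashSet<>();
        for (long h : hashes) {
            if (h < theta) {
                kept.add(h);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        // Stand-in for a tuple sketch: retained hash -> summary (for example, a mean).
        Map<Long, Double> tuple = new HashMap<>();
        tuple.put(1L, 10.0);
        tuple.put(2L, 20.0);
        tuple.put(3L, 30.0);
        tuple.put(4L, 40.0);

        // Stand-in for a theta sketch: retained hashes only.
        Set<Long> theta = new HashSet<>(Set.of(3L, 4L, 5L, 6L));

        // Only the tuple sketch's keys (its hashes) are passed in.
        System.out.println(
            jaccard(tuple.keySet(), Long.MAX_VALUE, theta, Long.MAX_VALUE));
    }
}
```

Because only `keySet()` is read from the tuple side, the same method would cover both cases listed above (tuple vs. tuple and theta vs. tuple). A real implementation would presumably reuse the existing union and intersection operations so that the result comes with error bounds.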
