Hi, When CMS is merged, the table is updated with the counts added from all the estimators. In my use case I think, it could be either min or max function of the estimated counts from different estimators to be used.
Use case there are n-categories & for each category there is CMS for estimation of topK in the category; Additionally I need to find topK for all the items across categories; using merge it could result in double counting, instead I would like to use min or max. I have tried to extend the existing CountMinSketch class to add a new combine function which will take min / max in merge() call. Alas, all the required variables are package private and I couldn't make it work. Do let me know if it is correct to use different functions in merging. Thanks, Sumanth -- You received this message because you are subscribed to the Google Groups "stream-lib-user" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. For more options, visit https://groups.google.com/d/optout.
