Hi, 

When CMS is merged, the table is updated with the counts added from all the 
estimators.
In my use case I think, it could be either min or max function of the 
estimated counts from different estimators to be used.

Use case
there are n-categories & for each category there is CMS for estimation of 
topK in the category; 
Additionally I need to find topK for all the items across categories; using 
merge it could result in double counting,
instead I would like to use min or max. 

I have tried to extend the existing CountMinSketch class to add a new 
combine function which will take min / max in merge() call.
Alas, all the required variables are package private and I couldn't make it 
work.

Do let me know if it is correct to use different functions in merging.

Thanks,
Sumanth

-- 
You received this message because you are subscribed to the Google Groups 
"stream-lib-user" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/d/optout.

Reply via email to