[ https://issues.apache.org/jira/browse/APEXMALHAR-2094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sandeep Narayanaswami updated APEXMALHAR-2094: ---------------------------------------------- Comment: was deleted (was: [~thw] Thomas, I do intend to add more functionality. DataSketches includes algorithms like hyperloglog, and heavy hitters / frequent items and a "theta sketch" framework that will allow set operations on multiple streams, and I want to build Apex operators for each of these. By "module", do you mean a package like {{statistics}} or {{algo}} in the Malhar library? If so, yes, it might make sense to move this operator from {{statistics}} into a {{sketches}} package that will encapsulate all stream sketching algorithms. ) > Quantiles sketch operator > ------------------------- > > Key: APEXMALHAR-2094 > URL: https://issues.apache.org/jira/browse/APEXMALHAR-2094 > Project: Apache Apex Malhar > Issue Type: New Feature > Reporter: Sandeep Narayanaswami > Assignee: Sandeep Narayanaswami > Priority: Minor > > An operator that "sketches" in an online fashion the probability distribution > of an input (numeric) data stream, enabling computation of quantiles and > cumulative distribution functions. -- This message was sent by Atlassian JIRA (v6.3.4#6332)