[
https://issues.apache.org/jira/browse/APEXMALHAR-2094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15296714#comment-15296714
]
Sandeep Narayanaswami commented on APEXMALHAR-2094:
---------------------------------------------------
[~thw] Thomas, I do intend to add more functionality. DataSketches includes
algorithms such as HyperLogLog and heavy hitters / frequent items, as well as
a "theta sketch" framework that allows set operations on multiple streams,
and I want to build Apex operators for each of these.
By "module", do you mean a package like {{statistics}} or {{algo}} in the
Malhar library? If so, yes, it might make sense to move this operator from
{{statistics}} into a {{sketches}} package that will encapsulate all stream
sketching algorithms.
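
For readers unfamiliar with the term, here is a rough illustration of what
"sketching" a stream for quantiles means: keep a small, bounded summary that is
updated once per item and can answer quantile and CDF queries at any time. This
is only a toy under stated assumptions. It swaps in plain reservoir sampling
(Algorithm R), which is not the algorithm DataSketches uses, and the class and
method names ({{ReservoirQuantiles}}, {{update}}, {{quantile}}, {{cdf}}) are
hypothetical and do not correspond to any Apex, Malhar, or DataSketches API.

```python
import bisect
import random


class ReservoirQuantiles:
    """Toy online quantile estimator backed by a fixed-size uniform sample.

    Hypothetical illustration only; this is NOT the DataSketches algorithm.
    """

    def __init__(self, capacity=1024, seed=0):
        self.capacity = capacity
        self.count = 0      # total number of items seen so far
        self.sample = []    # uniform random sample of the stream
        self.rng = random.Random(seed)

    def update(self, value):
        """Consume one stream item using bounded memory."""
        self.count += 1
        if len(self.sample) < self.capacity:
            self.sample.append(value)
        else:
            # Algorithm R: keep the new item with probability capacity/count.
            j = self.rng.randrange(self.count)
            if j < self.capacity:
                self.sample[j] = value

    def quantile(self, q):
        """Approximate value at rank fraction q in [0, 1]."""
        if not self.sample:
            raise ValueError("no data")
        ordered = sorted(self.sample)
        idx = min(int(q * len(ordered)), len(ordered) - 1)
        return ordered[idx]

    def cdf(self, x):
        """Approximate fraction of stream items that are <= x."""
        if not self.sample:
            raise ValueError("no data")
        ordered = sorted(self.sample)
        return bisect.bisect_right(ordered, x) / len(ordered)


if __name__ == "__main__":
    sketch = ReservoirQuantiles(capacity=256, seed=42)
    stream_rng = random.Random(7)
    for _ in range(100_000):
        sketch.update(stream_rng.gauss(0.0, 1.0))
    # Estimates are approximate; accuracy depends on the sample capacity.
    print("median estimate:", sketch.quantile(0.5))
    print("CDF estimate at 1.0:", sketch.cdf(1.0))
```

An operator along these lines would call the update step once per incoming
tuple and emit quantile or CDF estimates on demand or at window boundaries. A
real sketch would presumably offer better accuracy guarantees for the same
memory than this sampling approach.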
> Quantiles sketch operator
> -------------------------
>
> Key: APEXMALHAR-2094
> URL: https://issues.apache.org/jira/browse/APEXMALHAR-2094
> Project: Apache Apex Malhar
> Issue Type: New Feature
> Reporter: Sandeep Narayanaswami
> Assignee: Sandeep Narayanaswami
> Priority: Minor
>
> An operator that "sketches" in an online fashion the probability distribution
> of an input (numeric) data stream, enabling computation of quantiles and
> cumulative distribution functions.