[ 
https://issues.apache.org/jira/browse/MAHOUT-1423?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14056810#comment-14056810
 ] 

Matteo commented on MAHOUT-1423:
--------------------------------

We tried to refactor the proposed interface to make it more general. The first 
assumption we made, is that the focus is narrowed to time series anomaly 
detection as described by the issue. The second assumption is that a time 
series is represented as a Matrix, where each row corresponds to a time series 
point, and each point is represented as a Vector of features. For this reason 
the interface uses now Matrix to exchange the data.

Answering to the points made in the last feedback:
a) We improved the README and added a simple test, even though the tests should 
still be improved.

b)
- the interface is used to provide a skeleton for time series anomaly detection 
to the users, which uses the t-digest to spot the anomalies
- we changed the data format to be more general (from Vector to Matrix)
- the proposed interface is indeed not suitable for an on-line anomaly 
detection; it is possible to discuss how to adapt it to consider this use-case 

Any further comments will be appreciated!

> Add time series anomaly detection example
> -----------------------------------------
>
>                 Key: MAHOUT-1423
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1423
>             Project: Mahout
>          Issue Type: New Feature
>          Components: Examples
>    Affects Versions: 0.9
>            Reporter: Andrew Musselman
>            Assignee: Andrew Musselman
>            Priority: Minor
>             Fix For: 1.0
>
>
> Add an example of anomaly detection in a time series along the lines of Ted's 
> EKG example starting about slide 20 here:  
> http://www.slideshare.net/tdunning/strata-2014-anomaly-detection



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to