[ 
https://issues.apache.org/jira/browse/CHUKWA-680?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13857302#comment-13857302
 ] 

Otis Gospodnetic commented on CHUKWA-680:
-----------------------------------------

Thanks Michael.  I see tables with 1/1 and 100% in Chapter 6, so that must be 
the accuracy.  I have more questions :)
# I assume each cluster is different so one has to train a model for one's own 
cluster?
# Is this really about healthy vs unhealthy or is this more about typical vs. 
atypically cluster workload?
# If cluster's workload changes, does the model need to be retrained?
Thanks!
(over at http://sematext.com/spm we collect lots of metrics from different 
types of systems, including Hadoop and HBase, so this is the angle my questions 
are coming from)


> Pattern recognition of Hadoop generated metrics
> -----------------------------------------------
>
>                 Key: CHUKWA-680
>                 URL: https://issues.apache.org/jira/browse/CHUKWA-680
>             Project: Chukwa
>          Issue Type: New Feature
>          Components: Data Collection
>         Environment: IBM InfoSphere BigInsights Enterprise
>            Reporter: michael yu
>            Assignee: michael yu
>            Priority: Minor
>              Labels: GSoC, GSoC2013
>         Attachments: Yu, Michael et al-project-report-draft.pdf
>
>   Original Estimate: 2,760h
>  Remaining Estimate: 2,760h
>
> Charles Lin and I are working on our IBM SJSU masters project on "Pattern 
> recognition of Hadoop generated metrics".
> The purpose of the project is to use libsvm to predict the health of the 
> cluster.
> The scope of the project includes:
> 1) gathering large scale data set of metrics for healthy and unhealthy 
> clusters
> 2) use #1 and libsvm to generate training model
> 3) periodic collection of metrics and comparing against training model using 
> libsvm to predict the cluster health
>    a) if unhealthy, send email notification to system administrator 



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

Reply via email to