[jira] [Commented] (EAGLE-1024) Monitor jobs with high RPC throughput

ASF GitHub Bot (JIRA) Fri, 19 May 2017 03:09:53 -0700

    [ 
https://issues.apache.org/jira/browse/EAGLE-1024?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16017170#comment-16017170
 ]


ASF GitHub Bot commented on EAGLE-1024:
---------------------------------------

Github user asfgit closed the pull request at:

    https://github.com/apache/eagle/pull/938


> Monitor jobs with high RPC throughput 
> --------------------------------------
>
>                 Key: EAGLE-1024
>                 URL: https://issues.apache.org/jira/browse/EAGLE-1024
>             Project: Eagle
>          Issue Type: Improvement
>    Affects Versions: v0.5.0
>            Reporter: Zhao, Qingwen
>            Assignee: Zhao, Qingwen
>
> We've identified some jobs with high RPC throughput, which causes heavy RPC 
> overhead on the NN (NameNode). These jobs issued an extremely large number of 
> HDFS operations in a very short window (2 mins).
> So we intend to capture those jobs when both conditions hold:
> a) the job has very large RPC throughput, computed as the job's total HDFS 
> ops divided by the job duration, and that throughput is larger than 1000
> b) and the HDFS ops per task is larger than 25
> Then send out an alert. Later, we will notify the users to optimize 
> their jobs.
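
The two conditions in the description above can be sketched roughly as follows. This is a minimal illustration, not the code merged in the pull request: the `JobStats` record, its field names, and `should_alert` are hypothetical, and the issue does not state the unit of the 1000 threshold, so ops per second is assumed here.

```python
from dataclasses import dataclass

# Thresholds quoted in the issue description. The unit of the throughput
# threshold is not stated there; ops per second is an assumption.
THROUGHPUT_THRESHOLD = 1000
OPS_PER_TASK_THRESHOLD = 25


@dataclass
class JobStats:
    # Hypothetical record; field names are illustrative, not Eagle's schema.
    job_id: str
    total_hdfs_ops: int
    duration_seconds: float
    num_tasks: int


def should_alert(job: JobStats) -> bool:
    """Return True only when both condition (a) and condition (b) hold."""
    if job.duration_seconds <= 0 or job.num_tasks <= 0:
        # Neither ratio can be computed, so do not alert.
        return False
    throughput = job.total_hdfs_ops / job.duration_seconds  # condition (a)
    ops_per_task = job.total_hdfs_ops / job.num_tasks       # condition (b)
    return (throughput > THROUGHPUT_THRESHOLD
            and ops_per_task > OPS_PER_TASK_THRESHOLD)
```

Requiring both conditions means a short job with many tasks and few HDFS ops per task is not flagged on throughput alone.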



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
