[jira] [Commented] (YARN-6862) Nodemanager resource usage metrics sometimes are negative

YunFan Zhou (JIRA) Mon, 24 Jul 2017 08:53:15 -0700

    [ 
https://issues.apache.org/jira/browse/YARN-6862?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16098603#comment-16098603
 ]


YunFan Zhou commented on YARN-6862:
-----------------------------------

[~sunilg] Thanks.

We only can see used memory from NM logs, and from NM logs we can see some logs 
as follows:

2017-07-24 22:19:08,551 INFO 
org.apache.hadoop.yarn.server.nodemanager.containermanager.monitor.ContainersMonitorImpl:
 Memory usage of ProcessTree 23933 for container-id 
container_e6717_1500903083707_0014_01_000259: -1B of 1 GB physical memory used; 
-1B of 2.1 GB virtual memory used

Because we collect resource usage metrics direct from MonitoringThread#run 
method, so the metrics is very reliable.


> Nodemanager resource usage metrics sometimes are negative
> ---------------------------------------------------------
>
>                 Key: YARN-6862
>                 URL: https://issues.apache.org/jira/browse/YARN-6862
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: nodemanager
>    Affects Versions: 2.8.2
>            Reporter: YunFan Zhou
>
> When we collect real-time metrics of resource usage in NM, we found those 
> values sometimes are invalid.
> For example, the following are values when collected at some point:
> "milliVcoresUsed":-5808,
> "currentPmemUsage":-1,
> "currentVmemUsage":-1,
> "cpuUsagePercentPerCore":-968.1026
> "cpuUsageTotalCoresPercentage":-24.202564,
> "pmemLimit":2147483648,
> "vmemLimit":4509715456
> There are many negative values,  there may a bug in NM. 
> We should fix it, because the real-time metrics of NM is pretty important for 
> us sometimes.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Commented] (YARN-6862) Nodemanager resource usage metrics sometimes are negative

Reply via email to