[ 
https://issues.apache.org/jira/browse/UIMA-5047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jerry Cwiklik updated UIMA-5047:
--------------------------------
    Description: 
Currently agent fetches metrics from different sources including /proc file 
system, top, ps, etc. The files found in /proc file system don't have fixed 
structure across Linux platforms causing some metrics to be missing or be 
wrong. Also scrapping lnx commands may not be 100% reliable as well

Change the code to fetch metrics from cgroups. Where to get metrics not 
available in cgroups is an open question.

For reporting the CPU, the agent needs to look at cgroups cpuacct.stat file and 
use the values for reporting. The agent should keep previous reading and use it 
to compute delta for the "current" CPU. Agent should not scrape top to get the 
current CPU use.

In case cgroups are not available, the agent will provide 0 or N/A for metrics 
which should come from cgroups.


  was:
Currently agent fetches metrics from different sources including /proc file 
system. The files found in /proc file system don't have fixed structure across 
Linux platforms causing some metrics to be missing or be wrong. 

Change the code to find an alternate way to get needed metrics from sources 
that are more reliable. 

Use cgroups cpu and memory accounting when cgroups are enabled. In case when 
cgroups are not enabled provide 0 or N/A as values.    


> UIMA-DUCC: refactor agent code which gathers node and process metrics
> ---------------------------------------------------------------------
>
>                 Key: UIMA-5047
>                 URL: https://issues.apache.org/jira/browse/UIMA-5047
>             Project: UIMA
>          Issue Type: Improvement
>          Components: DUCC
>            Reporter: Jerry Cwiklik
>            Assignee: Jerry Cwiklik
>             Fix For: 2.2.0-Ducc
>
>
> Currently agent fetches metrics from different sources including /proc file 
> system, top, ps, etc. The files found in /proc file system don't have fixed 
> structure across Linux platforms causing some metrics to be missing or be 
> wrong. Also scrapping lnx commands may not be 100% reliable as well
> Change the code to fetch metrics from cgroups. Where to get metrics not 
> available in cgroups is an open question.
> For reporting the CPU, the agent needs to look at cgroups cpuacct.stat file 
> and use the values for reporting. The agent should keep previous reading and 
> use it to compute delta for the "current" CPU. Agent should not scrape top to 
> get the current CPU use.
> In case cgroups are not available, the agent will provide 0 or N/A for 
> metrics which should come from cgroups.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to