[jira] [Commented] (HDFS-13055) Aggregate usage statistics from datanodes

Anu Engineer (JIRA) Wed, 24 Jan 2018 11:39:18 -0800

    [ 
https://issues.apache.org/jira/browse/HDFS-13055?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16338111#comment-16338111
 ]


Anu Engineer commented on HDFS-13055:
-------------------------------------

[~ajayydv] Thanks for the patch. It looks good overall. I have some high-level 
questions and some low-level comments. I will skip the low-level comments so 
that we can focus on the high-level questions first.

* First and foremost, what issue are we trying to address with this patch?
* I am concerned that we are adding more load to heartbeat (HB) processing. 
Patches like HDFS-9239 tried to address the slowness of HB processing. Aren't 
we adding more memory and CPU cost to the Namenode?
* All of this information can be retained on the data nodes and pulled only 
when needed. That is, we can expose it in Datanode JMX and perhaps write a 
simple page on the Namenode whose JavaScript connects to the data nodes 
directly and computes this info on demand. What is the rationale for 
aggregating this on the Namenode all the time? Why push rather than pull the 
data when we need it?
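
To make the pull option concrete, here is a minimal sketch of on-demand 
aggregation. It is written in Python purely for illustration (the page 
suggested above would do the same thing in JavaScript). It assumes each 
datanode serves its metrics as JSON at /jmx, that the counters of interest 
live in a bean whose name contains "DataNodeActivity", and that they are 
called BytesRead and BytesWritten. Those names, and the helpers fetch_jmx and 
aggregate, are assumptions for this sketch and are not taken from the patch.

```python
import json
from urllib.request import urlopen

# Assumed counter names; the issue description mentions bytesRead and bytesWritten.
METRICS = ("BytesRead", "BytesWritten")


def fetch_jmx(base_url, timeout=5.0):
    """Fetch the JSON document from a datanode's /jmx endpoint (assumed path)."""
    with urlopen(f"{base_url}/jmx", timeout=timeout) as resp:
        return json.load(resp)


def aggregate(datanode_urls, fetch=fetch_jmx):
    """Sum the selected counters across datanodes, pulling only when called."""
    totals = {name: 0 for name in METRICS}
    unreachable = []
    for url in datanode_urls:
        try:
            doc = fetch(url)
        except OSError:
            # A dead or slow datanode is recorded and skipped, not fatal.
            unreachable.append(url)
            continue
        for bean in doc.get("beans", []):
            # Assumption: activity counters sit in a "DataNodeActivity" bean.
            if "DataNodeActivity" not in bean.get("name", ""):
                continue
            for name in METRICS:
                totals[name] += int(bean.get(name, 0))
    return totals, unreachable
```

Nothing here runs on the heartbeat path: the cost is paid only when someone 
asks for the numbers, and an unreachable datanode just shows up in the second 
return value.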



> Aggregate usage statistics from datanodes
> -----------------------------------------
>
>                 Key: HDFS-13055
>                 URL: https://issues.apache.org/jira/browse/HDFS-13055
>             Project: Hadoop HDFS
>          Issue Type: Bug
>            Reporter: Ajay Kumar
>            Assignee: Ajay Kumar
>            Priority: Major
>         Attachments: HDFS-13055.001.patch
>
>
> We collect a variety of statistics in DataNodes and expose them via JMX. 
> Aggregating some of the high-level statistics that we already collect in 
> {{DataNodeMetrics}} (such as bytesRead and bytesWritten) over a configurable 
> time window will create a central repository accessible via JMX and the UI.




