Wei-Chiu Chuang created HDFS-14276:
--------------------------------------

             Summary: [SBN read] Reduce tailing overhead
                 Key: HDFS-14276
                 URL: https://issues.apache.org/jira/browse/HDFS-14276
             Project: Hadoop HDFS
          Issue Type: Improvement
          Components: ha, namenode
    Affects Versions: 3.3.0
         Environment: Hardware: 4-node cluster, each node has 4 cores, Xeon 
2.5 GHz, 25 GB memory.
Software: CentOS 7.4, CDH 6.0 + Consistent Reads from Standby, Kerberos, SSL, 
RPC encryption + Data Transfer Encryption.
            Reporter: Wei-Chiu Chuang
            Assignee: Wei-Chiu Chuang
         Attachments: Screen Shot 2019-02-12 at 10.51.41 PM.png

When an Observer sets {{dfs.ha.tail-edits.period}} = {{0ms}}, it tails the edit 
log continuously in order to fetch the latest edits, but there is a lot of 
overhead in doing so.

Critically, the edit log tailer should _not_ update the NameDirSize metric 
every time it tails. The metric has nothing to do with fetching edits, and 
updating it involves a lot of directory space calculation.

The profiler suggests that a non-trivial chunk of time is spent for nothing.
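
One way to decouple the metric update from the tailing loop is to rate-limit 
it. The following is only a minimal sketch of that idea, not the actual fix: 
{{IntervalGate}} and {{runIfDue}} are hypothetical names that do not exist in 
Hadoop, and the interval value would have to be chosen separately.

```java
import java.util.function.LongSupplier;

/** Hypothetical helper (not part of Hadoop): runs a task at most once per interval. */
final class IntervalGate {
  private final long minIntervalNanos;
  private final LongSupplier clock; // injectable so the gate can be tested without sleeping
  private long lastRunNanos;
  private boolean hasRun = false;

  IntervalGate(long minIntervalNanos, LongSupplier clock) {
    this.minIntervalNanos = minIntervalNanos;
    this.clock = clock;
  }

  /** Runs the task on the first call, or once the interval has elapsed; returns whether it ran. */
  synchronized boolean runIfDue(Runnable task) {
    long now = clock.getAsLong();
    if (hasRun && now - lastRunNanos < minIntervalNanos) {
      return false; // too soon: skip the expensive work on this pass
    }
    hasRun = true;
    lastRunNanos = now;
    task.run();
    return true;
  }
}
```

A tailing loop could then call {{gate.runIfDue(...)}} with the metric update on 
every pass, using {{System::nanoTime}} as the clock, so that the directory size 
calculation runs only occasionally while edits are still fetched continuously.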

Other than this, the biggest overhead is in serializing and deserializing the 
messages exchanged with the JNs. I am looking for ways to reduce this cost 
because it is burning 30% of my CPU time even when the cluster is idle.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
