Takuya Fukudome created HDFS-8771:
-------------------------------------
Summary: If IPCLoggerChannel#purgeLogsOlderThan takes too long,
Namenode could not send another RPC calls to Journalnodes
Key: HDFS-8771
URL: https://issues.apache.org/jira/browse/HDFS-8771
Project: Hadoop HDFS
Issue Type: Bug
Reporter: Takuya Fukudome
In our cluster, edits has became huge(about 50GB) accidentally and our
Jounalnodes' disks were busy, therefore {{purgeLogsOlderThan}} took more than
30secs. If {{IPCLoggerChannel#purgeLogsOlderThan}} takes too much time,
Namenode couldn't send other RPC calls to Journalnodes because
{{o.a.h.hdfs.qjournal.client.IPCLoggerChannel}}'s executor is single thread. It
will cause namenode shutting down.
I think IPCLoggerChannel#purgeLogsOlderThan should not block other RPC calls
like sendEdits.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)