Yongjun Zhang created HDFS-11914:
------------------------------------
Summary: Add more diagnosis info for fsimage transfer failure.
Key: HDFS-11914
URL: https://issues.apache.org/jira/browse/HDFS-11914
Project: Hadoop HDFS
Issue Type: Bug
Reporter: Yongjun Zhang
Assignee: Yongjun Zhang
Hit an fsimage download problem:
The client tried to download the fsimage and got:
WARN org.apache.hadoop.security.UserGroupInformation:
PriviledgedActionException as:hdfs (auth:SIMPLE) cause:java.io.IOException:
File http://x.y.z:50070/imagetransfer?getimage=1&txid=latest received length
xyz is not of the advertised size abc.
Basically, the client did not receive enough fsimage data and finished
prematurely without any exception being thrown, until it found that the size of
the data received was smaller than expected. The client then closed the
connection to the NN, which caused the NN to report
INFO org.apache.hadoop.hdfs.server.namenode.TransferFsImage: Connection closed
by client
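To illustrate the client-side symptom described above, here is a minimal sketch of a length check that only fails after the stream has ended. The class name AdvertisedLengthCheck and the method receive are hypothetical and are not taken from the Hadoop source; the sketch only assumes that the client compares the number of bytes it read with the size the server advertised.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;

// Hypothetical illustration only; not the actual TransferFsImage code.
class AdvertisedLengthCheck {
    /**
     * Reads the stream to EOF and returns the number of bytes read.
     * Throws only after EOF, if the count differs from the advertised size.
     */
    static long receive(InputStream in, long advertisedSize, String url)
            throws IOException {
        byte[] buf = new byte[4096];
        long received = 0;
        int n;
        // A short stream ends the loop normally; no exception is raised here.
        while ((n = in.read(buf)) != -1) {
            received += n;
        }
        if (received != advertisedSize) {
            throw new IOException("File " + url + " received length " + received
                + " is not of the advertised size " + advertisedSize);
        }
        return received;
    }

    public static void main(String[] args) {
        byte[] truncated = new byte[10]; // server advertised 20, sent 10
        try {
            receive(new ByteArrayInputStream(truncated), 20, "http://x.y.z/image");
        } catch (IOException e) {
            System.out.println(e.getMessage());
        }
    }
}
```

The point of the sketch is that the read loop itself cannot tell a truncated transfer from a complete one, so the only visible error is the size mismatch reported at the end.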
This jira is to add more information to the logs to help debug the
situation. Specifically, report the stack trace when the connection is closed,
and how much data has been sent at that point, etc.
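A rough sketch of the kind of diagnostics proposed above might look like the following. It is not the actual patch: the class CountingImageSender and the method send are hypothetical, and java.util.logging stands in for whatever logging the NameNode really uses. It assumes the sender can count bytes as it copies them, and that logging the caught exception is enough to capture the stack trace.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.util.logging.Level;
import java.util.logging.Logger;

// Hypothetical illustration only; not the actual TransferFsImage code.
class CountingImageSender {
    private static final Logger LOG =
        Logger.getLogger(CountingImageSender.class.getName());

    /**
     * Copies in to out, counting bytes fully handed to the output stream.
     * On failure, logs the stack trace and the byte count, then rethrows
     * with the count in the message and the original exception as the cause.
     */
    static long send(InputStream in, OutputStream out, long totalSize)
            throws IOException {
        byte[] buf = new byte[4096];
        long sent = 0;
        try {
            int n;
            while ((n = in.read(buf)) != -1) {
                out.write(buf, 0, n);
                sent += n; // counted only after the write call returns
            }
            out.flush();
            return sent;
        } catch (IOException e) {
            String msg = "Transfer aborted after sending " + sent
                + " of " + totalSize + " bytes";
            // Passing the exception to the logger records its stack trace.
            LOG.log(Level.WARNING, msg, e);
            throw new IOException(msg, e);
        }
    }
}
```

With something along these lines, the server-side log would show both where the connection was closed and how far the transfer had progressed, instead of only "Connection closed by client".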