[
https://issues.apache.org/jira/browse/HDFS-11914?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Yongjun Zhang updated HDFS-11914:
---------------------------------
Attachment: HDFS-11914.003.patch
> Add more diagnosis info for fsimage transfer failure.
> -----------------------------------------------------
>
> Key: HDFS-11914
> URL: https://issues.apache.org/jira/browse/HDFS-11914
> Project: Hadoop HDFS
> Issue Type: Improvement
> Reporter: Yongjun Zhang
> Assignee: Yongjun Zhang
> Labels: supportability
> Attachments: HDFS-11914.001.patch, HDFS-11914.002.patch,
> HDFS-11914.003.patch
>
>
> Hit a fsimage download problem:
> Client tries to download fsimage, and got:
> WARN org.apache.hadoop.security.UserGroupInformation:
> PriviledgedActionException as:hdfs (auth:SIMPLE) cause:java.io.IOException:
> File http://x.y.z:50070/imagetransfer?getimage=1&txid=latest received length
> xyz is not of the advertised size abc.
> Basically client does not get enough fsimage data and finished prematurely
> without any exception thrown, until it finds the size of data received is
> smaller than expected. The client then closed the conenction to NN, that
> caused NN to report
> INFO org.apache.hadoop.hdfs.server.namenode.TransferFsImage: Connection
> closed by client
> This jira is to add some more information in logs to help debugging the
> sitaution. Specifically, report the stack trace when the connection is
> closed. And how much data has been sent at that point. etc.
>
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]