[ 
https://issues.apache.org/jira/browse/HDFS-11914?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16037329#comment-16037329
 ] 

Wei-Chiu Chuang commented on HDFS-11914:
----------------------------------------

LGTM +1. Thanks

> Add more diagnosis info for fsimage transfer failure.
> -----------------------------------------------------
>
>                 Key: HDFS-11914
>                 URL: https://issues.apache.org/jira/browse/HDFS-11914
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>            Reporter: Yongjun Zhang
>            Assignee: Yongjun Zhang
>              Labels: supportability
>         Attachments: HDFS-11914.001.patch, HDFS-11914.002.patch, 
> HDFS-11914.003.patch, HDFS-11914.004.patch
>
>
> Hit a fsimage download problem:
> Client tries to download fsimage, and got:
>  WARN org.apache.hadoop.security.UserGroupInformation: 
> PriviledgedActionException as:hdfs (auth:SIMPLE) cause:java.io.IOException: 
> File http://x.y.z:50070/imagetransfer?getimage=1&txid=latest received length 
> xyz is not of the advertised size abc.
> Basically client does not get enough fsimage data and finished prematurely 
> without any exception thrown, until it finds the size of data received is 
> smaller than expected. The client then closed the conenction to NN, that 
> caused NN to report
> INFO org.apache.hadoop.hdfs.server.namenode.TransferFsImage: Connection 
> closed by client
> This jira is to add some more information in logs to help debugging the 
> sitaution. Specifically, report the stack trace when the connection is 
> closed. And how much data has been sent at that point. etc.
>  



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org

Reply via email to