Yongjun Zhang created HDFS-11974:
------------------------------------

             Summary: Fsimage transfer failed due to socket timeout, but logs 
don't show that
                 Key: HDFS-11974
                 URL: https://issues.apache.org/jira/browse/HDFS-11974
             Project: Hadoop HDFS
          Issue Type: Bug
            Reporter: Yongjun Zhang
            Assignee: Yongjun Zhang


The idea of HDFS-11914 is to add more diagnostic information to understand what 
happened when we saw

{code}
WARN org.apache.hadoop.security.UserGroupInformation: 
PriviledgedActionException as:hdfs (auth:SIMPLE) cause:java.io.IOException: 
File http://x.y.z:50070/imagetransfer?getimage=1&txid=latest received length 
xyz is not of the advertised size abc.
{code}

After further study, I realized that the above exception is thrown in the 
{{finally}} block of the {{TransferFsImage#receiveFile}} method, so any 
exception thrown in the main code, such as a socket timeout, is masked and 
never reported.

We should include information about the exception thrown in the main code 
when throwing an exception from the {{finally}} block.
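
The masking behaviour described above can be reproduced with a small 
standalone sketch. This is not the actual {{TransferFsImage#receiveFile}} code: 
the class {{FinallyMaskingSketch}}, the helper {{readBody}}, and the methods 
{{receiveMasking}} and {{receiveReporting}} are hypothetical names used only for 
illustration. Attaching the original exception as the cause is one possible 
way to keep it visible, assuming a size check that lives in {{finally}}.

```java
import java.io.IOException;
import java.net.SocketTimeoutException;

public class FinallyMaskingSketch {

    // Hypothetical stand-in for the network read; not the real HDFS code.
    static long readBody(boolean simulateTimeout) throws IOException {
        if (simulateTimeout) {
            throw new SocketTimeoutException("Read timed out");
        }
        return 100L; // pretend 100 bytes were received
    }

    // Problem pattern: the size check in finally throws a new exception,
    // which replaces the exception already propagating from the try block.
    static void receiveMasking(long advertisedSize, boolean simulateTimeout)
            throws IOException {
        long received = 0;
        try {
            received = readBody(simulateTimeout);
        } finally {
            if (received != advertisedSize) {
                throw new IOException("received length " + received
                        + " is not of the advertised size " + advertisedSize);
            }
        }
    }

    // One possible fix: remember the main-code exception and attach it as
    // the cause of the exception thrown from finally.
    static void receiveReporting(long advertisedSize, boolean simulateTimeout)
            throws IOException {
        long received = 0;
        IOException primary = null;
        try {
            received = readBody(simulateTimeout);
        } catch (IOException e) {
            primary = e; // keep a reference for the finally block
            throw e;
        } finally {
            if (received != advertisedSize) {
                throw new IOException("received length " + received
                        + " is not of the advertised size " + advertisedSize,
                        primary);
            }
        }
    }

    public static void main(String[] args) {
        try {
            receiveMasking(100L, true);
        } catch (IOException e) {
            System.out.println("masking:   " + e.getMessage()
                    + " / cause=" + e.getCause());
        }
        try {
            receiveReporting(100L, true);
        } catch (IOException e) {
            System.out.println("reporting: " + e.getMessage()
                    + " / cause=" + e.getCause());
        }
    }
}
```

In the first call the timeout is lost entirely; in the second it travels along 
as the cause, so it would show up in a logged stack trace.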





