Colin Patrick McCabe created HDFS-7185:
------------------------------------------

             Summary: The active NameNode will not accept an fsimage sent from 
the standby during rolling upgrade
                 Key: HDFS-7185
                 URL: https://issues.apache.org/jira/browse/HDFS-7185
             Project: Hadoop HDFS
          Issue Type: Bug
    Affects Versions: 2.6.0
            Reporter: Colin Patrick McCabe


The active NameNode will not accept an fsimage sent from the standby during 
rolling upgrade.  The active fails with the exception:
{code}
18:25:07,620  WARN ImageServlet:198 - Received an invalid request file transfer 
request from a secondary with storage info 
-59:65195028:0:CID-385de4d7-64e4-4dde-9f5d-0a6e431987f6
18:25:07,620  WARN log:76 - Committed before 410 PutImage failed. 
java.io.IOException: This namenode has storage info 
-55:65195028:0:CID-385de4d7-64e4-4dde-9f5d-0a6e431987f6 but the secondary 
expected -59:65195028:0:CID-385de4d7-64e4-4dde-9f5d-
0a6e431987f6
        at 
org.apache.hadoop.hdfs.server.namenode.ImageServlet.validateRequest(ImageServlet.java:200)
        at 
org.apache.hadoop.hdfs.server.namenode.ImageServlet.doPut(ImageServlet.java:443)
        at javax.servlet.http.HttpServlet.service(HttpServlet.java:730)
{code}

On the standby, the exception is:
{code}
java.io.IOException: Exception during image upload: 
org.apache.hadoop.hdfs.server.namenode.TransferFsImage$HttpPutFailedException: 
This namenode has storage info 
-55:65195028:0:CID-385de4d7-64e4-4dde-9f5d-0a6e431987f6 but the secondary 
expected
 -59:65195028:0:CID-385de4d7-64e4-4dde-9f5d-0a6e431987f6
        at 
org.apache.hadoop.hdfs.server.namenode.ha.StandbyCheckpointer.doCheckpoint(StandbyCheckpointer.java:218)
        at 
org.apache.hadoop.hdfs.server.namenode.ha.StandbyCheckpointer.access$1400(StandbyCheckpointer.java:62)
{code}

This seems to be a consequence of the fact that the VERSION file still is at 
-55 (the old version) even after the rolling upgrade has started.  When the 
rolling upgrade is finalized with {{hdfs dfsadmin -rollingUpgrade finalize}}, 
both VERSION files get set to the new version, and the problem goes away.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to