[
https://issues.apache.org/jira/browse/HADOOP-1212?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12619977#action_12619977
]
Jared Stehler commented on HADOOP-1212:
---------------------------------------
I have a more elegant work-around which doesn't involve deleting the data
folders: edit the <hadoop-data-root>/dfs/data/current/VERSION file, changing
the namespaceID to match the current namenode:
$ cat /lv_main/hadoop/dfs/data/current/VERSION
#Fri Aug 01 18:40:43 UTC 2008
namespaceID=292609117
storageID=DS-1525930547-66.135.42.149-50010-1217002151282
cTime=0
storageType=DATA_NODE
layoutVersion=-11
This allowed me to bring up the slave datanode and have it recognized by the
namenode in the DFS UI.
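
For anyone who would rather script the edit than do it by hand, here is a
minimal sketch in Python. It is not part of Hadoop: the function name
set_namespace_id and the ".bak" backup are assumptions made for illustration,
and the name-node's namespaceID has to be looked up separately and passed in.
It is probably safest to run it only while the data-node is stopped.

```python
import re
from pathlib import Path


def set_namespace_id(version_file, new_id):
    """Rewrite the namespaceID line of a data-node VERSION file in place.

    Hypothetical helper, not a Hadoop API. Returns the previous
    namespaceID as a string so the change can be logged or undone.
    """
    path = Path(version_file)
    text = path.read_text()

    # The VERSION file is a list of key=value lines; find the one we need.
    match = re.search(r"^namespaceID=(-?\d+)$", text, flags=re.MULTILINE)
    if match is None:
        raise ValueError(f"no namespaceID line found in {path}")
    old_id = match.group(1)

    # Keep a copy of the original next to it (assumed naming: VERSION.bak).
    path.with_name(path.name + ".bak").write_text(text)

    # Replace only the numeric value; every other line stays untouched.
    new_text = text[:match.start(1)] + str(new_id) + text[match.end(1):]
    path.write_text(new_text)
    return old_id
```

Usage would look like set_namespace_id("/lv_main/hadoop/dfs/data/current/VERSION",
292609117), where the second argument is whatever namespaceID the current
name-node reports.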
> Data-nodes should be formatted when the name-node is formatted.
> ---------------------------------------------------------------
>
> Key: HADOOP-1212
> URL: https://issues.apache.org/jira/browse/HADOOP-1212
> Project: Hadoop Core
> Issue Type: Bug
> Components: dfs
> Affects Versions: 0.12.2
> Reporter: Konstantin Shvachko
>
> The upgrade feature HADOOP-702 requires data-nodes to store the namespaceID
> persistently in their version files and to verify during startup that it
> matches the one stored on the name-node.
> When the name-node is reformatted, it generates a new namespaceID.
> If the cluster then starts with the reformatted name-node but with
> data-nodes that were not reformatted, the data-nodes will fail with
> java.io.IOException: Incompatible namespaceIDs ...
> Data-nodes should be reformatted whenever the name-node is. I see 2
> approaches here:
> 1) In order to reformat the cluster we call "start-dfs -format" or make a
> special script "format-dfs".
> This would format all the cluster components together. The open question is
> whether it should start the cluster after formatting.
> 2) Format the name-node only. When data-nodes connect to the name-node, it
> will tell them to format their storage directories if it sees that the
> namespace is empty and its cTime=0.
> The drawback of this approach is that we can lose the blocks of a data-node
> from another cluster if it connects by mistake to the empty name-node.
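
The start-up check described in the quoted issue can be sketched roughly as
follows. This is a Python illustration under stated assumptions, not the
actual Hadoop code: the names read_version, check_namespace and
IncompatibleNamespaceError are hypothetical, and the detail text after
"Incompatible namespaceIDs" is made up because the issue elides the real
message.

```python
class IncompatibleNamespaceError(IOError):
    """Hypothetical stand-in for the IOException quoted in the issue."""


def read_version(text):
    """Parse VERSION-style 'key=value' lines, skipping '#' comment lines."""
    props = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        key, _, value = line.partition("=")
        props[key] = value
    return props


def check_namespace(datanode_version_text, namenode_namespace_id):
    """Return the stored namespaceID, or raise if it differs from the name-node's."""
    stored = int(read_version(datanode_version_text)["namespaceID"])
    if stored != namenode_namespace_id:
        # Illustrative wording only; the source shows just "Incompatible namespaceIDs ...".
        raise IncompatibleNamespaceError(
            f"Incompatible namespaceIDs (name-node {namenode_namespace_id}, "
            f"data-node {stored})"
        )
    return stored
```

A reformatted name-node would present a new ID, so a data-node still holding
the old VERSION file would take the error branch here, which is the failure
the issue describes.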