Re: Ubuntu Single Node Tutorial failure. No live or dead nodes.

Nick Klosterman Wed, 10 Feb 2010 14:20:06 -0800

@E.Sammer, no I don't *think* that it is part of another cluster. Thetutorial is for a single node cluster just as a initial set up to see ifyou can get things up and running. I have reformatted the namenodeseveral times in my effort to get hadoop to work.


@abishek

I tried the workaround you pointed me to no avail. I tried to modifythose directions since in the single node implementation I didn't have adfs.data.dir in hdfs-site.xml


My attempts at further debug

-----------------------------------------ATTEMPT AT FIXING THE DATANODESPROBLEM

had...@potr134pc26:/usr/local/hadoop/bin$ rm -r/usr/local/hadoop-datastore/

----NOW THERE IS NO HADOOP-DATASTORE FOLDER LOCALLY
had...@potr134pc26:/usr/local/hadoop/bin$ ./hadoop namenode -format
10/02/10 16:33:50 INFO namenode.NameNode: STARTUP_MSG:
/************************************************************
STARTUP_MSG: Starting NameNode
STARTUP_MSG:   host = potr134pc26/127.0.0.1
STARTUP_MSG:   args = [-format]
STARTUP_MSG:   version = 0.20.1

STARTUP_MSG: build =http://svn.apache.org/repos/asf/hadoop/common/tags/release-0.20.1-rc1 -r810220; compiled by 'oom' on Tue Sep 1 20:55:56 UTC 2009

************************************************************/

Re-format filesystem in/home/hadoop/hadoop-datastore/hadoop-hadoop/dfs/name ? (Y or N) Y

10/02/10 16:33:54 INFO namenode.FSNamesystem: fsOwner=hadoop,hadoop
10/02/10 16:33:54 INFO namenode.FSNamesystem: supergroup=supergroup
10/02/10 16:33:54 INFO namenode.FSNamesystem: isPermissionEnabled=true

10/02/10 16:33:54 INFO common.Storage: Image file of size 96 saved in 0seconds.10/02/10 16:33:54 INFO common.Storage: Storage directory/home/hadoop/hadoop-datastore/hadoop-hadoop/dfs/name has been successfullyformatted.

10/02/10 16:33:54 INFO namenode.NameNode: SHUTDOWN_MSG:
/************************************************************
SHUTDOWN_MSG: Shutting down NameNode at potr134pc26/127.0.0.1
************************************************************/
had...@potr134pc26:/usr/local/hadoop/bin$ ./start-all.sh

starting namenode, logging to/usr/local/hadoop/bin/../logs/hadoop-hadoop-namenode-potr134pc26.outlocalhost: starting datanode, logging to/usr/local/hadoop/bin/../logs/hadoop-hadoop-datanode-potr134pc26.outlocalhost: starting secondarynamenode, logging to/usr/local/hadoop/bin/../logs/hadoop-hadoop-secondarynamenode-potr134pc26.outstarting jobtracker, logging to/usr/local/hadoop/bin/../logs/hadoop-hadoop-jobtracker-potr134pc26.outlocalhost: starting tasktracker, logging to/usr/local/hadoop/bin/../logs/hadoop-hadoop-tasktracker-potr134pc26.out


had...@potr134pc26:/usr/local/hadoop/bin$ jps
27461 Jps
27354 TaskTracker
27158 SecondaryNameNode
27250 JobTracker
26923 NameNode
had...@potr134pc26:/usr/local/hadoop/bin$ ./hadoop dfsadmin -report
Configured Capacity: 0 (0 KB)
Present Capacity: 0 (0 KB)
DFS Remaining: 0 (0 KB)
DFS Used: 0 (0 KB)
DFS Used%: %
Under replicated blocks: 0
Blocks with corrupt replicas: 0
Missing blocks: 0

-------------------------------------------------
Datanodes available: 0 (0 total, 0 dead)

---(((( AT THIS POINT WHEN I CHECKED THE LOG THE DATANODES STILL WASN'T UPAND RUNNING)----------

mkdir /usr/local/hadoop-datastore
had...@potr134pc26:/usr/local/hadoop/bin$ ./stop-all.sh
stopping jobtracker
localhost: stopping tasktracker
stopping namenode
localhost: no datanode to stop
localhost: stopping secondarynamenode
had...@potr134pc26:/usr/local/hadoop/bin$ ./start-all.sh

had...@potr134pc26:/usr/local/hadoop/bin$ jps
28038 NameNode
28536 Jps
28154 DataNode
28365 JobTracker
28470 TaskTracker
28272 SecondaryNameNode

./hadoop dfs -copyFromLocal /home/hadoop/Desktop/*.txt txtinput
copyFromLocal: `txtinput': specified destination directory doest not exist
had...@potr134pc26:/usr/local/hadoop/bin$ ./hadoop dfs -mkdir txtinput

had...@potr134pc26:/usr/local/hadoop/bin$ ./hadoop dfs -copyFromLocal/home/hadoop/Desktop/*.txt txtinput10/02/10 16:44:36 WARN hdfs.DFSClient: DataStreamer Exception:org.apache.hadoop.ipc.RemoteException: java.io.IOException: File/user/hadoop/txtinput/20417.txt could only be replicated to 0 nodes,instead of 1atorg.apache.hadoop.hdfs.server.namenode.FSNamesystem.getAdditionalBlock(FSNamesystem.java:1267)atorg.apache.hadoop.hdfs.server.namenode.NameNode.addBlock(NameNode.java:422)

        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)

atsun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)atsun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)

        at java.lang.reflect.Method.invoke(Method.java:597)
        at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:508)
        at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:959)
        at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:955)
        at java.security.AccessController.doPrivileged(Native Method)
        at javax.security.auth.Subject.doAs(Subject.java:396)
        at org.apache.hadoop.ipc.Server$Handler.run(Server.java:953)

        at org.apache.hadoop.ipc.Client.call(Client.java:739)
        at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:220)
        at $Proxy0.addBlock(Unknown Source)
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)

atsun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)atsun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)

        at java.lang.reflect.Method.invoke(Method.java:597)

atorg.apache.hadoop.io.retry.RetryInvocationHandler.invokeMethod(RetryInvocationHandler.java:82)atorg.apache.hadoop.io.retry.RetryInvocationHandler.invoke(RetryInvocationHandler.java:59)

        at $Proxy0.addBlock(Unknown Source)

atorg.apache.hadoop.hdfs.DFSClient$DFSOutputStream.locateFollowingBlock(DFSClient.java:2904)atorg.apache.hadoop.hdfs.DFSClient$DFSOutputStream.nextBlockOutputStream(DFSClient.java:2786)atorg.apache.hadoop.hdfs.DFSClient$DFSOutputStream.access$2000(DFSClient.java:2076)atorg.apache.hadoop.hdfs.DFSClient$DFSOutputStream$DataStreamer.run(DFSClient.java:2262)

10/02/10 16:44:36 WARN hdfs.DFSClient: Error Recovery for block null baddatanode[0] nodes == null10/02/10 16:44:36 WARN hdfs.DFSClient: Could not get block locations.Source file "/user/hadoop/txtinput/20417.txt" - Aborting...10/02/10 16:44:36 WARN hdfs.DFSClient: DataStreamer Exception:org.apache.hadoop.ipc.RemoteException: java.io.IOException: File/user/hadoop/txtinput/7ldvc10.txt could only be replicated to 0 nodes,instead of 1atorg.apache.hadoop.hdfs.server.namenode.FSNamesystem.getAdditionalBlock(FSNamesystem.java:1267)atorg.apache.hadoop.hdfs.server.namenode.NameNode.addBlock(NameNode.java:422)

        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)

atsun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)atsun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)

        at java.lang.reflect.Method.invoke(Method.java:597)
        at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:508)
        at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:959)
        at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:955)
        at java.security.AccessController.doPrivileged(Native Method)
        at javax.security.auth.Subject.doAs(Subject.java:396)
        at org.apache.hadoop.ipc.Server$Handler.run(Server.java:953)

        at org.apache.hadoop.ipc.Client.call(Client.java:739)
        at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:220)
        at $Proxy0.addBlock(Unknown Source)
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)

atsun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)atsun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)

        at java.lang.reflect.Method.invoke(Method.java:597)

atorg.apache.hadoop.io.retry.RetryInvocationHandler.invokeMethod(RetryInvocationHandler.java:82)atorg.apache.hadoop.io.retry.RetryInvocationHandler.invoke(RetryInvocationHandler.java:59)

        at $Proxy0.addBlock(Unknown Source)

10/02/10 16:44:36 WARN hdfs.DFSClient: Error Recovery for block null baddatanode[0] nodes == null10/02/10 16:44:36 WARN hdfs.DFSClient: Could not get block locations.Source file "/user/hadoop/txtinput/7ldvc10.txt" - Aborting...copyFromLocal: java.io.IOException: File /user/hadoop/txtinput/20417.txtcould only be replicated to 0 nodes, instead of 1atorg.apache.hadoop.hdfs.server.namenode.FSNamesystem.getAdditionalBlock(FSNamesystem.java:1267)atorg.apache.hadoop.hdfs.server.namenode.NameNode.addBlock(NameNode.java:422)

        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)

atsun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)atsun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)

        at java.lang.reflect.Method.invoke(Method.java:597)
        at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:508)
        at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:959)
        at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:955)
        at java.security.AccessController.doPrivileged(Native Method)
        at javax.security.auth.Subject.doAs(Subject.java:396)
        at org.apache.hadoop.ipc.Server$Handler.run(Server.java:953)

java.io.IOException: File /user/hadoop/txtinput/7ldvc10.txt could only bereplicated to 0 nodes, instead of 1atorg.apache.hadoop.hdfs.server.namenode.FSNamesystem.getAdditionalBlock(FSNamesystem.java:1267)atorg.apache.hadoop.hdfs.server.namenode.NameNode.addBlock(NameNode.java:422)

        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)

atsun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)atsun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)

        at java.lang.reflect.Method.invoke(Method.java:597)
        at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:508)
        at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:959)
        at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:955)
        at java.security.AccessController.doPrivileged(Native Method)
        at javax.security.auth.Subject.doAs(Subject.java:396)
        at org.apache.hadoop.ipc.Server$Handler.run(Server.java:953)


had...@potr134pc26:/usr/local/hadoop/bin$ ./hadoop dfsadmin -report
Configured Capacity: 0 (0 KB)
Present Capacity: 0 (0 KB)
DFS Remaining: 0 (0 KB)
DFS Used: 0 (0 KB)
DFS Used%: %
Under replicated blocks: 0
Blocks with corrupt replicas: 0
Missing blocks: 0

-------------------------------------------------
Datanodes available: 0 (0 total, 0 dead)

On Wed, 10 Feb 2010, E. Sammer wrote:

On 2/10/10 3:57 PM, Nick Klosterman wrote:
It appears I have incompatible namespaceIDs. Any thoughts on how to
resolve that?
This is what the full datanodes log is saying:
Was this data node part of a another DFS cluster at some point? It looks likeyou've reformatted the name node since the datanode connected to it. Thedatanode will refuse to connect to a namenode with a different namespaceIdbecause the the data node would have blocks (possibly with the same ids) fromanother cluster. It's a stop gap safety mechanism. You'd have to destroy thedata directory on the data node to "reinitialize" it so it picks up the newnamespaceId from the name node at which point it will be allowed to connect.
Just to be clear, this will also kill all data that was stored on the datanode, so don't do this lightly.
HTH.
--
Eric Sammer
[email protected]
http://esammer.blogspot.com

Re: Ubuntu Single Node Tutorial failure. No live or dead nodes.

Reply via email to