Hello! I'm having trouble following your tutorial on Getting started with hadoop [GSWH] (from http://wiki.apache.org/lucene-hadoop/GettingStartedWithHadoop). Is it not up to date?
I'm using hadoop-0.12.2 that i've just downloaded today... The GSWH guide says: "Set the variable fs.default.name to the Namenode's intended host:port", but the defaults in hadoop-default.xml say it is file:/// and describe it as "The name of the default file system. A URI whose scheme and authority determine the FileSystem implementation" so I left it has file:///... Next it says: "Set the variable mapred.job.tracker to the jobtrackers intended host:port" and in the description from hadoop-default.xml I read: "The host and port that the MapReduce job tracker runs at. If "local", then jobs are run in-process as a single map and reduce task." so I left it as "local". Then i changed the following data paths: dfs.name.dir, dfs.data.dir, dfs.client.buffer.dir and mapred.local.dir to use a directory i specify instead of $(hadoop.tmp.dir). I "ensure that the dfs.name.dir directory exists" and format the namenode with '$HADOOP_INSTALL/hadoop/bin/hadoop namenode -format'... all goes well (or at least i get no errors) When i try to start my single node cluster with: '$HADOOP_INSTALL/hadoop/bin/start-all.sh' I get the following output: $ hadoop/bin/start-all.sh starting namenode, logging to /home/ciclope/hadoop-install/hadoop/bin/../logs/hadoop-ciclope-namenode-ps-guedes-desktop.out [EMAIL PROTECTED]'s password: localhost: starting datanode, logging to /home/ciclope/hadoop-install/hadoop/bin/../logs/hadoop-ciclope-datanode-ps-guedes-desktop.out [EMAIL PROTECTED]'s password: localhost: starting secondarynamenode, logging to /home/ciclope/hadoop-install/hadoop/bin/../logs/hadoop-ciclope-secondarynamenode-ps-guedes-desktop.out localhost: Exception in thread "main" java.lang.IllegalArgumentException: port out of range:-1 localhost: at java.net.InetSocketAddress.<init>(InetSocketAddress.java:118) localhost: at org.apache.hadoop.dfs.DataNode.createSocketAddr(DataNode.java:106) localhost: at org.apache.hadoop.dfs.SecondaryNameNode.<init>(SecondaryNameNode.java:93) localhost: at org.apache.hadoop.dfs.SecondaryNameNode.main(SecondaryNameNode.java:474) starting jobtracker, logging to /home/ciclope/hadoop-install/hadoop/bin/../logs/hadoop-ciclope-jobtracker-ps-guedes-desktop.out [EMAIL PROTECTED]'s password: localhost: starting tasktracker, logging to /home/ciclope/hadoop-install/hadoop/bin/../logs/hadoop-ciclope-tasktracker-ps-guedes-desktop.out and from the logs: ERROR org.apache.hadoop.dfs.NameNode: java.lang.IllegalArgumentException: port out of range:-1 ERROR org.apache.hadoop.dfs.DataNode: java.lang.IllegalArgumentException: port out of range:-1 ERROR org.apache.hadoop.mapred.JobTracker: java.lang.RuntimeException: Bad mapred.job.tracker: local ERROR org.apache.hadoop.mapred.TaskTracker: Can not start task tracker because java.lang.RuntimeException: Bad mapred.job.tracker: local and finally for the secondary namenode the erros goes to the .out file: java.lang.IllegalArgumentException: port out of range:-1 humm... this must be a really stupid mistake... so i'm either being really dumb or the Guide isn't very well specified... probably both! Can anybody help? If it's a problem with the wiki i'm very much willing to help fix it... Thanks in advance, Pedro Guedes
