OK, so all nodes are configured the same except for master/slave differences, and they are all running HDFS. All daemons seem to be running when I do a start-all.sh from the master. However, the master Map/Reduce Administration page shows only two live nodes; the HDFS page shows 3.

Looking at the log files on the new slave node I see no outright errors, but I do see the following in the tasktracker log file. All machines have 8 GB of memory. I think the important part below is that the TaskTracker's totalMemoryAllottedForTasks is -1. I've searched for others with this problem but haven't found anything matching my case, which is just trying to start up; no tasks have been run.
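For what it's worth, my understanding (please correct me if I'm wrong) is that totalMemoryAllottedForTasks is computed from the per-slot memory settings, which default to -1; when they are unset the TaskMemoryManager is simply disabled, so the WARN by itself is usually benign. If you did want memory monitoring enabled, a sketch of the relevant mapred-site.xml properties might look like this (the 2048/4096 values are purely illustrative for an 8 GB node, not recommendations):

```xml
<!-- mapred-site.xml: illustrative slot-memory settings only -->
<!-- per-slot memory for map and reduce slots -->
<property>
  <name>mapred.cluster.map.memory.mb</name>
  <value>2048</value>
</property>
<property>
  <name>mapred.cluster.reduce.memory.mb</name>
  <value>2048</value>
</property>
<!-- upper bounds a single job may request per task -->
<property>
  <name>mapred.cluster.max.map.memory.mb</name>
  <value>4096</value>
</property>
<property>
  <name>mapred.cluster.max.reduce.memory.mb</name>
  <value>4096</value>
</property>
```

With all four left at their default of -1, the TaskTracker logs exactly the warning shown below, so I don't think that line explains the missing node.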

2012-05-24 11:20:46,786 INFO org.apache.hadoop.mapred.TaskTracker: Starting tracker tracker_occam3:localhost/127.0.0.1:45700
2012-05-24 11:20:46,792 INFO org.apache.hadoop.mapred.TaskTracker: Starting thread: Map-events fetcher for all reduce tasks on tracker_occam3:localhost/127.0.0.1:45700
2012-05-24 11:20:46,792 INFO org.apache.hadoop.mapred.TaskTracker: Using ResourceCalculatorPlugin : org.apache.hadoop.util.LinuxResourceCalculatorPlugin@5abd09e8
2012-05-24 11:20:46,795 WARN org.apache.hadoop.mapred.TaskTracker: TaskTracker's totalMemoryAllottedForTasks is -1. TaskMemoryManager is disabled.
2012-05-24 11:20:46,795 INFO org.apache.hadoop.mapred.IndexCache: IndexCache created with max memory = 10485760
2012-05-24 11:20:46,800 INFO org.apache.hadoop.mapred.TaskTracker: Shutting down: Map-events fetcher for all reduce tasks on tracker_occam3:localhost/127.0.0.1:45700
2012-05-24 11:20:46,800 INFO org.apache.hadoop.filecache.TrackerDistributedCacheManager: Cleanup...
java.lang.InterruptedException: sleep interrupted
    at java.lang.Thread.sleep(Native Method)
    at org.apache.hadoop.filecache.TrackerDistributedCacheManager$CleanupThread.run(TrackerDistributedCacheManager.java:926)
2012-05-24 11:20:46,900 INFO org.apache.hadoop.ipc.Server: Stopping server on 45700
2012-05-24 11:20:46,901 INFO org.apache.hadoop.ipc.Server: IPC Server handler 3 on 45700: exiting
2012-05-24 11:20:46,901 INFO org.apache.hadoop.ipc.Server: IPC Server handler 1 on 45700: exiting
2012-05-24 11:20:46,902 INFO org.apache.hadoop.ipc.Server: IPC Server handler 2 on 45700: exiting
2012-05-24 11:20:46,902 INFO org.apache.hadoop.ipc.Server: Stopping IPC Server listener on 45700
2012-05-24 11:20:46,901 INFO org.apache.hadoop.ipc.Server: IPC Server handler 0 on 45700: exiting
2012-05-24 11:20:46,904 INFO org.apache.hadoop.ipc.metrics.RpcInstrumentation: shut down
2012-05-24 11:20:46,904 INFO org.apache.hadoop.mapred.TaskTracker: Shutting down StatusHttpServer
2012-05-24 11:20:46,904 INFO org.apache.hadoop.ipc.Server: IPC Server handler 7 on 45700: exiting
2012-05-24 11:20:46,903 INFO org.apache.hadoop.ipc.Server: IPC Server handler 6 on 45700: exiting
2012-05-24 11:20:46,903 INFO org.apache.hadoop.ipc.Server: IPC Server handler 4 on 45700: exiting
2012-05-24 11:20:46,904 INFO org.apache.hadoop.ipc.Server: IPC Server handler 5 on 45700: exiting
2012-05-24 11:20:46,904 INFO org.apache.hadoop.ipc.Server: Stopping IPC Server Responder
2012-05-24 11:20:46,909 INFO org.mortbay.log: Stopped SelectChannelConnector@0.0.0.0:50060



On 5/23/12 3:55 PM, James Warren wrote:
Hi Pat -

The setting for hadoop.tmp.dir is used both locally and on HDFS and
therefore should be consistent across your cluster.

http://stackoverflow.com/questions/2354525/what-should-be-hadoop-tmp-dir

cheers,
-James

On Wed, May 23, 2012 at 3:44 PM, Pat Ferrel <p...@occamsmachete.com> wrote:

I have a two-machine cluster and am adding a new machine. The new node has
a different location for hadoop.tmp.dir than the other two nodes, and the
datanode refuses to start when the node is started as part of the cluster.
When I change hadoop.tmp.dir to point to the same location on all machines,
it starts up fine everywhere.

Shouldn't I be able to have the master and slave1 set as:
<property>
<name>hadoop.tmp.dir</name>
<value>/app/hadoop/tmp</value>
<description>A base for other temporary directories.</description>
</property>

And slave2 set as:
<property>
<name>hadoop.tmp.dir</name>
<value>/media/d2/app/hadoop/tmp</value>
<description>A base for other temporary directories.</description>
</property>

Shouldn't that work? Slave2 runs standalone in single-node mode just fine. Using 0.20.205.
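One possible workaround, if the goal is just to put slave2's storage on /media/d2: keep hadoop.tmp.dir identical on every node and instead override the specific local-storage directories on slave2 only. As I understand it, dfs.data.dir and mapred.local.dir merely default to paths under ${hadoop.tmp.dir}, so they can be pointed elsewhere per node. A sketch (the exact subdirectory names here are my assumption, mirroring the defaults):

```xml
<!-- hdfs-site.xml on slave2 only: move block storage without touching hadoop.tmp.dir -->
<property>
  <name>dfs.data.dir</name>
  <value>/media/d2/app/hadoop/dfs/data</value>
</property>
```

```xml
<!-- mapred-site.xml on slave2 only: move the TaskTracker's local dir likewise -->
<property>
  <name>mapred.local.dir</name>
  <value>/media/d2/app/hadoop/mapred/local</value>
</property>
```

That keeps anything derived from hadoop.tmp.dir consistent cluster-wide while still using the larger disk on the new node.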
