Issue with getTaskTrackers

Ken Krugler Fri, 16 Apr 2010 10:00:56 -0700

Hi all,

I'm running 0.19.2 in EC2, and running into an occasional problem withClusterStatus.getTaskTrackers().

The call to getTaskTrackers() is being made in the job jar's mainfunction, before the job starts running I need to control some aspectsof my job, for example setting the number of reduce tasks to beexactly equal to the number of servers, which should be equal to thenumber of task trackers.

Every so often (currently < 5%) the call to getTaskTrackers() willreturn a value less than expected - e.g. 2 instead of 6. This happenseven when ClusterStatus.getJobTrackerState() returns State.RUNNING.

I'm assuming the problem is that some of the task trackers are takingextra time to spin up. I saw HADOOP-5337 (https://issues.apache.org/jira/browse/HADOOP-5337), which seems related, though that's for restarts vs. initial startup.

Given that the JobTracker waits for slaves to self-report, theredoesn't seem to be a totally reliable, automatic solution to thisissue, but I thought I'd ask to see if there's something I'm missing.


Thanks,

-- Ken

--------------------------------------------
Ken Krugler
+1 530-210-6378
http://bixolabs.com
e l a s t i c   w e b   m i n i n g

Issue with getTaskTrackers

Reply via email to