Eelco Lempsink wrote:
I'm not trying to run it on a cluster though, only on one host with
multiple CPUs. So I guess the local filesystem is shared and therefore
it should be fine.
Yes, that should be fine.
However, if I try with fs.default.name set to "file:///tmp/hadoop-test/"
still nothing happens.
To provide some more info, the TaskRunner keeps repeating this:
INFO org.apache.hadoop.mapred.TaskRunner: task_0001_r_000001_0 Need 12 map output(s)
INFO org.apache.hadoop.mapred.TaskRunner: task_0001_r_000001_0 Need 12 map output location(s)
INFO org.apache.hadoop.mapred.TaskRunner: task_0001_r_000001_0 Got 0 new map outputs from jobtracker and 0 map outputs from previous failures
INFO org.apache.hadoop.mapred.TaskRunner: task_0001_r_000001_0 Got 0 known map output location(s); scheduling...
INFO org.apache.hadoop.mapred.TaskRunner: task_0001_r_000001_0 Scheduled 0 of 0 known outputs (0 slow hosts and 0 dup hosts)
I'm unsure if this is a bug or a misconfiguration.
Misconfiguration would be my first guess, since lots of folks (including
the unit tests) run multiple daemons on a single host. This message
looks like the reduce tasks are starting, but no map tasks are yet
complete. That could be because the map tasks are somehow stuck, or for
some other reason. It's hard to say.
Doug