Sorting ...

Mark question Sat, 21 May 2011 18:22:22 -0700

I'm trying to sort Sequence files using the Hadoop-Example TeraSort. But
after taking a couple of minutes .. output is empty.


HDFS has the following Sequence files:
-rw-r--r--   1 Hadoop supergroup  196113760 2011-05-21 12:16
/user/Hadoop/out/part-00000
-rw-r--r--   1 Hadoop supergroup  250935096 2011-05-21 12:16
/user/Hadoop/out/part-00001
-rw-r--r--   1 Hadoop supergroup  262943648 2011-05-21 12:17
/user/Hadoop/out/part-00002
-rw-r--r--   1 Hadoop supergroup  114888492 2011-05-21 12:17
/user/Hadoop/out/part-00003

After running:  hadoop jar hadoop-mapred-examples-0.21.0.jar terasort out
sorted
Error is:
   ............
11/05/21 18:13:12 INFO mapreduce.Job:  map 74% reduce 20%
11/05/21 18:13:14 INFO mapreduce.Job: Task Id :
attempt_201105202144_0039_m_000009_0, Status : FAILED
java.io.EOFException: read past eof

I'm trying to find what the input format for the TeraSort is, but it is not
specified.

Thanks for any thought,
Mark

Sorting ...

Reply via email to