I'm trying to sort Sequence files using the Hadoop example TeraSort, but after running for a couple of minutes the output is empty.

HDFS has the following Sequence files:

    -rw-r--r-- 1 Hadoop supergroup 196113760 2011-05-21 12:16 /user/Hadoop/out/part-00000
    -rw-r--r-- 1 Hadoop supergroup 250935096 2011-05-21 12:16 /user/Hadoop/out/part-00001
    -rw-r--r-- 1 Hadoop supergroup 262943648 2011-05-21 12:17 /user/Hadoop/out/part-00002
    -rw-r--r-- 1 Hadoop supergroup 114888492 2011-05-21 12:17 /user/Hadoop/out/part-00003

After running:

    hadoop jar hadoop-mapred-examples-0.21.0.jar terasort out sorted

Error is:

    ............
    11/05/21 18:13:12 INFO mapreduce.Job: map 74% reduce 20%
    11/05/21 18:13:14 INFO mapreduce.Job: Task Id : attempt_201105202144_0039_m_000009_0, Status : FAILED
    java.io.EOFException: read past eof

I'm trying to find what the input format for the TeraSort is, but it is not specified.

Thanks for any thought,
Mark
