Rajesh Balamohan created TEZ-2377:
-------------------------------------
Summary: RandomWriter ends up using TextOutputFormat instead of
SequenceFileOutputFormat
Key: TEZ-2377
URL: https://issues.apache.org/jira/browse/TEZ-2377
Project: Apache Tez
Issue Type: Bug
Reporter: Rajesh Balamohan
Assignee: Rajesh Balamohan
{code}
yarn jar ./dist/tez/tez-tests-0.7.0-SNAPSHOT.jar randomwriter
"-Dmapreduce.randomwriter.totalbytes=10737418" /tmp/test1
{code}
This ends up generating TextOutputFormat.
{code}
yarn jar ./dist/tez/tez-tests-0.7.0-SNAPSHOT.jar sort
"-Dmapreduce.framework.name=yarn-tez" -r 5 /tmp/test1 /tmp/test_sorted
{code}
This ends up throwing error
{noformat}
Failure while running task:java.io.IOException:
hdfs://tez-vm:56565/tmp/test1/part-00000 not a SequenceFile
at org.apache.hadoop.io.SequenceFile$Reader.init(SequenceFile.java:1851)
at
org.apache.hadoop.io.SequenceFile$Reader.initialize(SequenceFile.java:1811)
at
org.apache.hadoop.io.SequenceFile$Reader.<init>(SequenceFile.java:1760)
at
org.apache.hadoop.io.SequenceFile$Reader.<init>(SequenceFile.java:1774)
at
org.apache.hadoop.mapreduce.lib.input.SequenceFileRecordReader.initialize(SequenceFileRecordReader.java:54)
at
org.apache.tez.mapreduce.lib.MRReaderMapReduce.setupNewRecordReader(MRReaderMapReduce.java:149)
at
org.apache.tez.mapreduce.lib.MRReaderMapReduce.<init>(MRReaderMapReduce.java:78)
at
org.apache.tez.mapreduce.input.MRInput.initializeInternal(MRInput.java:475)
at
org.apache.tez.mapreduce.input.MRInputLegacy.init(MRInputLegacy.java:108)
{noformat}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)