Jeff Zhang created TEZ-2271:
-------------------------------

             Summary: OutputFormat is not recognized correctly 
                 Key: TEZ-2271
                 URL: https://issues.apache.org/jira/browse/TEZ-2271
             Project: Apache Tez
          Issue Type: Bug
            Reporter: Jeff Zhang
            Assignee: Jeff Zhang


TEZ-2162 did not resolve this completely. Pasting the relevant comment from TEZ-2162:
bq. But there is one exceptional case: if the user specifies the OutputFormat
through conf, then even when MROutput is on the mapper side, the user still
needs to set the property MRJobConfig.NEW_API_REDUCER_CONFIG rather than
NEW_API_MAPPER_CONFIG. Alternatively, I think we can treat useNewAPI as true
if either NEW_API_MAPPER_CONFIG or NEW_API_REDUCER_CONFIG is set to true (the
patch does not include this yet).
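
The fallback suggested in that comment could look roughly like the sketch below. It is only an illustration, not the Tez patch: the class name NewApiCheck and the method useNewApi are hypothetical, and a plain Map stands in for the Hadoop Configuration so the snippet runs without Hadoop on the classpath. The two keys are the values that MRJobConfig.NEW_API_MAPPER_CONFIG and MRJobConfig.NEW_API_REDUCER_CONFIG hold in Hadoop.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical helper, not Tez code: a Map stands in for the Hadoop Configuration.
class NewApiCheck {
    // Values of MRJobConfig.NEW_API_MAPPER_CONFIG and NEW_API_REDUCER_CONFIG.
    static final String NEW_API_MAPPER_CONFIG = "mapred.mapper.new-api";
    static final String NEW_API_REDUCER_CONFIG = "mapred.reducer.new-api";

    // Treat the job as new-API if either flag is set to true,
    // regardless of which side the output happens to be on.
    static boolean useNewApi(Map<String, String> conf) {
        return Boolean.parseBoolean(conf.getOrDefault(NEW_API_MAPPER_CONFIG, "false"))
            || Boolean.parseBoolean(conf.getOrDefault(NEW_API_REDUCER_CONFIG, "false"));
    }

    public static void main(String[] args) {
        // A mapper-only job typically sets only the mapper flag.
        Map<String, String> mapOnly = new HashMap<>();
        mapOnly.put(NEW_API_MAPPER_CONFIG, "true");
        System.out.println(useNewApi(mapOnly));
    }
}
```

With a check of this shape, a mapper-only job that sets only the mapper flag would still be treated as a new-API job when the output format is resolved.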

This issue occurs when translating a mapper-only MR job to Tez:
{code}
12:50:07,425 - Thread( main) - (Job.java:1386) - Job job_1427876879456_0010 
failed with state FAILED due to: Vertex failed, vertexName=initialmap, 
vertexId=vertex_1427876879456_0010_1_00, diagnostics=[Task failed, 
taskId=task_1427876879456_0010_1_00_000000, diagnostics=[TaskAttempt 0 failed, 
info=[Error: Failure while running task:java.lang.ClassCastException: 
org.apache.hadoop.mapred.FileOutputCommitter cannot be cast to 
org.apache.hadoop.mapreduce.lib.output.FileOutputCommitter
        at org.apache.hadoop.mapreduce.lib.output.FileOutputFormat.getWorkOutputPath(FileOutputFormat.java:222)
        at org.apache.sqoop.mapreduce.TextImportMapper.setup(TextImportMapper.java:48)
        at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:142)
        at org.apache.sqoop.mapreduce.AutoProgressMapper.run(AutoProgressMapper.java:64)
        at org.apache.tez.mapreduce.processor.map.MapProcessor.runNewMapper(MapProcessor.java:237)
        at org.apache.tez.mapreduce.processor.map.MapProcessor.run(MapProcessor.java:124)
        at org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:330)
        at org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable$1.run(TezTaskRunner.java:180)
        at org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable$1.run(TezTaskRunner.java:172)
        at java.security.AccessController.doPrivileged(Native Method)
        at javax.security.auth.Subject.doAs(Subject.java:415)
        at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1628)
        at org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable.callInternal(TezTaskRunner.java:172)
        at org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable.callInternal(TezTaskRunner.java:168)
        at org.apache.tez.common.CallableWithNdc.call(CallableWithNdc.java:36)
        at java.util.concurrent.FutureTask.run(FutureTask.java:262)
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
        at java.lang.Thread.run(Thread.java:745)
{code} 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
