Prabhu Joseph created MAPREDUCE-7216:
----------------------------------------
Summary: TeraSort Job Fails on S3
Key: MAPREDUCE-7216
URL: https://issues.apache.org/jira/browse/MAPREDUCE-7216
Project: Hadoop Map/Reduce
Issue Type: Bug
Reporter: Prabhu Joseph
Assignee: Prabhu Joseph
TeraSort Job fails on S3 with below exception. Terasort creates OutputPath and
writes partition filename but DirectoryStagingCommitter expects output path to
not exist.
{code}
9/06/07 14:13:34 INFO mapreduce.Job: Job job_1559891760159_0011 failed with
state FAILED due to: Job setup failed :
org.apache.hadoop.fs.PathExistsException: `s3a://bucket/OUTPUT': Setting job as
Task committer attempt_1559891760159_0011_m_000000_0: Destination path exists
and committer conflict resolution mode is "fail"
at
org.apache.hadoop.fs.s3a.commit.staging.StagingCommitter.failDestinationExists(StagingCommitter.java:878)
at
org.apache.hadoop.fs.s3a.commit.staging.DirectoryStagingCommitter.setupJob(DirectoryStagingCommitter.java:71)
at
org.apache.hadoop.mapreduce.v2.app.commit.CommitterEventHandler$EventProcessor.handleJobSetup(CommitterEventHandler.java:255)
at
org.apache.hadoop.mapreduce.v2.app.commit.CommitterEventHandler$EventProcessor.run(CommitterEventHandler.java:235)
at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)
{code}
Creating partition filename in /tmp or some other directory fixes the issue.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]