Steve Loughran created MAPREDUCE-7091:
-----------------------------------------

             Summary: Speed up terasort on S3a
                 Key: MAPREDUCE-7091
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-7091
             Project: Hadoop Map/Reduce
          Issue Type: Improvement
          Components: examples
    Affects Versions: 3.1.0
            Reporter: Steve Loughran


Terasort is very slow on S3 because the sort stage still uses the classic 
rename-to-commit algorithm, even though teragen and the reporting can use the 
new committer.

Reason: {{org.apache.hadoop.examples.terasort.TeraOutputFormat}} overrides 
{{getOutputCommitter}} even though it doesn't need to.
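
To illustrate why the override matters, here is a minimal, self-contained 
sketch. It does not use the real Hadoop classes: {{Committer}}, 
{{RenameCommitter}}, {{S3ACommitter}}, {{BaseOutputFormat}} and the two 
subclasses are hypothetical stand-ins, and picking the committer from the 
path scheme is an assumption made only to model "the base class can choose a 
better committer for the destination".

```java
// Hypothetical stand-ins for illustration only; these are NOT Hadoop classes.
interface Committer {
    String name();
}

// Models the classic rename-to-commit committer.
class RenameCommitter implements Committer {
    public String name() { return "rename"; }
}

// Models a committer that avoids renames on an object store.
class S3ACommitter implements Committer {
    public String name() { return "s3a"; }
}

// Models a base output format that picks the committer from the destination.
// Assumption: selection is keyed on the path scheme, purely for this sketch.
class BaseOutputFormat {
    Committer getOutputCommitter(String outputPath) {
        return outputPath.startsWith("s3a://")
                ? new S3ACommitter()
                : new RenameCommitter();
    }
}

// Models the problem: the subclass overrides the method and hard-codes the
// rename-based committer, so the base class's selection never runs.
class OverridingOutputFormat extends BaseOutputFormat {
    @Override
    Committer getOutputCommitter(String outputPath) {
        return new RenameCommitter();
    }
}

// Models the proposed fix: no override, so the base class decides.
class InheritingOutputFormat extends BaseOutputFormat {
}

class CommitterDemo {
    public static void main(String[] args) {
        String dest = "s3a://bucket/terasort-out";
        System.out.println(new OverridingOutputFormat().getOutputCommitter(dest).name());
        System.out.println(new InheritingOutputFormat().getOutputCommitter(dest).name());
    }
}
```

In this model the overriding format reports {{rename}} for an {{s3a://}} 
destination, while the inheriting one reports {{s3a}}; dropping the 
unnecessary override is what lets the destination-appropriate committer be 
picked up.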



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
