Josh Wills created CRUNCH-241:
---------------------------------

             Summary: Write side outputs from the Mapper stage of a MapReduce job
                 Key: CRUNCH-241
                 URL: https://issues.apache.org/jira/browse/CRUNCH-241
             Project: Crunch
          Issue Type: Bug
          Components: Core
            Reporter: Josh Wills
            Assignee: Josh Wills
         Attachments: CRUNCH-241.patch

Right now, Crunch always writes output files from the "last" stage of whatever 
kind of job it runs: either the reduce side of a MapReduce job or the map side 
of a map-only job. As a result, we often have to process the same input twice, 
once for the map-side outputs and again for the reduce-side outputs.

This change adds the ability for Crunch to write side outputs from the mapper 
phase of a MapReduce job (i.e., we can write output Targets from both the map 
side and the reduce side). Pipelines that perform both kinds of writes should 
run much faster, because they no longer need a second pass over the input.
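
To illustrate the difference, here is a minimal plain-Java simulation of the 
two execution plans. It does not use the Crunch API and is not the code in the 
attached patch: the class SideOutputSketch, its methods, and the word-count 
workload are all hypothetical names chosen for the example. It only shows that 
writing the map-side output in the same job halves the number of input reads 
while producing the same results.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.TreeMap;

// Plain-Java simulation; none of these names are part of the Crunch API.
class SideOutputSketch {

    /** Outputs of one simulated plan, plus the number of input records it read. */
    static final class Result {
        final List<String> mapSideOutput = new ArrayList<>();
        final Map<String, Integer> reduceSideOutput = new TreeMap<>();
        int recordsRead = 0;
    }

    /** Hypothetical map function: normalize one input line. */
    static String clean(String line) {
        return line.trim().toLowerCase(Locale.ROOT);
    }

    /** Stand-in for shuffle + reduce: add one line's words to the per-word counts. */
    static void countWords(String cleaned, Map<String, Integer> counts) {
        for (String word : cleaned.split("\\s+")) {
            if (!word.isEmpty()) {
                counts.merge(word, 1, Integer::sum);
            }
        }
    }

    /** Old plan: a map-only job writes the cleaned lines, then a second job rereads the input. */
    static Result runTwoJobs(List<String> input) {
        Result r = new Result();
        for (String line : input) {          // job 1: map-only, writes map-side output
            r.recordsRead++;
            r.mapSideOutput.add(clean(line));
        }
        for (String line : input) {          // job 2: map + reduce, writes reduce-side output
            r.recordsRead++;
            countWords(clean(line), r.reduceSideOutput);
        }
        return r;
    }

    /** New plan: one job whose map stage writes the side output and also feeds the reducer. */
    static Result runOneJob(List<String> input) {
        Result r = new Result();
        for (String line : input) {
            r.recordsRead++;
            String cleaned = clean(line);
            r.mapSideOutput.add(cleaned);             // side output from the map stage
            countWords(cleaned, r.reduceSideOutput);  // regular reduce-side output
        }
        return r;
    }

    public static void main(String[] args) {
        List<String> input = Arrays.asList("  Hello world ", "hello Crunch");
        Result two = runTwoJobs(input);
        Result one = runOneJob(input);
        System.out.println("two jobs read " + two.recordsRead + " records");
        System.out.println("one job read " + one.recordsRead + " records");
        System.out.println("same outputs: " + (two.mapSideOutput.equals(one.mapSideOutput)
                && two.reduceSideOutput.equals(one.reduceSideOutput)));
    }
}
```

For the two-line sample input, the two-job plan reads 4 records and the 
one-job plan reads 2, with identical map-side and reduce-side outputs.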

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
