[
https://issues.apache.org/jira/browse/MAPREDUCE-4868?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13528629#comment-13528629
]
Radim Kolar commented on MAPREDUCE-4868:
----------------------------------------
Did you tried Spring Batch? You can boot it in setup() and do whatever you want
with data, including multiple steps and multithreading.
> Allow multiple iteration for map
> --------------------------------
>
> Key: MAPREDUCE-4868
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-4868
> Project: Hadoop Map/Reduce
> Issue Type: Improvement
> Components: mrv2
> Affects Versions: 3.0.0, 2.0.3-alpha
> Reporter: Jerry Chen
> Fix For: 3.0.0, 2.0.3-alpha
>
> Original Estimate: 168h
> Remaining Estimate: 168h
>
> Currently, the Mapper class allows advanced users to override "public void
> run(Context context)" method for more control over the execution of the
> mapper, while Context interface limit the operations over the data which is
> the foundation of "more control".
> One of use cases is that when I am considering a hive optimziation problem, I
> want to go two passes over the input data instead of using a another job or
> task ( which may slower the whole process). Each pass do the same thing but
> with a different parameters.
> This is a new paradigm of Map Reduce usage and can be archived easily by
> extend Context interface a little with the more control over the data such as
> reset the input.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira