[jira] [Updated] (MAPREDUCE-4868) Allow multiple iteration for map

2014-09-09 Thread Allen Wittenauer (JIRA)

 [ https://issues.apache.org/jira/browse/MAPREDUCE-4868?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Allen Wittenauer updated MAPREDUCE-4868:
----------------------------------------

    Fix Version/s:     (was: 2.4.0)
                       (was: 3.0.0)

 Allow multiple iteration for map
 --------------------------------

                 Key: MAPREDUCE-4868
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4868
             Project: Hadoop Map/Reduce
          Issue Type: Improvement
          Components: mrv2
    Affects Versions: 3.0.0, 2.0.3-alpha
            Reporter: Jerry Chen
   Original Estimate: 168h
  Remaining Estimate: 168h

 Currently, the Mapper class lets advanced users override the public void 
 run(Context context) method for more control over the execution of the 
 mapper, but the Context interface limits the operations on the input data, 
 which are the foundation of that control.
 One use case: while working on a Hive optimization problem, I want to make 
 two passes over the input data instead of using another job or task (which 
 may slow down the whole process). Each pass does the same thing but with 
 different parameters.
 This is a new paradigm of MapReduce usage and can be achieved easily by 
 extending the Context interface slightly to give more control over the 
 data, such as resetting the input.
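
 The request above can be pictured with a small, self-contained sketch. It
 does not use the Hadoop classes: ResettableContext, ListContext, nextValue()
 and reset() are hypothetical stand-ins invented for illustration, and the
 thresholds array is an assumed example of "different parameters". The run()
 method only mirrors the shape of an overridden Mapper.run(Context) loop, with
 a rewind between passes, which is the capability the issue asks for.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class TwoPassMapSketch {

    /** Hypothetical stand-in for the reading side of a mapper context. NOT the Hadoop API. */
    interface ResettableContext<V> {
        boolean nextValue();   // advance to the next record; false at end of input
        V getCurrentValue();   // record at the current position
        void reset();          // hypothetical: rewind to the start of the input
    }

    /** In-memory implementation, for illustration only. */
    static final class ListContext<V> implements ResettableContext<V> {
        private final List<V> records;
        private int pos = -1;  // -1 means "before the first record"

        ListContext(List<V> records) {
            this.records = records;
        }

        @Override
        public boolean nextValue() {
            if (pos + 1 >= records.size()) {
                return false;
            }
            pos++;
            return true;
        }

        @Override
        public V getCurrentValue() {
            return records.get(pos);
        }

        @Override
        public void reset() {
            pos = -1;
        }
    }

    /** Same logic on every pass, with a different parameter (threshold) per pass. */
    static List<String> run(ResettableContext<Integer> context, int[] thresholds) {
        List<String> output = new ArrayList<>();
        for (int pass = 0; pass < thresholds.length; pass++) {
            if (pass > 0) {
                context.reset();  // rewind before every pass after the first
            }
            while (context.nextValue()) {
                int value = context.getCurrentValue();
                if (value >= thresholds[pass]) {
                    output.add("pass" + pass + ":" + value);
                }
            }
        }
        return output;
    }

    public static void main(String[] args) {
        ResettableContext<Integer> context =
                new ListContext<Integer>(Arrays.asList(1, 5, 10));
        System.out.println(run(context, new int[] {5, 10}));
    }
}
```

 In a real job the rewind would have to be provided by the framework's
 record-reading layer; how that could be done is left open here, as it is in
 the issue description.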



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (MAPREDUCE-4868) Allow multiple iteration for map

2014-02-24 Thread Arun C Murthy (JIRA)

 [ https://issues.apache.org/jira/browse/MAPREDUCE-4868?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Arun C Murthy updated MAPREDUCE-4868:
-------------------------------------

    Fix Version/s:     (was: 2.3.0)
                       2.4.0

 Allow multiple iteration for map
 --------------------------------

                 Key: MAPREDUCE-4868
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4868
             Project: Hadoop Map/Reduce
          Issue Type: Improvement
          Components: mrv2
    Affects Versions: 3.0.0, 2.0.3-alpha
            Reporter: Jerry Chen
             Fix For: 3.0.0, 2.4.0

   Original Estimate: 168h
  Remaining Estimate: 168h






[jira] [Updated] (MAPREDUCE-4868) Allow multiple iteration for map

2013-08-27 Thread Arun C Murthy (JIRA)

 [ https://issues.apache.org/jira/browse/MAPREDUCE-4868?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Arun C Murthy updated MAPREDUCE-4868:
-------------------------------------

    Fix Version/s:     (was: 2.1.0-beta)
                       2.3.0

 Allow multiple iteration for map
 --------------------------------

                 Key: MAPREDUCE-4868
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4868
             Project: Hadoop Map/Reduce
          Issue Type: Improvement
          Components: mrv2
    Affects Versions: 3.0.0, 2.0.3-alpha
            Reporter: Jerry Chen
             Fix For: 3.0.0, 2.3.0

   Original Estimate: 168h
  Remaining Estimate: 168h


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (MAPREDUCE-4868) Allow multiple iteration for map

2013-02-15 Thread Arun C Murthy (JIRA)

 [ https://issues.apache.org/jira/browse/MAPREDUCE-4868?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Arun C Murthy updated MAPREDUCE-4868:
-------------------------------------

    Fix Version/s:     (was: 2.0.3-alpha)
                       2.0.4-beta

 Allow multiple iteration for map
 --------------------------------

                 Key: MAPREDUCE-4868
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4868
             Project: Hadoop Map/Reduce
          Issue Type: Improvement
          Components: mrv2
    Affects Versions: 3.0.0, 2.0.3-alpha
            Reporter: Jerry Chen
             Fix For: 3.0.0, 2.0.4-beta

   Original Estimate: 168h
  Remaining Estimate: 168h




[jira] [Updated] (MAPREDUCE-4868) Allow multiple iteration for map

2012-12-10 Thread Jerry Chen (JIRA)

 [ https://issues.apache.org/jira/browse/MAPREDUCE-4868?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jerry Chen updated MAPREDUCE-4868:
----------------------------------

Description: 
Currently, the Mapper class lets advanced users override the public void 
run(Context context) method for more control over the execution of the mapper, 
but the Context interface limits the operations on the input data, which are 
the foundation of that control.

One use case: while working on a Hive optimization problem, I want to make two 
passes over the input data instead of using another job or task (which may 
slow down the whole process). Each pass does the same thing but with different 
parameters.

This is a new paradigm of MapReduce usage and can be achieved easily by 
extending the Context interface slightly to give more control over the data, 
such as resetting the input.

  was:
Currently, the Mapper class lets advanced users override the public void 
run(Context context) method for more control over the map the execution of the 
mapper, but the Context interface limits the operations on the input data, 
which are the foundation of that control.

One use case: while working on a Hive optimization problem, I want to make two 
passes over the input data instead of using another job or task (which may 
slow down the whole process). Each pass does the same thing but with different 
parameters.

This is a new paradigm of MapReduce usage and can be achieved easily by 
extending the Context interface slightly to give more control over the data, 
such as resetting the input.


 Allow multiple iteration for map
 --------------------------------

                 Key: MAPREDUCE-4868
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4868
             Project: Hadoop Map/Reduce
          Issue Type: Improvement
          Components: mrv2
    Affects Versions: 3.0.0, 2.0.3-alpha
            Reporter: Jerry Chen
             Fix For: 3.0.0, 2.0.3-alpha

   Original Estimate: 168h
  Remaining Estimate: 168h

