[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12774181#action_12774181
 ] 

Arun C Murthy commented on MAPREDUCE-1183:
------------------------------------------

bq. How would we implement this? Would we serialize these to the splits file? 
To a new per-job file? In a parameter to the job-submission RPC?

I'm considering a separate, per-job file via the DistributedCache.
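
To make the per-job file idea concrete, here is a minimal sketch. It assumes plain java.io serialization, which this issue has not settled on, and it uses a local temp file as a stand-in for the file that would be shipped via the DistributedCache. JobComponentFileSketch, LineInput, writeComponents and readComponents are hypothetical names for illustration, not Hadoop APIs.

```java
import java.io.IOException;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.io.Serializable;
import java.nio.file.Files;
import java.nio.file.Path;

public class JobComponentFileSketch {

    /** Hypothetical stand-in for a job component that is configured via its constructor. */
    static class LineInput implements Serializable {
        private static final long serialVersionUID = 1L;
        final String path;

        LineInput(String path) {
            this.path = path;
        }
    }

    /** Client side: write a count followed by each component object into one per-job file. */
    static void writeComponents(Path file, Serializable... components) throws IOException {
        try (ObjectOutputStream out = new ObjectOutputStream(Files.newOutputStream(file))) {
            out.writeInt(components.length);
            for (Serializable component : components) {
                out.writeObject(component);
            }
        }
    }

    /** Task side: read the components back in the order they were written. */
    static Object[] readComponents(Path file) throws IOException, ClassNotFoundException {
        try (ObjectInputStream in = new ObjectInputStream(Files.newInputStream(file))) {
            Object[] components = new Object[in.readInt()];
            for (int i = 0; i < components.length; i++) {
                components[i] = in.readObject();
            }
            return components;
        }
    }

    public static void main(String[] args) throws Exception {
        // Assumption: a temp file plays the role of the per-job file.
        Path file = Files.createTempFile("job-components", ".ser");
        try {
            writeComponents(file, new LineInput("in"));
            LineInput restored = (LineInput) readComponents(file)[0];
            System.out.println(restored.path);
        } finally {
            Files.deleteIfExists(file);
        }
    }
}
```

The point of the sketch is only that the component's state (here, the input path) travels inside the object rather than as a Configuration key; how the framework would actually serialize and distribute the file is still open.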

bq. Long-term it would be nice if job-submissions could be easily made by 
non-Java applications. [...]

I agree! I'd propose we track that in a new JIRA issue, though, to keep the 
patches manageable.

> Serializable job components: Mapper, Reducer, InputFormat, OutputFormat et al
> -----------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-1183
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1183
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>          Components: client
>    Affects Versions: 0.21.0
>            Reporter: Arun C Murthy
>            Assignee: Arun C Murthy
>
> Currently, the Map-Reduce framework uses Configuration to pass information 
> about the various aspects of a job, such as the Mapper, Reducer, InputFormat, 
> OutputFormat, and OutputCommitter, and application developers use the 
> org.apache.hadoop.mapreduce.Job.set*Class APIs to set them at job-submission 
> time:
> {noformat}
> Job.setMapperClass(IdentityMapper.class);
> Job.setReducerClass(IdentityReducer.class);
> Job.setInputFormatClass(TextInputFormat.class);
> Job.setOutputFormatClass(TextOutputFormat.class);
> ...
> {noformat}
> The proposal is that we move to a model where end-users interact with 
> org.apache.hadoop.mapreduce.Job via actual objects which are then serialized 
> by the framework:
> {noformat}
> Job.setMapper(new IdentityMapper());
> Job.setReducer(new IdentityReducer());
> Job.setInputFormat(new TextInputFormat("in"));
> Job.setOutputFormat(new TextOutputFormat("out"));
> ...
> {noformat}
