On Fri, Feb 12, 2010 at 4:32 AM, Gaurav Vashishth <vashgau...@gmail.com> wrote:
>
> I have a MapReduce job that processes a MySQL database and gives us some
> output. For this purpose, I have written the map and reduce functions and
> used DBInputFormat, but I'm confused about how the JobTracker will produce
> the splits here.
>
> I want the first 'n' records from the database to be processed by a
> single map task, the next 'n' by another, and so on. If the JobTracker
> splits the records and gives a task fewer than 'n' records, that would be
> a problem.
>
> Is there an API for getting this done, or am I missing something?
>

Maybe you have to write your own splitter?  One that makes sure each
task has N rows?  Is there a splitter that is part of DBInputFormat?
Can you look at how it works?  Maybe you can specify rows per task
just with a configuration?
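
If you do end up writing your own splitter, the arithmetic could look
roughly like the sketch below. It is plain Java with no Hadoop imports;
the class name FixedSizeSplits and the method ranges are made up for
illustration. The assumption is that a custom InputFormat.getSplits would
first get the total row count (for example from a COUNT query) and then
wrap each [start, end) pair in something like a
DBInputFormat.DBInputSplit. Note that the last range holds fewer than N
rows whenever the row count is not a multiple of N.

```java
import java.util.ArrayList;
import java.util.List;

public class FixedSizeSplits {

    /**
     * Returns [start, end) row ranges that each cover exactly rowsPerSplit
     * rows, except possibly the last one, which takes the remainder.
     */
    public static List<long[]> ranges(long totalRows, long rowsPerSplit) {
        if (rowsPerSplit <= 0) {
            throw new IllegalArgumentException("rowsPerSplit must be positive");
        }
        List<long[]> out = new ArrayList<>();
        for (long start = 0; start < totalRows; start += rowsPerSplit) {
            // Clamp the end so the final range never runs past the table.
            long end = Math.min(start + rowsPerSplit, totalRows);
            out.add(new long[] {start, end});
        }
        return out;
    }

    public static void main(String[] args) {
        // Hypothetical numbers: 25 rows in the table, N = 10 rows per map task.
        for (long[] r : ranges(25, 10)) {
            System.out.println("rows " + r[0] + " to " + r[1] + " (end exclusive)");
        }
    }
}
```

Each range would then map to one map task, so the number of map tasks
follows from the row count rather than from the configured task count.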
St.Ack
