On Fri, Feb 12, 2010 at 4:32 AM, Gaurav Vashishth <vashgau...@gmail.com> wrote: > > I have the Map Reduce function whose job is to process the database , MySql, > and give us some output. For this purpose, I have created the map reduce > fucntion and have used the DBInputFormat, but Im confused in how the > JobTracker will produce the splits here. > > I want that first 'n' records from the database should be processed by > single map task and so on and if jobtracker splits the record and give less > than 'n' records, it would be problem. > > Is there any API for getting this done or Im missing something. >
Maybe you have to write your own splitter? One that makes sure each task has N rows? Is there a splitter that is part of DBInputFormat? Can you look at how it works? Maybe you can specify rows per task just with a configuration? St.Ack