OK, thanks for the reply. Do you have any sample code that demonstrates
how to do this?

-Gaurav


Amandeep Khurana wrote:
> 
> DBInputFormat runs a count() query on the RDBMS table and divides the rows
> evenly among the number of mappers. If you want to split using your own
> scheme, you'll have to write your own input format or tweak the existing one.
> 
> 
> Amandeep Khurana
> Computer Science Graduate Student
> University of California, Santa Cruz
> 
> 
> On Fri, Feb 12, 2010 at 12:08 PM, Stack <st...@duboce.net> wrote:
> 
>> On Fri, Feb 12, 2010 at 4:32 AM, Gaurav Vashishth <vashgau...@gmail.com>
>> wrote:
>> >
>> > I have a MapReduce job that processes a MySQL database and gives us
>> > some output. For this purpose, I have written the map and reduce
>> > functions and used DBInputFormat, but I'm confused about how the
>> > JobTracker will produce the splits here.
>> >
>> > I want the first 'n' records from the database to be processed by a
>> > single map task, and so on. If the JobTracker splits the records and
>> > gives a task fewer than 'n' records, that would be a problem.
>> >
>> > Is there any API for getting this done, or am I missing something?
>> >
>>
>> Maybe you have to write your own splitter?  One that makes sure each
>> task has N rows?  Is there a splitter that is part of DBInputFormat?
>> Can you look at how it works?  Maybe you can specify rows per task
>> just with a configuration?
>> St.Ack
>>
> 
> 
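
For reference, the splitting scheme discussed above (exactly 'n' rows per
map task instead of count() divided by the number of mappers) comes down to
a small piece of arithmetic. Below is a minimal sketch of that arithmetic
only, in plain Java with no Hadoop dependency. The class FixedSizeSplits,
its methods, and the table/column names "records" and "id" are all
hypothetical and not part of Hadoop. In a custom input format, getSplits()
could return one split per range (for example by wrapping each range in
DBInputFormat's DBInputSplit), but that wiring is an assumption and is not
shown here.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Hadoop: plans splits of exactly
// rowsPerSplit rows each; only the final split may be shorter.
class FixedSizeSplits {

    // Returns {offset, length} pairs that cover rows [0, totalRows).
    static List<long[]> plan(long totalRows, long rowsPerSplit) {
        if (totalRows < 0 || rowsPerSplit <= 0) {
            throw new IllegalArgumentException(
                    "totalRows must be >= 0 and rowsPerSplit must be > 0");
        }
        List<long[]> ranges = new ArrayList<>();
        for (long offset = 0; offset < totalRows; offset += rowsPerSplit) {
            // The last range is truncated to the rows that remain.
            long length = Math.min(rowsPerSplit, totalRows - offset);
            ranges.add(new long[] {offset, length});
        }
        return ranges;
    }

    // Builds the MySQL query one map task would run for its range.
    // The table and ORDER BY column are placeholders; a stable ORDER BY
    // is assumed so that every task sees the same row ordering.
    static String toQuery(String table, String orderBy, long[] range) {
        return "SELECT * FROM " + table + " ORDER BY " + orderBy
                + " LIMIT " + range[1] + " OFFSET " + range[0];
    }

    public static void main(String[] args) {
        // Example: 10 rows in the table, 4 rows per map task.
        for (long[] range : plan(10, 4)) {
            System.out.println(toQuery("records", "id", range));
        }
    }
}
```

The total row count would come from the same kind of count() query that
DBInputFormat already issues. Note that with this scheme the number of map
tasks follows from the table size rather than from the job configuration.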

-- 
View this message in context: 
http://old.nabble.com/DBInputFormat-tp27562875p27572830.html
Sent from the HBase User mailing list archive at Nabble.com.
