Hi,

I have this use case - I need to spawn as many mappers as the number of
lines in a file in HDFS. This file isn't big (only 10-50 lines). Actually
each line represents the path of another data source that the Mappers will
work on. So each mapper will read 1 line, (the map() method will need to be
called only once), and work on the data source.

What's the best way to construct InputSplit, InputFormat and RecordReader
to achieve this? I would appreciate any example code :)

Best,
Deepak

Reply via email to