Lehel, You may use the NLineInputFormat with N=1: http://hadoop.apache.org/common/docs/current/api/org/apache/hadoop/mapred/lib/NLineInputFormat.html
On Sun, May 20, 2012 at 2:48 PM, biro lehel <lehel.b...@yahoo.com> wrote: > Dear all, > > I have one single input file, which contains, on every line, some > hydrological calibration models (data). Each line of the file should be > processed and then the output from every line written to another single > output file. > > I understood that hadoop spawns mapper tasks with the same number as how many > input files there are (meaning, in my case, a single mapper would be > generated). However, I want that a mapper to be dealing with only a single > line from my input file (nr. of mapper tasks = number of lines in my file). > > What is the best way to obtain such behavior? How should I specify this to > Hadoop? > > Any suggestions are more than welcome. > > Thank you, > Lehel. -- Harsh J