In your setup() look at context.getInputSplit(), this will be a FileSplit in your case. From there you can do a getPath() to see the both the directory structure and the split value.

On May 18, 2010, at 10:01 AM, psdc1978 wrote:

Hi,

I'm study the MapReduce code, and I've the following questions:

1 - I'm running the wordcount example. I've 3 txt files as input. Each txt file is about 120Mb.

During the execution of the map tasks, a number of map tasks will read the txt files. Each file is divided in split files. I would like to know to each txt file corresponds a split. For example, for the A.txt file, it will be created 2 splits (split0 and split1) of 64Mb each. I would like to know that split0 and split1 belongs to A.txt. Is it possible? If I've to do some code, is there any object that contains this data?

2 -
The Job task uses a job.split file. What contains this file and what is the purpose of this file?

Thanks,

--
PSC

Reply via email to