Hi All, I am doing some experiments by running WordCount example on hadoop. I have a cluster with 7 nodes .I want to run WordCount example with 3mappers and 3 reducers and compare the response time with another experiments when number of mappers and reducers increased to 6 and 12 and so on. For first experiment i set number of the mappers and reducer to 3 in wordCount example source code .and also set the number of replications to 3 in hadoop configurations.Also the maximum number of tasks per node is set to 1 . But when i run the sample with a big data like 2.5 G ,i can see 44 map tasks and 3 reduce tasks are running !!
What parameters do i need to set to have like (3Mappers,3 Reducers), (6M,6R) and (12M,12R) and as i mentioned i have a cluster with 1 namenode and 6 datanodes. Is number of replications related to the number of mappers and reducers ?! Regards, Samaneh