Hi all !
I'm working on hadoop and currently i'm using the examples provided
(WordCound & Grep especially).
I've managed to make those examples work on a local machine, and now I'd
like to go on to the next step: parallelization. My rpoblem is that I
don't know where to configure this. According to hadoop wiki, the files
mapred-default.xml and hadoop-site.xml should be modified, but my actual
problem is that I don't know which ones of these files should be changed
? I mean, do I have to unzip the hadoop-core jar, modify the xml files,
and zip it back ? Or these files maybe just need to be present in the
directory of the included jar ?
Sorry if the information was present somewhere else, I haven't found it.
Thanks in advance,
Samuel