Hi, I have a 5-node Hadoop cluster, and every node is multi-core. The map function of my program runs a shell command that takes one file as input. Many such input files have been copied into HDFS.
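
To make the setup concrete, here is a minimal sketch of what such a map step could look like. It assumes Hadoop Streaming with a Python mapper that receives one HDFS path per input line, which may not match a Java map function. The script name mapper.py, the .out suffix, and the output directory argument are hypothetical; only the ./run command is taken from the real setup.

```python
#!/usr/bin/env python3
"""Hypothetical Hadoop Streaming mapper: each input line names one HDFS file."""
import os
import subprocess
import sys


def build_commands(hdfs_path, output_dir):
    """Return the fetch, process, and store commands for one HDFS file."""
    local_in = os.path.basename(hdfs_path.rstrip("/"))
    local_out = local_in + ".out"  # hypothetical naming scheme
    remote_out = output_dir.rstrip("/") + "/" + local_out
    return [
        ["hadoop", "fs", "-get", hdfs_path, local_in],   # copy input to the task's working dir
        ["./run", local_in, local_out],                  # the shell command from the question
        ["hadoop", "fs", "-put", local_out, remote_out], # copy the result back to HDFS
    ]


def main(lines, output_dir, run=subprocess.run):
    """Process every path in `lines`; `run` is injectable so it can be stubbed."""
    for line in lines:
        # Some input formats prepend a key and a tab; keep only the last field.
        path = line.rstrip("\n").split("\t")[-1].strip()
        if not path:
            continue
        for cmd in build_commands(path, output_dir):
            run(cmd, check=True)  # raises if a command fails, which fails the task
        print(f"{path}\tdone")    # emit a key/value pair so the job records progress


# Assumed invocation: mapper.py <hdfs_output_dir>, with paths arriving on stdin.
if __name__ == "__main__" and len(sys.argv) == 2:
    main(sys.stdin, sys.argv[1])
```

In a sketch like this, each map task handles its own share of the path list, so the parallelism comes from how many map tasks run at the same time on each node.
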
In summary, the map function runs a command like ./run <file1> <outputfile1>. Could you please suggest the best way to do this, so that I can use all the cores on each node and run many such maps in parallel?

Thanks,
Aayush
