Re: basic doubt on number of reduce tasks

2012-03-02 Thread Bejoy Ks
Vamshi, if you have set the number of reduce slots per node to 5 and you have 4 nodes, then your cluster can run a maximum of 5*4 = 20 reduce tasks at a time. If more reduce tasks are present, they have to wait until reduce slots become available. In reducer the data locality is not
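
The slot arithmetic in this reply can be sketched in a few lines of Java. The class and method names below are hypothetical, and the "waves" calculation is an added illustration that assumes every reduce task takes roughly the same time; it is not something stated in the thread.

```java
public class ReduceSlots {
    // Maximum reduce tasks that can run at once:
    // reduce slots per node multiplied by the number of nodes.
    public static int maxConcurrentReduces(int slotsPerNode, int nodes) {
        return slotsPerNode * nodes;
    }

    // Hypothetical helper: how many rounds ("waves") are needed when there
    // are more reduce tasks than slots. Ceiling division, since a partly
    // filled last round still counts as a round.
    public static int waves(int reduceTasks, int capacity) {
        return (reduceTasks + capacity - 1) / capacity;
    }

    public static void main(String[] args) {
        int capacity = maxConcurrentReduces(5, 4); // the 5 slots x 4 nodes case
        System.out.println("capacity = " + capacity);
        System.out.println("waves for 45 reduces = " + waves(45, capacity));
    }
}
```

With 5 slots on each of 4 nodes the capacity is 20, so a job with 45 reduce tasks would need three rounds, with the later tasks waiting for slots to free up.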

Re: yarn NoClassDefFoundError from LinuxContainerExecutor

2012-03-02 Thread Mingjie Lai
It relates to: https://issues.apache.org/jira/browse/MAPREDUCE-3505 Thanks. On 03/01/2012 07:09 AM, Ioan Eugen Stan wrote: Hi Mingjie, I don't know about Yarn, but a NoClassDefFoundError appears when a class that was present at compile time is no longer available at runtime.

Re: no log function for map/red in a cluster setup

2012-03-02 Thread GUOJUN Zhu
Thank you very much, but that does not help. I did try to symlink one into my working directory with -files conf/log4j.properties#mylog4j.properties, and then specified that configuration in the JVM options: <property><name>mapred.child.java.opts</name><value>

MR output to a file instead of directory?

2012-03-02 Thread Jianhui Zhang
Hi all, The FileOutputFormat/FileOutputCommitter always treats an output path as a directory and writes files under it, even if there is only one Reducer. Is there any way to configure an OutputFormat to write all data into a single file? Thanks, James
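
No reply is shown in this listing. As one possible workaround rather than an OutputFormat setting, the part files can be concatenated after the job finishes. The sketch below does this in plain Java on a local copy of the output directory; the class name MergeParts, the method merge, and the paths in main are hypothetical, and it assumes the reducer outputs follow the usual part-* naming. For output that lives in HDFS, `hadoop fs -getmerge` does a similar job.

```java
import java.io.IOException;
import java.io.OutputStream;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

public class MergeParts {
    // Concatenates every part-* file in outputDir into target, in name
    // order, and returns the number of bytes written. Files that do not
    // match part-* (for example _SUCCESS) are skipped. The target must
    // not itself be a part-* file inside outputDir.
    public static long merge(Path outputDir, Path target) throws IOException {
        List<Path> parts = new ArrayList<>();
        try (DirectoryStream<Path> stream =
                 Files.newDirectoryStream(outputDir, "part-*")) {
            for (Path p : stream) {
                parts.add(p);
            }
        }
        Collections.sort(parts); // part-00000, part-00001, ...

        long bytes = 0;
        try (OutputStream out = Files.newOutputStream(target)) {
            for (Path p : parts) {
                bytes += Files.copy(p, out);
            }
        }
        return bytes;
    }

    public static void main(String[] args) throws IOException {
        // Hypothetical paths: a local copy of the job output and the merged file.
        long n = merge(Paths.get(args[0]), Paths.get(args[1]));
        System.out.println("wrote " + n + " bytes");
    }
}
```

With a single reducer there is only one part file, so the merge amounts to a copy under the name you want.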