Hello, I am want to use cc1.4xlarge cluster for some data processing, to spin clusters I am using cloudera scripts. hadoop-ec2-init-remote.sh has default configuration until c1.xlarge but not configuration for cc1.4xlarge, can someone give formula how does this values calculated based on hardware?
C1.XLARGE MAX_MAP_TASKS=8 - mapred.tasktracker.map.tasks.maximum MAX_REDUCE_TASKS=4 - mapred.tasktracker.reduce.tasks.maximum CHILD_OPTS=-Xmx680m - mapred.child.java.opts CHILD_ULIMIT=1392640 - mapred.child.ulimit I am guessing but I think CHILD_OPTS = (total ram on the box - 1gb) /(MAX_MAP_TASKS, MAX_REDUCE_TASKS) But not sure how to calculate rest Regards, Aleksandr
