Hello,

I am want to use cc1.4xlarge cluster for some data processing, to spin clusters 
I am using cloudera scripts. hadoop-ec2-init-remote.sh has default 
configuration until c1.xlarge but not configuration for cc1.4xlarge, can 
someone give formula how does this values calculated based on hardware?

C1.XLARGE
    MAX_MAP_TASKS=8 -  mapred.tasktracker.map.tasks.maximum
    MAX_REDUCE_TASKS=4 - mapred.tasktracker.reduce.tasks.maximum
    CHILD_OPTS=-Xmx680m - mapred.child.java.opts
    CHILD_ULIMIT=1392640 - mapred.child.ulimit

I am guessing but I think 

CHILD_OPTS = (total ram on the box - 1gb) /(MAX_MAP_TASKS, MAX_REDUCE_TASKS)

But not sure how to calculate rest

Regards,
Aleksandr 

 

Reply via email to