Hi, I read on Mahout in Action that I should set -Dmapred.map.tasks=X where X would be the number of cores of my cluster. I have been running experiments on amazon EC2 m.large instances. I have been using kmeans over 1.1GB dataset. I never set up that flag. I noticed that on a 10 machine cluster the maximum cpu usage is 60%.
Am I proceeding right? Shall I setup the flag? how? thanks -- View this message in context: http://lucene.472066.n3.nabble.com/is-it-necessary-set-mapred-map-tasks-running-mahout-on-a-cluster-tp1975103p1975103.html Sent from the Mahout User List mailing list archive at Nabble.com.
