On 2010-12-10 16:31, Chris Woolum wrote: > So if I run a heap size of 512, would it be a good idea to set a max > of 5 map and 5 reduce tasks or should I set more there and run a > tasktracker max of 10?
"5 map & 5 reduces" for what? maximum tasks per tasktracker? that may be right in your case, at least you should try it. But if you refer to the default number of map & reduce tasks per job, then usually you should set the number of maps higher (e.g. 5 or 7 times the number of nodes) and the number of reduces to something like 3 times the number of nodes, or less. -- Best regards, Andrzej Bialecki <>< ___. ___ ___ ___ _ _ __________________________________ [__ || __|__/|__||\/| Information Retrieval, Semantic Web ___|||__|| \| || | Embedded Unix, System Integration http://www.sigram.com Contact: info at sigram dot com

