> I'm not too much of a fan of stealing control of the whole box - my local hadoop admin would really not like me. :)
Completely agree. Our implementations should not do this, which is why ALS runs with a single thread per mapper by default. I just wanted to point out that there are some tricks one can play to greatly enhance performance, provided the environment (e.g. a cluster spawned on EC2, a research setting) supports them.

-sebastian
