Yes, I was manipulating io.sort.factor too, it speeds up reducer, values
around 30 gives good result for me.
But my problem is not reducer, my problem is Bt-job map taks that spills
to drive.
You mentioned Combiner, how can I turn it on ? I'm running my job from
console like that
mahout ssvd --rank 400 --computeU true --computeV true --reduceTasks 3
--input ${INPUT} --output ${OUTPUT} -ow --tempDir /tmp/ssvdtmp/
document at
https://cwiki.apache.org/MAHOUT/stochastic-singular-value-decomposition.data/SSVD-CLI.pdf
doesn't mention anything about combiner.
Thanks for your answer.
W dniu 22.05.2013 14:59, Sean Owen pisze:
I feel like I've seen this too and it's just a bug. You're not running
out of memory.
Are you also setting io.sort.factor? that can help too. You might try
as high as 100.
Also have you tried a Combiner? if you can apply it it should help too
as it is designed to reduce the amount of stuff spilled.