On Jul 15, 2009, at 5:25 PM, zaki rahaman wrote:
I hope I'm understanding your setup correctly but by running on one
machine,
you're not fully exploiting the capabilities of Hadoop's Map/Reduce.
Gains
in computation time will only be seen by increasing the number of
cores or
nodes.
Yep.
If you need access to more computing power, you might want to
consider using Amazon's EC2 (they have preconfigured AMIs for Hadoop
but
youd have to configure and install Mahout, a process which I'm not
totally
familiar with as of yet as I'm still trying to do it myself).
Please add to http://cwiki.apache.org/MAHOUT/mahoutec2.html if you
can. Given a Hadoop AMI, it shouldn't be all that hard to setup a
Job, I wouldn't think. Would be good to have a script that does it,
though.
-Grant