-Running a job locally means that you do not need a Hadoop cluster for
it. It runs in a single process on the local machine, so you can run it
on any machine.
-This is more of a Hadoop question than a Mahout one. You need to
execute the jar on a machine that has HADOOP_HOME set and whose Hadoop
configuration points to the cluster, so it does not have to be the
JobTracker node.
It will run in the same way as any other Hadoop job.
-Yes, you can use the Eclipse Hadoop plugin to submit jobs to the cluster.
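To make the first two points concrete, here is a rough sketch in Python of how a small wrapper could choose between local and cluster execution and build the submit command. It is only an illustration, not how Mahout itself does it. It assumes that the bin/mahout script runs locally when MAHOUT_LOCAL is non-empty, and that HADOOP_HOME points to a Hadoop install configured for your cluster. The helper names (mahout_env, hadoop_jar_command), the /opt/hadoop path, the jar name and the class name are all made up for the example.

```python
import os
import shlex


def mahout_env(base_env, local):
    """Return a copy of base_env set up for local or cluster execution."""
    env = dict(base_env)
    if local:
        # Assumption: bin/mahout treats any non-empty MAHOUT_LOCAL as
        # "run locally", even when HADOOP_HOME is also set.
        env["MAHOUT_LOCAL"] = "true"
    else:
        env.pop("MAHOUT_LOCAL", None)
        if not env.get("HADOOP_HOME"):
            raise ValueError("HADOOP_HOME must be set to submit to a cluster")
    return env


def hadoop_jar_command(env, jar, main_class, args):
    """Build the 'hadoop jar' command line for a packaged job."""
    hadoop = os.path.join(env["HADOOP_HOME"], "bin", "hadoop")
    return [hadoop, "jar", jar, main_class] + list(args)


if __name__ == "__main__":
    # Hypothetical path, jar and class names, for illustration only.
    env = mahout_env({"HADOOP_HOME": "/opt/hadoop"}, local=False)
    cmd = hadoop_jar_command(
        env,
        "my-kmeans-job.jar",
        "com.example.KMeansJob",
        ["--input", "in", "--output", "out"],
    )
    # Print the command instead of running it. To actually submit, pass
    # cmd to subprocess.run with env merged into os.environ.
    print(" ".join(shlex.quote(part) for part in cmd))
```

The same command works from any machine whose Hadoop configuration points to the cluster. Hadoop then decides which nodes run the tasks.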
On 16-05-2012 15:30, Chandra Mohan, Ananda Vel Murugan wrote:
I have a few basic doubts about Mahout.
* What is the difference between running a Mahout job locally and
in Hadoop?
* I wrote a simple Mahout job to do K-means clustering using my
data. I packaged it as a jar and tried running it. It worked and did the
clustering on a single-node Hadoop cluster. I am planning to move this
job to a multi-node cluster. Should I execute the mahout command from the
JobTracker node only? Or can I execute it from any node in the cluster and
be assured that it uses all the nodes in the cluster? How does Mahout work
in a multi-node cluster?
* Can I use the Eclipse Hadoop plugin to submit Mahout jobs to the
cluster?
Thanks for any assistance.
Regards,
Anand.C