Yes, you can do this - the services are not coupled with one another.
Just start tasktrackers on one set of machines, and datanodes on
another set of machines (via bin/hadoop-daemon.sh start
{tasktracker,datanode} or so, individually.)
You will lose out on complete data locality during processing, however.
On Wed, Oct 12, 2011 at 9:07 AM, Xianqing Yu <[email protected]> wrote:
> Hi people,
>
> I have a question about how to setup hadoop cluster. Could I set TaskTracker
> and DataNode running on the different machines? Which means one machine with
> Tasktracker only, and one machine has DataNode daemon only.
>
> Thanks,
>
> Xianqing
>
--
Harsh J