Hi All: I am migrating from a small grid to a larger one. The small grid runs fine with no issues. On the larger grid, with nearly identical configuration files (just changing host names and file paths), I can get dfs to run, but not all the TaskTrackers. Specifically, the task trackers on the slave nodes fail to initialize, failing on a bind error to the master node using port 0. The task tracker logs are below for the master (which starts up successfully) and from one of the slaves (which fails to start). Any thoughts/comments would be most appreciated. Note that if I log in to the worker node, and using Python do a socket.connect() to the master on the master's port (43913 for this run) I can connect successfully. How do the slave nodes know what port to use when connecting? Any help appreciate as I am tearing out what little is left of my hair :-). Thanks, C G
--------------------------------- Looking for last minute shopping deals? Find them fast with Yahoo! Search.