Brian Bockelman wrote:

On Dec 4, 2008, at 6:58 AM, Steve Loughran wrote:

Leeau wrote:
Dear,
I want to config a 4-site hadoop cluster. but failed. Who can help me to
know why? and how can i start it? thanks.

you need to learn to read stack traces

Ah, the common claim of the Java programmer to the Unix admin.

As the person at the end of the phone call when the message is "Java.net.IOException "no route to host" ", I think it is the job of the ops team to know that "No route to host" means DNS is playing up. Whereas in fact they would stop at the word java and reach for the developers, which would waste a lot of time before they would eventually acknowledge the problem lay in the external DNS servers, not our code.


I believe the correct response is "you need to learn to write readable logging messages".


I semi-agree. Semi because the ways an installation can fail is very much greater than the ways the dev team can think of, and a lot of those errors your code sees gets thrown up from below. when an application tries to be clever and give you the likeliest error, it can actually be misleading. As a case in point, when windows is playing up the OS often says "memory is getting low" because in the days of 128MB machines, low memory was the problem. Now with 3-4GB of RAM, the cause is something else, yet the error messages are still

If you look at how Hadoop is evolving, the latest versions add things like the actual port that is in use, because we were all fed up with seeing in the logs that a port was in use, but not which one. Yet I still think that everyone running a hadoop cluster should embrace java stack traces. Not least because it lets you look into the code to see what is playing up.

-steve

Reply via email to