What's your xceivers setting (dfs.datanode.max.xcievers)? What's ulimit -n set to for the hdfs/hadoop user? (You didn't say which release/version you're using.)
> Date: Sun, 1 May 2011 17:47:18 -0700
> Subject: one of our datanodes stops working after few hours
> From: [email protected]
> To: [email protected]
>
> I took a jstack (http://pastebin.com/5v6mHg3t). After few hours, its
> literally staggers to a halt and gets very very slow... Any ideas
> whats its blocking on?
> (main issue is that fsreads for RS get really slow when that happens).
>
> -Jack
