Re: Monitoring with Ganglia

2009-03-19 Thread Tamir Kamara
Thanks Brian ! Works great. On Thu, Mar 19, 2009 at 3:39 PM, Brian Bockelman wrote: > Hey Tamir, > > Instead of > > mapred.class=org.apache.hadoop.metrics.ganglia.GangliaContext31 (for > Ganglia3.1.x) > > use: > > mapred.class=org.apache.hadoop.metrics.ganglia.GangliaContext31 > > Java is trying

Re: Monitoring with Ganglia

2009-03-19 Thread Brian Bockelman
Hey Tamir, Instead of mapred.class=org.apache.hadoop.metrics.ganglia.GangliaContext31 (for Ganglia3.1.x) use: mapred.class=org.apache.hadoop.metrics.ganglia.GangliaContext31 Java is trying to interpret the parenthetical aside as part of the class name. Brian PS: In distributed systems

Re: Monitoring with Ganglia

2009-03-19 Thread Tamir Kamara
Hi Brian, Do you mean the hadoop-metrics file? It looks like this: # Configuration of the "mapred" context for ganglia # mapred.class=org.apache.hadoop.metrics.ganglia.GangliaContext (defalut for Ganglia3.0.x) mapred.class=org.apache.hadoop.metrics.ganglia.GangliaContext31 (for Ganglia3.1.x) map

Re: Monitoring with Ganglia

2009-03-19 Thread Brian Bockelman
Hey Tamir, This is a very strange stack trace: java.lang.ClassNotFoundException: org.apache.hadoop.metrics.ganglia.GangliaContext31 (for Ganglia3.1.x) at java.net.URLClassLoader$1.run(URLClassLoader.java:200) at java.security.AccessController.doPrivileged(Native Method)

Re: Monitoring with Ganglia

2009-03-19 Thread Brian Bockelman
Hey Tamir, It appears the webserver stripped off your attachment. Do you have more of a stack trace available? Brian On Mar 19, 2009, at 7:25 AM, Tamir Kamara wrote: Hi, The full lsof | grep java is attached. I see a line with the jar: / usr/local/hadoop-0.18.4/hadoop-0.18.4-dev-core.jar w

Re: Monitoring with Ganglia

2009-03-19 Thread Tamir Kamara
Hi, The full lsof | grep java is attached. I see a line with the jar: /usr/local/hadoop-0.18.4/hadoop-0.18.4-dev-core.jar which is the new one the "ant clean jar" command created. On Thu, Mar 19, 2009 at 2:00 PM, Brian Bockelman wrote: > > On Mar 19, 2009, at 6:56 AM, Tamir Kamara wrote: > > H

Re: Monitoring with Ganglia

2009-03-19 Thread Brian Bockelman
On Mar 19, 2009, at 6:56 AM, Tamir Kamara wrote: Hi Brian, I see GangliaContext31.class in the jar and GangliaContext31.java in the src folder. By the way, I only used the last version of each patch. Should I apply the different files per patch from the earliest to the latest ? Nope.

Re: Monitoring with Ganglia

2009-03-19 Thread Tamir Kamara
Hi Brian, I see GangliaContext31.class in the jar and GangliaContext31.java in the src folder. By the way, I only used the last version of each patch. Should I apply the different files per patch from the earliest to the latest ? Thanks, Tamir On Thu, Mar 19, 2009 at 1:38 PM, Brian Bockelman wr

Re: Monitoring with Ganglia

2009-03-19 Thread Brian Bockelman
Hey Tamir, Can you see the file GangliaContext31.java in your jar? In the source directory? Brian On Mar 19, 2009, at 2:33 AM, Tamir Kamara wrote: Hi, All my testing were fine with Ganglia 3.0, I used HADOOP-3422 patch to fix the metric names provided by hadoop and it worked. Because

Re: Monitoring with Ganglia

2009-03-19 Thread Tamir Kamara
Hi, All my testing were fine with Ganglia 3.0, I used HADOOP-3422 patch to fix the metric names provided by hadoop and it worked. Because I had to recompile hadoop (base 0.18.3) I also used Hadoop-4675 in order to use the latest Ganglia (3.1). After changing the metrics file to report with the Gan

Re: Monitoring with Ganglia

2009-03-17 Thread jason hadoop
Make all of your hadoop-metrics properties use the standard IP address of your master node. Then add a straight udp receive block to the gmond.conf of your master node. Then point your gmetad.conf at your master node. There are complete details in forthcoming book, and with this in it, should be a

Re: Monitoring with Ganglia

2009-03-17 Thread Tamir Kamara
I sent my gmond.conf in my previous email... and the address is like carlos wrote. I'll change the hadoop-metrics file and check again. However, I would prefer to use a method I'm more familiar with - like unicast tcp communication. Do you know what I need to change in ganglia and / or hadoop to u

Re: Monitoring with Ganglia

2009-03-17 Thread Brian Bockelman
On Mar 17, 2009, at 10:08 AM, Carlos Valiente wrote: On Tue, Mar 17, 2009 at 14:56, Tamir Kamara wrote: I don't know too much about multicast... and I'm using the default gmond conf file. The default multicast address seems to be 239.2.11.71, so that's the one for your hadoop-metrics.pro

Re: Monitoring with Ganglia

2009-03-17 Thread Carlos Valiente
On Tue, Mar 17, 2009 at 14:56, Tamir Kamara wrote: > I don't know too much about multicast... and I'm using the default gmond > conf file. The default multicast address seems to be 239.2.11.71, so that's the one for your hadoop-metrics.properties. > Wouldn't using the multicast address mean I'll

Re: Monitoring with Ganglia

2009-03-17 Thread Tamir Kamara
file metrics work fine. I don't know too much about multicast... and I'm using the default gmond conf file. Wouldn't using the multicast address mean I'll need to specify a different address for each node so that the data won't get to all nodes running gmond ? On Tue, Mar 17, 2009 at 4:46 PM, Br

Re: Monitoring with Ganglia

2009-03-17 Thread Brian Bockelman
Yup, that's the next question: what's your recv channel in gmond.conf on that node? You can just send along the whole gmond.conf if you're not sure. If you set the metrics to be logged to a file, do they appear there? I.e., have you verified the metrics are working at all for the node?

Re: Monitoring with Ganglia

2009-03-17 Thread Carlos Valiente
On Tue, Mar 17, 2009 at 14:06, Tamir Kamara wrote: > My hadoop-metrics looks like this: > > dfs.class=org.apache.hadoop.metrics.ganglia.GangliaContext > dfs.period=10 > dfs.servers=localhost:8649 > > mapred.class=org.apache.hadoop.metrics.ganglia.GangliaContext > mapred.period=10 > mapred.servers=

Re: Monitoring with Ganglia

2009-03-17 Thread Tamir Kamara
Hi, I found the link after hours of going through Google search results... My hadoop-metrics looks like this: dfs.class=org.apache.hadoop.metrics.ganglia.GangliaContext dfs.period=10 dfs.servers=localhost:8649 mapred.class=org.apache.hadoop.metrics.ganglia.GangliaContext mapred.period=10 mapred

Re: Monitoring with Ganglia

2009-03-17 Thread Brian Bockelman
Hey Tamir, I assume you want something like this: http://rcf.unl.edu/ganglia/?c=red-workers&h=node155&m=load_one&r=hour&s=descending&hc=4 (That link's old - where'd you find it? I'll update it...) Can you send out the relevant lines from the hadoop-metrics file? Also, can you do the followin