May 23, marino vetuschi zuccolini wrote forth saying...

> Hello to all
>
> I've a dual frontend with two eth cards. The internal net (eth0) is
> 10.0.0.* and spans from 1 (the frontend)  to 6 (5 dual slaves): the
> nodes are called baxeico** (from 00 to 05). Gmond runs on all the
> nodes as well ganglia-php-rrd . Gexec and authd run well.
> 
> Gstat lists only the front end (!! with the name of the eth1 net,
> which is on the Real World!!). This happens also on my web page of
> ganglia.
> 
> Before a crash of the frontend due to a kernel bug, maybe during MPI
> comm between nodes, all the nodes were listed as baxeico00-baxeico05.
> I've also added a route add -host 239.2.11.71 eth0 and the tcpdump
> listing is clear.
> 
> What is broken when there is a sudden death of the frontend that
> doesn't restart during the next boot?

running "route add -host 239.2.11.71 eth0" doesn't last between reboots.  
you need to add it to an init script to be run at boottime.

-matt


Reply via email to