Alan Robertson wrote:
> Terry L. Inzauro wrote:
>> Andrew Beekhof wrote:
>>> On 4/11/07, Terry L. Inzauro <[EMAIL PROTECTED]> wrote:
>>>> list,
>>>>
>>>> this is a continuation of another thread that was started a few weeks
>>>> back.  the original thread was
>>>> started in regards
>>>> to the setup of pingd. this thread is in regards to pingd not being
>>>> able to start for whatever
>>>> reason and i suspect my resource
>>>> groups are not starting as a result ;(
>>>>
>>>> a little background:
>>>>
>>>> - two resource groups are defined. i want to split the two resource
>>>> groups between nodes when both
>>>> nodes are online. if both
>>>> nodes are not online, then obviously, fail the resource resource group
>>>> to the other available node.
>>>> - pingd configuration was previously verified correct by Alan R.
>>>> - crm_verify passes
>>>> - BasicSanityCheck 'does not pass' (fails on pingd checks)
>>> pingd isn't failing...
>>>
>>> Apr 11 12:44:07 roxetta CTS: BadNews: heartbeat[13770]:
>>> 2007/04/11_12:44:05 ERROR: glib: Error sending packet: Operation not
>>> permitted
>>> Apr 11 12:44:07 roxetta CTS: BadNews: heartbeat[13770]:
>>> 2007/04/11_12:44:05 ERROR: write failure on ping 127.0.0.1.: Operation
>>> not permitted
>>>
>>> these messages are from the heartbeat communications layer - and if
>>> thats not working, then pingd has no hope at all.
>>>
>>> i have no idea why pinging localhost should fail - firewall?
>>>
>>>> - without pingd, the resource groups function as expected
>>>> - heartbeat has been restarted
>>>> - heartbeat hangs on stopping so i do the following ;)
>>>>         for i in `ps -ef | grep heart  | awk '{print $2}'`; do kill
>>>> $i; done
>>>>
>>>> i have noticed log entries in the log file that are obviously related
>>>> to pingd.  this however 'may'
>>>> not be the case.
>>>> would anyone be interested in lending a hand?
>>>>
>>>> heartbeat version = 2.0.8-r2
>>>> OS = gentoo 2006.1
>>>> kernel = 2.6.18 (i have tested both hardened<with grsecurity and pax>
>>>> as well as generic)
>>>>
>>>>
>>>>
>>>> cibadmin -Q output , ptest output, BasicSanityCheck output and
>>>> messages file are all attached as a
>>>> .tar.bz2.
>>>>
>>>>
>>>> believe me when i tell you that i am stumped. any assistance is
>>>> greatly appreciated.
>>>>
>>>>
>>>>
>>>> _Terry
>>>>
>>>>
>>>>
>>>>
>>>> _______________________________________________
>>>> Linux-HA mailing list
>>>> [EMAIL PROTECTED]
>>>> http://lists.linux-ha.org/mailman/listinfo/linux-ha
>>>> See also: http://linux-ha.org/ReportingProblems
>>>>
>>>>
>> no firewall. i tested with and without iptables. in fact i even unloaded ALL 
>> iptables modules just
>> to be certain.  so then i thought to myself.  pax? perhaps grsecurity? no 
>> luck there either.  i
>> rebuild a kernel without all of the grsec and pax hooks.  no luck.
>>
>>
>>
>> destiny crm # lsmod
>> Module                  Size  Used by
>> softdog                 4752  0
>> tun                     9184  0
>> e100                   28360  0
>> sym53c8xx              64820  0
>> eepro100               25552  0
>> scsi_transport_spi     18752  1 sym53c8xx
>>
>> destiny crm # ping 127.0.0.1
>> PING 127.0.0.1 (127.0.0.1) 56(84) bytes of data.
>> 64 bytes from 127.0.0.1: icmp_seq=1 ttl=64 time=0.097 ms
>> 64 bytes from 127.0.0.1: icmp_seq=2 ttl=64 time=0.054 ms
>>
>> --- 127.0.0.1 ping statistics ---
>> 2 packets transmitted, 2 received, 0% packet loss, time 1002ms
>> rtt min/avg/max/mdev = 0.054/0.075/0.097/0.023 ms
>>
>>
>> so i re-ran BasicSAanityCheck....same result.   any ideas?
> 
> Here is something to run and check...
> 
> ifconfig lo;ip addr show lo; route;ip route show
> 
> Here's what it produces on my machine:
> lo        Link encap:Local Loopback
>           inet addr:127.0.0.1  Mask:255.0.0.0
>           inet6 addr: ::1/128 Scope:Host
>           UP LOOPBACK RUNNING  MTU:16436  Metric:1
>           RX packets:520006 errors:0 dropped:0 overruns:0 frame:0
>           TX packets:520006 errors:0 dropped:0 overruns:0 carrier:0
>           collisions:0 txqueuelen:0
>           RX bytes:190990507 (182.1 Mb)  TX bytes:190990507 (182.1 Mb)
> 
> 1: lo: <LOOPBACK,UP> mtu 16436 qdisc noqueue
>     link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
>     inet 127.0.0.1/8 scope host lo
>     inet6 ::1/128 scope host
>        valid_lft forever preferred_lft forever
> Kernel IP routing table
> Destination     Gateway         Genmask         Flags Metric Ref    Use
> Iface
> 10.10.10.0      *               255.255.255.0   U     0      0        0 eth1
> link-local      *               255.255.0.0     U     0      0        0 eth1
> loopback        *               255.0.0.0       U     0      0        0 lo
> default         gw              0.0.0.0         UG    0      0        0 eth1
> 10.10.10.0/24 dev eth1  proto kernel  scope link  src 10.10.10.5
> 169.254.0.0/16 dev eth1  scope link
> 127.0.0.0/8 dev lo  scope link
> default via 10.10.10.254 dev eth1
> 
> 
> I don't know what I'm looking for to be different, but it's at least
> somewhere to start...
> 
> 

i know. there's not much to go on here....and i'm not a coder, but perhaps 
someone can look in glib
as to why/when that error function would actually gets called?



destiny crm # ip addr show dev lo
2: lo: <LOOPBACK,UP,10000> mtu 16436 qdisc noqueue
    link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
    inet 127.0.0.1/8 brd 127.255.255.255 scope host lo


destiny crm # ip route show
10.254.254.0/30 dev eth1  proto kernel  scope link  src 10.254.254.2
x.x.x.x/28 dev eth0  proto kernel  scope link  src 63.147.188.174
10.0.0.0/24 dev eth2  proto kernel  scope link  src 10.0.0.2
127.0.0.0/8 dev lo  scope link
default via x.x.x.x dev eth0


destiny crm # ifconfig lo
lo        Link encap:Local Loopback
          inet addr:127.0.0.1  Mask:255.0.0.0
          UP LOOPBACK RUNNING  MTU:16436  Metric:1
          RX packets:37780 errors:0 dropped:0 overruns:0 frame:0
          TX packets:37780 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:0
          RX bytes:5114592 (4.8 Mb)  TX bytes:5114592 (4.8 Mb)

_______________________________________________
Linux-HA mailing list
[EMAIL PROTECTED]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to