Hi,

On Mon, Feb 09, 2009 at 09:25:51AM +0100, Peter Luciak wrote:
> Hello all,
>
> I have a 2 node cluster, interconnected via crossed-cable and serial cable. 
> I noticed that after some time, the serial link went down, with this 
> message in the kernel log:
>
> kernel: serial8250: too much work for irq4

First time I hear about this. A kernel problem?

> This is on CentOS 5.2 x86_64 with 2.6.18-92.1.13.el5 and 
> heartbeat-2.1.4-2.1. Has anyone encountered a similar problem? It's a 
> Fujitsu Siemens RX300 server. More importantly, is it possible to 
> reestablish the serial link without takeover/heartbeat restart? (I think if 
> I kill the ttyS0 hbread and hbwrite processes the other node will take 
> over).

No, it won't. Heartbeat can recover from this error.
But also because you have another communication link. 

Thanks,

Dejan

>
> Thanks very much for answers,
> Peter
>
> Here is /proc/interrupts output:
>
> # cat /proc/interrupts
>            CPU0       CPU1       CPU2       CPU3
>   0: 1532684436          0          0          0    IO-APIC-edge  timer
>   1:        904          0       2988          0    IO-APIC-edge  i8042
>   4:       7490          0   60869910          0    IO-APIC-edge  serial
>   8:          1          0          0          0    IO-APIC-edge  rtc
>   9:          0          0          0          0   IO-APIC-level  acpi
>  12:       3957          0      24955          0    IO-APIC-edge  i8042
>  98:     152335          0    9145347          0   IO-APIC-level 
> ehci_hcd:usb1, uhci_hcd:usb2
> 106:          0          0          0          0   IO-APIC-level 
> uhci_hcd:usb3
> 114:          0          0          0          0   IO-APIC-level 
> uhci_hcd:usb4
> 122:         22          0          0          0   IO-APIC-level 
> uhci_hcd:usb5
> 130:       5261          0   11382292          0   IO-APIC-level  ahci
> 138: 1124609169          0          0          0         PCI-MSI  eth0
> 146:   41138037          0          0          0         PCI-MSI  eth1
> 169:      15451       3060   15725460     291463   IO-APIC-level  megasas
> NMI:      69963      49649      41985      39739
> LOC: 1532683254 1532683175 1532683085 1532683006
> ERR:          0
> MIS:          0
>
> -- 
> Peter LUCIAK ([email protected])
> IBL Software Engineering, http://www.iblsoft.com/
> Mierov?? 103, 82105 Bratislava, Slovakia
> Phone: +421-2-32662111, Fax: +421-2-32662110
> Direct: +421-2-32662175
> _______________________________________________
> Linux-HA mailing list
> [email protected]
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to