Re: [Linux-HA] Re: About time-out of STONITH.

2008-09-09 Thread HIDEO YAMAUCHI
Hi Dejan, > I see the point of using setpgid(2). Thanks for the patch. I'll > give it a more thorough review next week along with the patch I > already have for this problem. All right. The last patch entrusts you. Please send a patch if I help you. Thank you. Hideo Yamauchi. --- Dejan Muhame

Re: [Linux-HA] Re: About time-out of STONITH.

2008-09-09 Thread Dejan Muhamedagic
Hi Hideo-san, On Tue, Sep 09, 2008 at 04:35:43PM +0900, HIDEO YAMAUCHI wrote: > Hi, > > I confirmed even 2.1.4 versions about this problem. > However, the problem was not solved in the new version either. > > We want to solve this problem early by all means for the future. > > I made the patch

Re: [Linux-HA] Still not working ...

2008-09-09 Thread Dejan Muhamedagic
On Mon, Sep 08, 2008 at 02:54:08PM +0200, Franck Huet wrote: > Dejan Muhamedagic a ?crit : >> On Fri, Sep 05, 2008 at 11:42:34AM +0200, Franck Huet wrote: >> >>> Dejan Muhamedagic a ?crit : >>> Hi, On Thu, Sep 04, 2008 at 09:48:14AM +0200, Franck Huet wrote: >>>

Re: [Linux-HA] Re: Pacemaker 0.7 and pingd

2008-09-09 Thread Serge Dubrouski
Please ignore previous log. THE right one is attached here. On Tue, Sep 9, 2008 at 2:30 PM, Serge Dubrouski <[EMAIL PROTECTED]> wrote: > Attached. > > On Tue, Sep 9, 2008 at 1:51 PM, Andrew Beekhof <[EMAIL PROTECTED]> wrote: >> can you attach some more logs please? >> >> On Tue, Sep 9, 2008 at 19:

Re: [Linux-HA] Re: Pacemaker 0.7 and pingd

2008-09-09 Thread Serge Dubrouski
Attached. On Tue, Sep 9, 2008 at 1:51 PM, Andrew Beekhof <[EMAIL PROTECTED]> wrote: > can you attach some more logs please? > > On Tue, Sep 9, 2008 at 19:00, Serge Dubrouski <[EMAIL PROTECTED]> wrote: >> On Tue, Sep 9, 2008 at 10:21 AM, Serge Dubrouski <[EMAIL PROTECTED]> wrote: >>> I have followi

Re: [Linux-HA] Re: Pacemaker 0.7 and pingd

2008-09-09 Thread Andrew Beekhof
can you attach some more logs please? On Tue, Sep 9, 2008 at 19:00, Serge Dubrouski <[EMAIL PROTECTED]> wrote: > On Tue, Sep 9, 2008 at 10:21 AM, Serge Dubrouski <[EMAIL PROTECTED]> wrote: >> I have following configuration in Pacemaker 0.65 + Heartbeat 2.1.3: >> >> in ha.cf >> >> respawn root /usr

[Linux-HA] Re: Pacemaker 0.7 and pingd

2008-09-09 Thread Serge Dubrouski
On Tue, Sep 9, 2008 at 10:21 AM, Serge Dubrouski <[EMAIL PROTECTED]> wrote: > I have following configuration in Pacemaker 0.65 + Heartbeat 2.1.3: > > in ha.cf > > respawn root /usr/lib/heartbeat/pingd -m 100 -d 1s > ping 192.168.2.1 > > In cib.xml > > > > operation="not_defi

[Linux-HA] Pacemaker 0.7 and pingd

2008-09-09 Thread Serge Dubrouski
I have following configuration in Pacemaker 0.65 + Heartbeat 2.1.3: in ha.cf respawn root /usr/lib/heartbeat/pingd -m 100 -d 1s ping 192.168.2.1 In cib.xml That worked all right and allowed to switch services between nodes in case of networ

Re: [Linux-HA] NFS locks

2008-09-09 Thread Xinwei Hu
You need to bind your /var/lib/nfs/sm directory to somewhere shared across all nodes (i.e. on cluster file system), and sm-notify them on starting nfs server on the new node. In that case, nfs clients are notified to use the new node when switching happens. 2008/9/9 Geoff Dunk <[EMAIL PROTECTED]>

Re: [Linux-HA] Watchdog configuration with SBD

2008-09-09 Thread NAKAHIRA Kazutomo
Thank you again for useful information. I modified heartbeat init script in my test environment referring to your script and it works fine. (Please see attached script.) Best Regards, NAKAHIRA Kazutomo Lars Marowsky-Bree wrote: > On 2008-09-09T15:28:31, NAKAHIRA Kazutomo <[EMAIL PROTECTED]> wrot

RE: [Linux-HA] NFS locks

2008-09-09 Thread Geoff Dunk
Nils I use the following command which resides within /etc/fstab HPC:/gpfs/home/gpfs/home nfs defaults 0 0 I do get exactly that message about stale NFS mounts. I am not sure quite what you mean about major and minor numbers of devices? Regards Geoff Dunk HighPerformance Computing

Re: [Linux-HA] the maximum message size which bcast can handle

2008-09-09 Thread Andrew Beekhof
2008/9/9 Junko IKEDA <[EMAIL PROTECTED]>: >> > net/ipv4/udp.c(2.6.18-92.el5) >> > >> > 495 int udp_sendmsg(struct kiocb *iocb, struct sock *sk, struct \ >> > msghdr *msg, >> > 496 size_t len) >> > 497 { >> > >> > 511 if (len > 0x) >> > 512 return -EMSGSIZ

RE: [Linux-HA] the maximum message size which bcast can handle

2008-09-09 Thread Junko IKEDA
> > net/ipv4/udp.c(2.6.18-92.el5) > > > >   495 int udp_sendmsg(struct kiocb *iocb, struct sock *sk, struct \ > > msghdr *msg, > >   496 size_t len) > >   497 { > > > >   511 if (len > 0x) > >   512 return -EMSGSIZE; > > > > > > in line 511, the limit

Re: [Linux-HA] Re: About time-out of STONITH.

2008-09-09 Thread Lars Marowsky-Bree
On 2008-09-09T16:35:43, HIDEO YAMAUCHI <[EMAIL PROTECTED]> wrote: Hi, let's move this discussion to the dev list. diff -r 928aeea6ee26 stonithd.c --- a/stonithd.cTue Sep 09 16:04:47 2008 +0900 +++ b/stonithd.cTue Sep 09 16:05:13 2008 +0900 @@ -3212,6 +3214,7 @@ stonithRA_stop( st

RE: [Linux-HA] NFS locks

2008-09-09 Thread Hildebrand, Nils, 232
Hi, what mount command do you use on your clients? Do you get error-meesages about stale NFS mounts on your clients? Are major and minor number of the device where your NFS-filesystem resides the same on all your cluster-nodes? Kind regards, Nils > -Original Message- > From: [EMAIL

Re: [Linux-HA] the maximum message size which bcast can handle

2008-09-09 Thread Lars Marowsky-Bree
On 2008-09-09T10:46:01, Junko IKEDA <[EMAIL PROTECTED]> wrote: > net/ipv4/udp.c(2.6.18-92.el5) > >   495 int udp_sendmsg(struct kiocb *iocb, struct sock *sk, struct \ > msghdr *msg, >   496 size_t len) >   497 { > >   511 if (len > 0x) >   512 ret

Re: [Linux-HA] Watchdog configuration with SBD

2008-09-09 Thread Lars Marowsky-Bree
On 2008-09-09T15:28:31, NAKAHIRA Kazutomo <[EMAIL PROTECTED]> wrote: > Hi, Lars > > Thank you for your advice. > > I decided to use the sbd command with "-W" option to enable watchdog. > It operates well when starting from the command line. > > But, I encountered other problem when sbd watch proce

[Linux-HA] Re: About time-out of STONITH.

2008-09-09 Thread HIDEO YAMAUCHI
Hi, I confirmed even 2.1.4 versions about this problem. However, the problem was not solved in the new version either. We want to solve this problem early by all means for the future. I made the patch which used a setpgid function for. This patch works without a problem on a simple test. How