Bug#425434: Heartbeat-2 taking up all resources
Over the last few days I reinstalled Debian stable (etch) on my little NSLU2 and gave heartbeat another try. On backports.org I found heartbeat-2.1.2-1~bpo40+2 and installed it, but I could not get it to work. I tried both overcommit settings (/proc/sys/vm/overcommit_memory set to 0 and to 1), with no other processes running and heartbeat configured only to cluster a DHCP server. Unlike my previous attempts, this time I had no LVM and no reiserfs, only an ext3 + swap partitioning. No luck with heartbeat so far: it shows the same behaviour, running out of memory even though there is plenty of free swap.
-- Federico Belvisi
-- To UNSUBSCRIBE, email to [EMAIL PROTECTED] with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]
Bug#425434: #425434: Heartbeat-2 taking up all resources
Horms wrote:
> Have you had a chance to see if this problem still manifests in 2.1.2-1? I believe that it has been built for arm now.
Sorry for my very late reply; I have had other work to do in the last few months. I remember trying when 2.1.2-1 was built for arm, but I am running stable and obviously could not install it as-is on the stable branch. I downloaded the source package and tried to build it for stable, but did not succeed: the build seems to loop forever. It runs configure and make, then at a certain point restarts configure, and so on. I guess I have to upgrade to testing and try that version, but unfortunately I have little time for this at the moment. Maybe in the next week or two I can give it a try. I'm really sorry.
-- Federico Belvisi
Bug#425434: #425434: Heartbeat-2 taking up all resources
I can't see the new 2.1.2-1 on packages.debian.org for the arm architecture: the download page only shows 2.1.1-1, even for the source package, so I can't compile it from source for my architecture. I read on the internet that other people affected by a similar problem with other packages used
    echo 1 > /proc/sys/vm/overcommit_memory
I tried this and it solved some problems I had with other packages, but nothing changed with heartbeat 2.x. I'll give it a try when 2.1.2-1 hits packages.debian.org. Thank you for your interest in this bug report.
-- Federico Belvisi
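The overcommit setting discussed in this message can also be inspected and changed from a script. The following is only a minimal sketch and not part of the original report: the helper names `read_overcommit_mode` and `write_overcommit_mode` are hypothetical, the mode descriptions follow the Linux kernel documentation for `vm.overcommit_memory`, and writing to the real file requires root.

```python
from pathlib import Path

# Standard location of the setting on Linux; another path can be passed
# in for testing without root privileges.
OVERCOMMIT_PATH = Path("/proc/sys/vm/overcommit_memory")

# Meaning of each mode, per the kernel's vm sysctl documentation.
MODES = {
    0: "heuristic overcommit (kernel default)",
    1: "always overcommit",
    2: "strict accounting, no overcommit",
}


def read_overcommit_mode(path=OVERCOMMIT_PATH):
    """Return the current overcommit mode as an integer."""
    return int(Path(path).read_text().strip())


def write_overcommit_mode(mode, path=OVERCOMMIT_PATH):
    """Set the overcommit mode (needs root when path is the real file)."""
    if mode not in MODES:
        raise ValueError(f"unsupported overcommit mode: {mode!r}")
    Path(path).write_text(f"{mode}\n")
```

Calling `write_overcommit_mode(1)` as root is intended to have the same effect as the `echo` command quoted above; the change does not survive a reboot unless it is also set through sysctl configuration.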
Bug#425434: [Linux-ha-dev] Re: Bug#425434: Heartbeat-2 taking up all resources
Simon Horman wrote:
> Or if you want the bleeding-edge dev tree http://hg.linux-ha.org/dev/archive/tip.tar.bz2
I tried to compile this, but it has some problem. I think something in the debianization doesn't work, since the build loops. I discovered that after 10 hours of compiling... I'm not very skilled at such things.
-- Federico Belvisi
Bug#425434: [Linux-ha-dev] Re: Bug#425434: Heartbeat-2 taking up all resources
Simon Horman wrote:
> On Tue, May 22, 2007 at 02:46:23PM +0200, Federico Belvisi wrote:
> > On this page I only found .deb packages for the i386 architecture, but I'm using this on an arm architecture. I found some source .rpm packages for openSUSE, but porting them to Debian arm is a little too difficult for me. I'll give it a try anyway in my spare time.
> I'm not sure what page you are looking at, but the packages in Debian are reasonably up to date (though there seems to be some hold-up on arm :( ). http://packages.qa.debian.org/h/heartbeat.html
> Alternatively, it shouldn't be too difficult to roll your own .deb packages from the source provided (there is source on the page, right?)
The page I was looking at was the one Andrew Beekhof pointed me to, specifically http://software.opensuse.org/download/server:/ha-clustering/ which has version 2.0.9, but only for i386, not arm. He said that 2.0.8 has a memory leak, among other problems.
-- Federico Belvisi
Bug#425434: [Linux-ha-dev] Re: Bug#425434: Heartbeat-2 taking up all resources
Simon Horman wrote:
> I spoke briefly with Andrew, and he says that you can get the source tarball from http://hg.linux-ha.org/dev/archive/SLE10-SP1.tar.bz2
I tried this link. I had a nice adventure compiling this package, but eventually ended up with a proper .deb. However, it behaves like the official Debian packages: it eats up the CPU and shows the same behaviour as before, so no good. :( Tomorrow I'll try the bleeding-edge package and let you know.
-- Federico Belvisi
Bug#425434: Heartbeat-2 taking up all resources
Simon Horman wrote:
> On Mon, May 21, 2007 at 06:39:47PM +0200, Federico Belvisi wrote:
> > Package: heartbeat-2
> > Version: 2.0.7-2
> I'm not sure what the minimal CPU and memory requirements for heartbeat-2 are, but perhaps someone on the linux-ha-dev list has some idea; I have Cced them. In the meantime, could you supply some information on the amount of CPU and memory your machine has?
I Cced this mail to bugzilla (hoping this is correct; if not, let me know).
This is the CPU from /proc/cpuinfo:
    CJ7:~# cat /proc/cpuinfo
    Processor       : XScale-IXP42x Family rev 1 (v5l)
    BogoMIPS        : 266.24
    Features        : swp half fastmult edsp
    CPU implementer : 0x69
    CPU architecture: 5TE
    CPU variant     : 0x0
    CPU part        : 0x41f
    CPU revision    : 1
    Cache type      : undefined 5
    Cache clean     : undefined 5
    Cache lockdown  : undefined 5
    Cache format    : Harvard
    I size          : 32768
    I assoc         : 32
    I line length   : 32
    I sets          : 32
    D size          : 32768
    D assoc         : 32
    D line length   : 32
    D sets          : 32
    Hardware        : Linksys NSLU2
    Revision        :
    Serial          :
and this is /proc/meminfo:
    CJ7:~# cat /proc/meminfo
    MemTotal:        29988 kB
    MemFree:          2164 kB
    Buffers:           392 kB
    Cached:           8008 kB
    SwapCached:       5204 kB
    Active:          11808 kB
    Inactive:         6884 kB
    HighTotal:           0 kB
    HighFree:            0 kB
    LowTotal:        29988 kB
    LowFree:          2164 kB
    SwapTotal:      497972 kB
    SwapFree:       471008 kB
    Dirty:             180 kB
    Writeback:           0 kB
    AnonPages:       10084 kB
    Mapped:           5896 kB
    Slab:             3692 kB
    PageTables:       1008 kB
    NFS_Unstable:        0 kB
    Bounce:              0 kB
    CommitLimit:    512964 kB
    Committed_AS:   135880 kB
    VmallocTotal:   991232 kB
    VmallocUsed:     17104 kB
    VmallocChunk:   966652 kB
As you can see, it has about 32 MB of RAM plus 500 MB of swap. At the moment some daemons are running, such as ssh, apache, mysql and heartbeat v1, and swap is mostly free. No particular task is being done; the machine is just idling with the services up in their default configurations (I have yet to configure them).
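As a rough check of the "32 MB RAM plus 500 MB swap" estimate, the kB values reported by /proc/meminfo can be converted directly. The sketch below is illustrative only and not part of the original message: `parse_meminfo`, `kb_to_mib` and `expected_commit_limit_kb` are hypothetical helper names, the sample text copies four fields from the listing above, and the CommitLimit check assumes the kernel default `overcommit_ratio` of 50, 4 kB pages and no huge pages.

```python
# Four fields copied from the /proc/meminfo listing in the message.
SAMPLE = """\
MemTotal:        29988 kB
SwapTotal:      497972 kB
SwapFree:       471008 kB
CommitLimit:    512964 kB
"""


def parse_meminfo(text):
    """Map each 'Key: value kB' line to an integer number of kB."""
    values = {}
    for line in text.splitlines():
        key, sep, rest = line.partition(":")
        fields = rest.split()
        if sep and fields:
            values[key.strip()] = int(fields[0])
    return values


def kb_to_mib(kb):
    """Convert kB as reported by the kernel (1024 bytes) to MiB."""
    return kb / 1024


def expected_commit_limit_kb(mem_total_kb, swap_total_kb, ratio=50, page_kb=4):
    """CommitLimit computed in pages: RAM * ratio / 100 + swap.

    ratio=50 and page_kb=4 are assumptions (kernel defaults), not values
    stated in the message.
    """
    ram_pages = mem_total_kb // page_kb
    swap_pages = swap_total_kb // page_kb
    return (ram_pages * ratio // 100 + swap_pages) * page_kb


info = parse_meminfo(SAMPLE)
print(f"RAM:  {kb_to_mib(info['MemTotal']):.1f} MiB")   # RAM:  29.3 MiB
print(f"Swap: {kb_to_mib(info['SwapTotal']):.1f} MiB")  # Swap: 486.3 MiB
print(expected_commit_limit_kb(info["MemTotal"], info["SwapTotal"]))  # 512964
```

Under those assumptions the computed limit matches the reported CommitLimit of 512964 kB, which suggests the machine was running with the default heuristic overcommit settings when the listing was taken.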
A typical top header is here:
    top - 08:29:12 up 8 min, 1 user, load average: 0.05, 0.54, 0.39
    Tasks: 81 total, 1 running, 80 sleeping, 0 stopped, 0 zombie
    Cpu(s): 1.0%us, 1.0%sy, 0.0%ni, 98.0%id, 0.0%wa, 0.0%hi, 0.0%si, 0.0%st
    Mem: 29988k total, 28880k used, 1108k free, 548k buffers
    Swap: 497972k total, 27256k used, 470716k free, 8516k cached
I found something strange in /var/log/messages, so I am copying part of it here. After starting heartbeat-2 there are kernel messages complaining about memory. There are no such log entries when I start heartbeat v1; by contrast, this happens every time I start heartbeat-2. Yesterday I tried several times, and every time there were kernel messages like these:
    May 21 16:13:14 CJ7 heartbeat: [5025]: WARN: Core dumps could be lost if multiple dumps occur
    May 21 16:13:14 CJ7 heartbeat: [5025]: WARN: Consider setting /proc/sys/kernel/core_uses_pid (or equivalent) to 1 for maximum supportability
    May 21 16:13:14 CJ7 heartbeat: [5025]: WARN: Deprecated 'legacy' auto_failback option selected.
    May 21 16:13:14 CJ7 heartbeat: [5025]: WARN: Please convert to 'auto_failback on'.
    May 21 16:13:14 CJ7 heartbeat: [5025]: WARN: See documentation for conversion details.
    May 21 16:13:14 CJ7 heartbeat: [5025]: WARN: Logging daemon is disabled --enabling logging daemon is recommended
    May 21 16:13:14 CJ7 heartbeat: [5025]: info: **
    May 21 16:13:14 CJ7 heartbeat: [5025]: info: Configuration validated. Starting heartbeat 2.0.7
    May 21 16:13:15 CJ7 heartbeat: [5026]: info: heartbeat: version 2.0.7
    May 21 16:13:35 CJ7 heartbeat: [5026]: info: Heartbeat generation: 12
    May 21 16:13:35 CJ7 heartbeat: [5026]: info: No uuid found for current node - generating a new uuid.
    May 21 16:13:36 CJ7 heartbeat: [5026]: info: G_main_add_TriggerHandler: Added signal manual handler
    May 21 16:13:36 CJ7 heartbeat: [5026]: info: G_main_add_TriggerHandler: Added signal manual handler
    May 21 16:13:36 CJ7 heartbeat: [5026]: info: Removing /var/run/heartbeat/rsctmp failed, recreating.
    May 21 16:13:36 CJ7 heartbeat: [5026]: info: glib: ucast: write socket priority set to IPTOS_LOWDELAY on eth0
    May 21 16:13:36 CJ7 heartbeat: [5026]: info: glib: ucast: bound send socket to device: eth0
    May 21 16:13:36 CJ7 heartbeat: [5026]: info: glib: ucast: bound receive socket to device: eth0
    May 21 16:13:36 CJ7 heartbeat: [5026]: info: glib: ucast: started on port 694 interface eth0 to 192.168.0.26
    May 21 16:13:36 CJ7 heartbeat: [5026]: info: G_main_add_SignalHandler: Added signal handler for signal 17
    May 21 16:13:48 CJ7 heartbeat: [5026]: info: Local status now set to: 'up'
    May 21 16:14:09 CJ7 kernel: oom-killer: gfp_mask=0x201d2, order=0
    May 21 16:14:09 CJ7 kernel: Mem-info:
    May 21 16:14:09 CJ7 kernel
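The pattern described in this message, kernel oom-killer entries appearing shortly after heartbeat starts, can be checked mechanically. The following is a minimal sketch under stated assumptions, not a tool used in the original report: `find_oom_after_start` and `parse_time` are hypothetical helpers, the sample lines are copied from the excerpt above, and because syslog lines carry no year the timestamps are only compared within a single day or month.

```python
from datetime import datetime

# Three lines copied from the /var/log/messages excerpt in the message.
LOG = """\
May 21 16:13:14 CJ7 heartbeat: [5025]: info: Configuration validated. Starting heartbeat 2.0.7
May 21 16:13:48 CJ7 heartbeat: [5026]: info: Local status now set to: 'up'
May 21 16:14:09 CJ7 kernel: oom-killer: gfp_mask=0x201d2, order=0
"""


def parse_time(line):
    """Parse the 15-character syslog timestamp, e.g. 'May 21 16:13:14'."""
    return datetime.strptime(line[:15], "%b %d %H:%M:%S")


def find_oom_after_start(lines):
    """Return (seconds_since_start, line) for each oom-killer entry that
    follows the most recent 'Starting heartbeat' line."""
    start = None
    hits = []
    for line in lines:
        if "Starting heartbeat" in line:
            start = parse_time(line)
        elif "oom-killer" in line and start is not None:
            delay = (parse_time(line) - start).total_seconds()
            hits.append((delay, line))
    return hits


hits = find_oom_after_start(LOG.splitlines())
for delay, line in hits:
    print(f"{delay:.0f}s after start: {line}")
```

On the sample lines this reports the oom-killer entry 55 seconds after heartbeat 2.0.7 was started, consistent with the description that the machine runs out of memory within about a minute.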
Bug#425434: [Linux-ha-dev] Re: Bug#425434: Heartbeat-2 taking up all resources
On this page I only found .deb packages for the i386 architecture, but I'm using this on an arm architecture. I found some source .rpm packages for openSUSE, but porting them to Debian arm is a little too difficult for me. I'll give it a try anyway in my spare time.
-- Federico Belvisi
Bug#425434: Heartbeat-2 taking up all resources
Package: heartbeat-2
Version: 2.0.7-2
Architecture: arm
I'm using a Linksys NSLU2 with Debian Etch stable. I tried to set up a simple failover between the NSLU2 and a Gentoo server (i386). The only thing I want to fail over is dhcp3-server (no IP failover, only dhcp3-server). I installed the stable heartbeat on Gentoo and the stable-branch heartbeat on Debian. Gentoo is primary, Debian/NSLU2 is backup.
I configured everything, but soon noticed some small problems: Gentoo complained that the heartbeat versions were different, although some failover was working. The problem was that auto_failback was not working. I soon realized that, in newbie fashion, I had installed heartbeat instead of heartbeat-2 on Debian. I removed the old heartbeat and installed heartbeat-2, and as the installation was finishing, my little NSLU2 started slowing down. After 1 minute I realized that the only option was to shut down the NSLU2. I mounted the NSLU2 hard disk on a PC, performed regular maintenance, and deleted the symlinks so that heartbeat-2 would not start at boot. I double-checked the configuration files: they were the same as when the old heartbeat (1.x branch) was installed, and the same as on the Gentoo server.
I booted the NSLU2, and all was fine. Load averages were between 0.50 and 1. I then decided to start heartbeat-2, so I typed /etc/init.d/heartbeat start. The system started to slow down and became less responsive. I ran top to see the process activity. Heartbeat was using a lot of memory (maybe 43%), and load averages were rising steeply: within about 2 minutes the load average was around 15. I waited a few more minutes, but the shell wasn't updating (even though the ssh session was still up), so I decided to shut down the NSLU2 again.
I retried after upgrading heartbeat-2 from the testing repository (version 2.0.8-1), but the result was the same as above. I think heartbeat-2 on arm is doing something bad and taking up all resources (CPU, memory). This doesn't happen with the heartbeat package.
-- Federico Belvisi