Bug#425434: Heartbeat-2 taking up all resources

2008-01-21 Thread Federico Belvisi
On the last few days I managed to reinstall a new debian stable (etch) on my
little nslu2, and I gave heartbeat another try. On backports.org i found
heartbeat-2.1.2-1~bpo40+2, i installed it but no way to let it working. I tried
with overcommit (/proc/sys/vm/overcommit_memory 0 and 1), no other processes
running, heartbeat only configured to cluster a dhcp server. Unlike the previous
attempts, this time I had no LVM and no reiserfs, only an ext3+swap 
partitioning.

No luck with heartbeat 'till now, it shows the same behaviour, it goes out of
memory even if there's plenty of free swap.


-- 
Federico Belvisi



-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Bug#425434: #425434: Heartbeat-2 taking up all resources

2007-09-28 Thread Federico Belvisi
Horms ha scritto:
 have you had a chance to see if this problem still manifests in 2.1.2-1?
 I believe that is has been built for arm now.

Sorry for my ultra late reply, I had some other works to do in the last months.
I remember i tried when the 2.1.2-1 was built for arm, but i am running stable
and i obviously couldn't install as-is on stable branch. I downloaded the source
deb package and tried to build for stable, but i wasn't able to accomplish this:
it loops forever, it seems looping. It does configure, make and at a certain
point it restart configure and so on.

I guess I have to upgrade to testing and try this version, unfortunately atm i
have little time to accomplish this. Maybe in the next week or two i can give it
a try.

I'm really sorry.

-- 
Federico Belvisi



-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Bug#425434: #425434: Heartbeat-2 taking up all resources

2007-08-01 Thread Federico Belvisi
I can't see the new 2.1.2-1 on packages.debian.org for arm architecture: on
download page, it only shows 2.1.1-1 even on source package, so i can't compile
it from source for my architecture.

I read on internet that other people affected with a similar problem with other
packages used

echo 1  /proc/sys/vm/overcommit_memory

I tried this and I solved some problems I had with other packages, but nothing
changed on heartbeat 2.x.

I'll give a try when 2.1.2-1 will hit packages.debian.org.
Thank you for your interest in this bug report.

--
Federico Belvisi


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Bug#425434: [Linux-ha-dev] Re: Bug#425434: Heartbeat-2 taking up all resources

2007-05-26 Thread Federico Belvisi
Simon Horman ha scritto:
  Or if you want the bleeding-edge dev tree
 http://hg.linux-ha.org/dev/archive/tip.tar.bz2
 

I tried to compile this but it has some problem. I think something in
debianization doens't work, since it loops compiling. I discovered that after
10h of compiling... I'm not so skillful on such things.


--
Federico Belvisi



-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Bug#425434: [Linux-ha-dev] Re: Bug#425434: Heartbeat-2 taking up all resources

2007-05-23 Thread Federico Belvisi
Simon Horman ha scritto:
 On Tue, May 22, 2007 at 02:46:23PM +0200, Federico Belvisi wrote:
 On this page i only found .deb packages for i386 architecture, but i'm using
 this on an arm architecture. I found some source .rpm for opensuse but 
 porting
 it to debian arm is a little too difficult to me.
 I'll give a try anyway in a spare time.
 
 I'm not sure what page you are looking at, but the packages in debian
 are reasonably up to date (though there seems to be some hold up 
 on arm :( ).
 
 http://packages.qa.debian.org/h/heartbeat.html
 
 Alternatively, it shouln't be to difficult to roll your own .deb
 packages from the source provided (there is source on the page, right?)
 

The pages i was looking at was the one told by Andrew Beekhof, specifically 
this:

http://software.opensuse.org/download/server:/ha-clustering/

that has a 2.0.9 version but only for i386, not arm. He said that 2.0.8 has some
memory leak among other problems.


--
Federico Belvisi



-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Bug#425434: [Linux-ha-dev] Re: Bug#425434: Heartbeat-2 taking up all resources

2007-05-23 Thread Federico Belvisi
Simon Horman ha scritto:

 
 I spoke breifly with Andrew, and he says that you can get
 the source tar ball from
 http://hg.linux-ha.org/dev/archive/SLE10-SP1.tar.bz2

I tried this link, I had a nice adventure compiling this package but eventually
i ended up with a correct .deb.

Anyway it doesn't work, it is like the official Debian packages, it eat up the
cpu and shows the same behaviour as before, so no good. :(

Tomorrow i'll try the bleeding edge package and i'll let you know.

--
Federico Belvisi




-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Bug#425434: Heartbeat-2 taking up all resources

2007-05-22 Thread Federico Belvisi
Simon Horman ha scritto:
 On Mon, May 21, 2007 at 06:39:47PM +0200, Federico Belvisi wrote:
 Package: heartbeat-2
 Version: 2.0.7-2
 
 I'm not sure what the minimal cpu and memory requirements for
 heartbeat-2 are, but perhaps someone the linux-ha-dev list has
 some idea, I have Cced them. In the mean time, could you
 supply some information on ammount of CPU and memory your
 machine has.
 

I cced this mail to bugzilla (hoping this is correct, if not let me know). This
is the cpu from /proc/cpuinfo:

CJ7:~# cat /proc/cpuinfo
Processor   : XScale-IXP42x Family rev 1 (v5l)
BogoMIPS: 266.24
Features: swp half fastmult edsp
CPU implementer : 0x69
CPU architecture: 5TE
CPU variant : 0x0
CPU part: 0x41f
CPU revision: 1
Cache type  : undefined 5
Cache clean : undefined 5
Cache lockdown  : undefined 5
Cache format: Harvard
I size  : 32768
I assoc : 32
I line length   : 32
I sets  : 32
D size  : 32768
D assoc : 32
D line length   : 32
D sets  : 32

Hardware: Linksys NSLU2
Revision: 
Serial  : 

and this is /proc/meminfo:
CJ7:~# cat /proc/meminfo
MemTotal:29988 kB
MemFree:  2164 kB
Buffers:   392 kB
Cached:   8008 kB
SwapCached:   5204 kB
Active:  11808 kB
Inactive: 6884 kB
HighTotal:   0 kB
HighFree:0 kB
LowTotal:29988 kB
LowFree:  2164 kB
SwapTotal:  497972 kB
SwapFree:   471008 kB
Dirty: 180 kB
Writeback:   0 kB
AnonPages:   10084 kB
Mapped:   5896 kB
Slab: 3692 kB
PageTables:   1008 kB
NFS_Unstable:0 kB
Bounce:  0 kB
CommitLimit:512964 kB
Committed_AS:   135880 kB
VmallocTotal:   991232 kB
VmallocUsed: 17104 kB
VmallocChunk:   966652 kB

As you see it has about 32Mb ram + 500mb swap. At the moment there are some
daemons running, like ssh, apache, mysql, and heartbeat v1 and virtual memory is
moderately empty. No particular task is done, just a white running with
services up with default configurations (i have yet to configure them). A
typical top header is here:

top - 08:29:12 up 8 min,  1 user,  load average: 0.05, 0.54, 0.39
Tasks:  81 total,   1 running,  80 sleeping,   0 stopped,   0 zombie
Cpu(s):  1.0%us,  1.0%sy,  0.0%ni, 98.0%id,  0.0%wa,  0.0%hi,  0.0%si,  0.0%st
Mem: 29988k total,28880k used, 1108k free,  548k buffers
Swap:   497972k total,27256k used,   470716k free, 8516k cached

I found something interesting in /var/log/messages: i copy a little of it here
because i found something strange. After starting heartbeat-2 there are kernel
messages complaining on memory. There are no log like that when i started
heartbeat v1. By converse, this happens every time i started heartbeat-2.
Yesterday i tried several times and every time there are kernel messages like 
that:

May 21 16:13:14 CJ7 heartbeat: [5025]: WARN: Core dumps could be lost if
multiple dumps occur
May 21 16:13:14 CJ7 heartbeat: [5025]: WARN: Consider setting
/proc/sys/kernel/core_uses_pid (or equivalent) to 1 for maximum supportability
May 21 16:13:14 CJ7 heartbeat: [5025]: WARN: Deprecated 'legacy' auto_failback
option selected.
May 21 16:13:14 CJ7 heartbeat: [5025]: WARN: Please convert to 'auto_failback 
on'.
May 21 16:13:14 CJ7 heartbeat: [5025]: WARN: See documentation for conversion
details.
May 21 16:13:14 CJ7 heartbeat: [5025]: WARN: Logging daemon is disabled
--enabling logging daemon is recommended
May 21 16:13:14 CJ7 heartbeat: [5025]: info: **
May 21 16:13:14 CJ7 heartbeat: [5025]: info: Configuration validated. Starting
heartbeat 2.0.7
May 21 16:13:15 CJ7 heartbeat: [5026]: info: heartbeat: version 2.0.7
May 21 16:13:35 CJ7 heartbeat: [5026]: info: Heartbeat generation: 12
May 21 16:13:35 CJ7 heartbeat: [5026]: info: No uuid found for current node -
generating a new uuid.
May 21 16:13:36 CJ7 heartbeat: [5026]: info: G_main_add_TriggerHandler: Added
signal manual handler
May 21 16:13:36 CJ7 heartbeat: [5026]: info: G_main_add_TriggerHandler: Added
signal manual handler
May 21 16:13:36 CJ7 heartbeat: [5026]: info: Removing /var/run/heartbeat/rsctmp
failed, recreating.
May 21 16:13:36 CJ7 heartbeat: [5026]: info: glib: ucast: write socket priority
set to IPTOS_LOWDELAY on eth0
May 21 16:13:36 CJ7 heartbeat: [5026]: info: glib: ucast: bound send socket to
device: eth0
May 21 16:13:36 CJ7 heartbeat: [5026]: info: glib: ucast: bound receive socket
to device: eth0
May 21 16:13:36 CJ7 heartbeat: [5026]: info: glib: ucast: started on port 694
interface eth0 to 192.168.0.26
May 21 16:13:36 CJ7 heartbeat: [5026]: info: G_main_add_SignalHandler: Added
signal handler for signal 17
May 21 16:13:48 CJ7 heartbeat: [5026]: info: Local status now set to: 'up'
May 21 16:14:09 CJ7 kernel: oom-killer: gfp_mask=0x201d2, order=0
May 21 16:14:09 CJ7 kernel: Mem-info:
May 21 16:14:09 CJ7 kernel

Bug#425434: [Linux-ha-dev] Re: Bug#425434: Heartbeat-2 taking up all resources

2007-05-22 Thread Federico Belvisi
On this page i only found .deb packages for i386 architecture, but i'm using
this on an arm architecture. I found some source .rpm for opensuse but porting
it to debian arm is a little too difficult to me.
I'll give a try anyway in a spare time.

--
Federico Belvisi


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Bug#425434: Heartbeat-2 taking up all resources

2007-05-21 Thread Federico Belvisi
Package: heartbeat-2
Version: 2.0.7-2

Architecture: arm
I'm using a Linksys NSLU2 with Debian Etch stable. I tried to set up a simple
failover between nslu2 and a Gentoo server (i386). The only thing i want to
failover is dhcp3-server (no ip failover, only dhcp3-server). I installed the
stable heartbeat on gentoo and the stable branch heartbeat on Debian. Gentoo is
primary, Debian/NSLU2 is backup.
I configured all the things up but soon i noticed that there were some little
problems: gentoo complained that the heartbeat versions were different, although
some failover was working. The problem was with auto_failback not working. I
soon realized that in a newbie fashion i installed heartbeat instead of
heartbeat-2 on Debian. I removed the old heartbeat, installed heartbeat-2 and as
soon as installation was at the end, my little nslu2 was slowing down. After 1
minute i realized that the only thing was to shutdown the nslu2. I mounted the
nslu2 hard disk on pc and performed regular maintenance and deleted symlinks in
order not to let heartbeat-2 start at boot. I double checked configuration files
 and they were the same as they were when the old heartbeat (branch 1.) was
installed, and that they are the same as on Gentoo server. I booted the nslu2,
and all was fine. Load averages were between 0.50 and 1. I then decided to start
heartbeat-2 so i typed /etc/init.d/heartbeat start. System started to slow
down, being less responsive. I typed top just to see what was process
activity. Heartbeat was using many (maybe 43%) of memory, and load averages were
steeply raising: in about 2 minutes load average was some like 15. I waited some
minutes more but the shell  wasn't updating (even if ssh session was still up),
so i decided to shutdown again the nslu2.

I retried upgrading heartbeat-2 from testing repository (version 2.0.8-1), but
things were the same as above.

I think heartbeat-2 on arm is doing something bad, by taking up all resources
(cpu, memory). This don't happen with package heartbeat.


--
Federico Belvisi



-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]