I had the same problem, with 3-4 crashes a day. Upgrading to Tcl 8.4.14 and
AOLserver 4.5 decreased the crash frequency to about one a week (but I still
get the "unable to realloc XXXXX bytes" error).
My dirty workaround is a cron job that periodically checks whether an nsd
process is running and, if not, restarts AOLserver.
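For illustration, a watchdog along those lines might look roughly like the sketch below. This is not my actual script: the restart command is a hypothetical placeholder (use whatever you normally start nsd with), the function names are made up for this example, and it assumes pgrep is installed.

```python
import subprocess

# Hypothetical restart command; replace it with however you normally start nsd.
RESTART_CMD = ["/etc/init.d/aolserver", "restart"]


def nsd_is_running(process_name="nsd", run=subprocess.run):
    """Return True if pgrep finds a process whose name is exactly process_name."""
    # pgrep exits with status 0 when at least one process matches.
    result = run(["pgrep", "-x", process_name], stdout=subprocess.DEVNULL)
    return result.returncode == 0


def check_and_restart(restart_cmd=RESTART_CMD, is_running=nsd_is_running,
                      run=subprocess.run):
    """Restart the server if nsd is gone; return True when a restart was issued."""
    if is_running():
        return False
    run(restart_cmd)
    return True
```

A script that ends with a call to check_and_restart() could then be run from cron every few minutes. It only hides the crash, of course; it does nothing about the realloc failure itself.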
Strange that it didn't work for you.
Did you have a look at your log? Since your server crashes at startup, did
you look at the scheduled procs? (I'm working with OpenACS, which runs a
lot of those at startup through init.tcl.)
- An old bug (I think it has been fixed in AOLserver 4.5, but I'm not
sure): set the following in your config:
ns_param maxidle 000000000
ns_param maxopen 000000000
Hossein Sharifi wrote:
Hi,
I'm trying to set up a new installation of AOLServer (on Fedora Core
5), and I've hit a problem that I can't seem to resolve.
The issue is that the nsd process consistently dies (after anywhere from
20 seconds to 5 minutes) with the error message "unable to realloc XXXXX bytes"
(some example values for X: 819200, 1005215, 1367511).
This might normally indicate that I've run out of memory - however,
the server isn't even close to that state. It has 3GB of RAM, more
than 2GB free, and nothing else cpu-intensive running - yet, nsd dies
with 95-115 MB of usage (~70M RSS) each time. nsd on my current FC4
typically hovers around 600M (lots of caching) and is stable.
I suspect that something in FC5 is limiting the amount of memory a
process can use, but I've disabled SELinux and there's no ulimit.
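One way to double-check the limits is to print them from a process started the same way as nsd (same user, same startup script), since limits are inherited from the parent and may differ from those of an interactive shell. A minimal sketch using Python's standard resource module; which limits to look at is an assumption about what could matter here, and it reports the limits of the Python process itself, not of a running nsd:

```python
import resource

# Limits that can make realloc() fail well before physical memory runs out.
# The selection is an assumption for this example, not an exhaustive list.
LIMITS = {
    "address space (RLIMIT_AS)": resource.RLIMIT_AS,
    "data segment (RLIMIT_DATA)": resource.RLIMIT_DATA,
    "stack (RLIMIT_STACK)": resource.RLIMIT_STACK,
}


def describe(value):
    """Render a limit value, mapping RLIM_INFINITY to 'unlimited'."""
    return "unlimited" if value == resource.RLIM_INFINITY else f"{value} bytes"


def current_limits():
    """Return {label: (soft, hard)} for the limits listed above."""
    return {name: resource.getrlimit(res) for name, res in LIMITS.items()}


def report():
    """Return one human-readable line per limit."""
    return [
        f"{name}: soft={describe(soft)}, hard={describe(hard)}"
        for name, (soft, hard) in current_limits().items()
    ]
```

Printing each line of report() from that environment shows whether any of the three limits is set to something other than unlimited.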
I've also suspected a bug in AOLserver and/or Tcl, but I've
tried 3 separate versions of Tcl now (8.4.15, 8.4.11, and 8.5.x), each
with AOLserver 4.5 and 4.0.10. And of course, I've made sure that
threads are enabled in Tcl.
I've debugged the process, and the call stack isn't very useful -
realloc fails and abort() is called from somewhere inside the Tcl
interpreter.
Additional info: Stacksize: 2MB
Modules: nspostgres-4.0, nscache
kernel: 2.6.17-1.2145_FC5smp
TCL and aolserver installed with configure/make, (not with RPMs).
Top from server, right before crashing:
PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
3586 nsadmin 23 0 105m 63m 2856 S 15 2.1 0:31.48 nsd
Top from my current (non-crashing) server:
20201 nsadmin 17 0 532m 416m 2644 S 2.0 5.2 78:56.84 nsd
If anyone has any ideas, I'd greatly appreciate any help.
Thanks,
Hossein
--
AOLserver - http://www.aolserver.com/
To Remove yourself from this list, simply send an email to
<[EMAIL PROTECTED]> with the
body of "SIGNOFF AOLSERVER" in the email message. You can leave the
Subject: field of your email blank.
--
Programmers.ch
Free and open-source solutions
Tel: ++41 76 44 888 72
Site: http://www.programmers.ch
Site: http://openprojects.ch