Hi,
I'm trying to set up a new installation of AOLServer (on Fedora Core 5),
and I've hit a problem that I can't seem to resolve.
The issue is that the nsd process consistently dies (every 20 seconds -
5 minutes) with the error message "unable to realloc XXXXX bytes" (some
example values for X: 819200, 1005215, 1367511).
This might normailly indicate that I've run out of memory - however, the
server isn't even close to that state. It has 3GB of RAM, more than 2GB
free, and nothing else cpu-intensive running - yet, nsd dies with 95-115
MB of usage (~70M RSS) each time. nsd on my current FC4 typically
hovers around 600M (lots of caching) and is stable.
I suspect that something in FC5 is limiting the amount of memory a
process can use, but I've disabled SELinux and there's no ulimit.
I've also suspected a bug in either AOLServer and/or Tcl, but I've tried
3 separate versions of TCL now (8.4.15, 8.4.11, and 8.5.x) each with
AOLserver 4.5 and 4.0.10. And of course, I've made sure that threads are
enabled in Tcl.
I've debugged the process, and the call stack isn't very useful -
realloc fails and calls abort() from the call stack of TCL interpretation.
Additional info:
Stacksize: 2MB
Modules: nspostgres-4.0, nscache
kernel: 2.6.17-1.2145_FC5smp
TCL and aolserver installed with configure/make, (not with RPMs).
Top from server, right before crashing:
PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
3586 nsadmin 23 0 105m 63m 2856 S 15 2.1 0:31.48 nsd
Top from my current (non-crashing) server:
20201 nsadmin 17 0 532m 416m 2644 S 2.0 5.2 78:56.84 nsd
If anyone has any ideas, I'd greatly appreciate any help.
Thanks,
Hossein
--
AOLserver - http://www.aolserver.com/
To Remove yourself from this list, simply send an email to <[EMAIL PROTECTED]>
with the
body of "SIGNOFF AOLSERVER" in the email message. You can leave the Subject:
field of your email blank.