Hi,

I'm trying to set up a new installation of AOLServer (on Fedora Core 5), and I've hit a problem that I can't seem to resolve.

The issue is that the nsd process consistently dies (every 20 seconds - 5 minutes) with the error message "unable to realloc XXXXX bytes" (some example values for X: 819200, 1005215, 1367511).

This might normailly indicate that I've run out of memory - however, the server isn't even close to that state. It has 3GB of RAM, more than 2GB free, and nothing else cpu-intensive running - yet, nsd dies with 95-115 MB of usage (~70M RSS) each time. nsd on my current FC4 typically hovers around 600M (lots of caching) and is stable.

I suspect that something in FC5 is limiting the amount of memory a process can use, but I've disabled SELinux and there's no ulimit. I've also suspected a bug in either AOLServer and/or Tcl, but I've tried 3 separate versions of TCL now (8.4.15, 8.4.11, and 8.5.x) each with AOLserver 4.5 and 4.0.10. And of course, I've made sure that threads are enabled in Tcl.

I've debugged the process, and the call stack isn't very useful - realloc fails and calls abort() from the call stack of TCL interpretation.

Additional info: Stacksize: 2MB
Modules: nspostgres-4.0, nscache
kernel: 2.6.17-1.2145_FC5smp
TCL and aolserver installed with configure/make, (not with RPMs).

Top from server, right before crashing:
PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND
3586 nsadmin   23   0  105m  63m 2856 S   15  2.1   0:31.48 nsd

Top from my current (non-crashing) server:
20201 nsadmin   17   0  532m 416m 2644 S  2.0  5.2  78:56.84 nsd

If anyone has any ideas, I'd greatly appreciate any help.

Thanks,
Hossein


--
AOLserver - http://www.aolserver.com/

To Remove yourself from this list, simply send an email to <[EMAIL PROTECTED]> 
with the
body of "SIGNOFF AOLSERVER" in the email message. You can leave the Subject: 
field of your email blank.

Reply via email to