Re: [AOLSERVER] Something wrong after 2006-05-12 21:25 (was Re: Weird memory leak problem in AOLserver 3.4.2/3.x)

2006-05-19 Thread Stan Kaufman

'Jesus' Jeff Rogers wrote:
I had problems starting at the exact same time but on Solaris, where 
they manifested as a EINVAL return from pthread_cond_tomedwait.  After 
a day of tracing the problem with debug builds and working with my 
sysadmin to track what changed (of course, nothing had)  I cam to the 
same 1 billion second issue.


Which coincidentally is the expiry time (MaxOpen and MaxIdle) set on 
my database connections. 
My system is ACS-derived, so I wouldn't be surprised if these database 
settings are common in other ACS-derived systems.



What do you think is the reason that not all systems encounter this 1B 
second issue? The passage of time is the one factor inevitably shared by 
every system running aolserver, yet not every system barfs in the same 
fashion. Why?



--
AOLserver - http://www.aolserver.com/

To Remove yourself from this list, simply send an email to [EMAIL PROTECTED] 
with the
body of SIGNOFF AOLSERVER in the email message. You can leave the Subject: 
field of your email blank.


Re: [AOLSERVER] Something wrong after 2006-05-12 21:25 (was Re: Weird memory leak problem in AOLserver 3.4.2/3.x)

2006-05-19 Thread Stan Kaufman

'Jesus' Jeff Rogers wrote:
Simple, because it's a config file setting, not anything to do with 
the underlying system.  If your config file has


[ns/db/pool/main]
MaxOpen=10
MaxIdle=10

(which I think was done to work around some ancient bug in an ancient 
version of the nsoracle driver) then you get the problem.  If your 
timeouts are more reasonable or 0 to explicitly specify never timeout, 
then no problem.


Ah. For whatever reason, in my config files MaxOpen and MaxIdle were 
commented out; that must be why my systems didn't encounter this problem.


Not defining these two vars must have been the way the 3.2.5 config file 
was distributed back in the day, as this is not something I would have 
thought to do. Leaving them undefined seems to have no ill effect.


So are people defining them to 0 or simply undefining them?


--
AOLserver - http://www.aolserver.com/

To Remove yourself from this list, simply send an email to [EMAIL PROTECTED] 
with the
body of SIGNOFF AOLSERVER in the email message. You can leave the Subject: 
field of your email blank.


Re: [AOLSERVER] Something wrong after 2006-05-12 21:25 (was Re: Weird memory leak problem in AOLserver 3.4.2/3.x)

2006-05-19 Thread Stan Kaufman

Stan Kaufman wrote:
Not defining these two vars must have been the way the 3.2.5 config 
file was distributed back in the day


Yup, that was the case: http://openacs.org/doc/openacs-3/nsd.txt

Interesting that that inoculated OpenACS 3.2.5 systems from this problem.


--
AOLserver - http://www.aolserver.com/

To Remove yourself from this list, simply send an email to [EMAIL PROTECTED] 
with the
body of SIGNOFF AOLSERVER in the email message. You can leave the Subject: 
field of your email blank.


Re: [AOLSERVER] Something wrong after 2006-05-12 21:25 (was Re: Weird memory leak problem in AOLserver 3.4.2/3.x)

2006-05-19 Thread Stan Kaufman

Dossy Shiobara wrote:

Generally, the only time I've seen people config the MaxOpen and MaxIdle
to 1B seconds is when they're using an ACS or OpenACS recommended
configuration.

It might be a good idea for the OpenACS folks to edit/update their
documentation and sample configs to correct this, as well ... although
it'll never be May 13, 2006 ever again, so maybe it's a non-issue.  :-)
  



Why doesn't AOLServer v4.x have problems with MaxOpen and MaxIdle set to 
1B -- as they are in OpenACS 5.x configs?


Do MaxOpen and MaxIdle even need to be defined at all? They weren't in 
OpenACS 3.2.5/AOLServer 3.3.1+ad13; what's different with the current stack?



--
AOLserver - http://www.aolserver.com/

To Remove yourself from this list, simply send an email to [EMAIL PROTECTED] 
with the
body of SIGNOFF AOLSERVER in the email message. You can leave the Subject: 
field of your email blank.


Re: [AOLSERVER] Something wrong after 2006-05-12 21:25 (was Re: Weird memory leak problem in AOLserver 3.4.2/3.x)

2006-05-18 Thread Stan Kaufman

Cynthia Kiser wrote:

Quoting Gustaf Neumann [EMAIL PROTECTED]:
  

It seems, as if all problems are in tcl 8.3.*.


But just a data point from the not happening here side. I have

info patchlevel 8.3.2
  
FWIW, on my boxes where aolserver appears to function correctly, the tcl 
in the aolserver lib is 8.3:


/usr/local/aolserver/lib/tcl8.3/

On one box I have tcl 8.3.3 installed generally, and on another 8.4.9 
(as revealed by checking [info patchlevel] from tclsh). But I presume 
that in both cases, aolserver is still using the tcl in its lib 
directory, correct? Since [ns_info patchlevel] isn't implemented in 
3.3.1+ad13, is there some way to tell for sure which tcl aolserver is using?


Anyway, if it is the case that aolserver is using its own version of 
tcl, then the tcl 8.3.x is the problem theory wouldn't appear to stand.



--
AOLserver - http://www.aolserver.com/

To Remove yourself from this list, simply send an email to [EMAIL PROTECTED] 
with the
body of SIGNOFF AOLSERVER in the email message. You can leave the Subject: 
field of your email blank.


Re: [AOLSERVER] Something wrong after 2006-05-12 21:25 (was Re: Weird memory leak problem in AOLserver 3.4.2/3.x)

2006-05-18 Thread Stan Kaufman

Andrew Piskorski wrote:

If all you want to know is the Tcl version AOLserver is using, it's
trivially easy, just use the Tcl info patchlevel command from inside
AOLserver.
Right; duh. Thanks. So, the version of tcl my 3.3.1+ad13 sites are using 
is 8.3.2, and they appear to reboot without the VM and scheduled proc 
problems. FWIW.



--
AOLserver - http://www.aolserver.com/

To Remove yourself from this list, simply send an email to [EMAIL PROTECTED] 
with the
body of SIGNOFF AOLSERVER in the email message. You can leave the Subject: 
field of your email blank.


Re: [AOLSERVER] Weird memory leak problem in AOLserver 3.4.2/3.x

2006-05-17 Thread Stan Kaufman

Zachary Shaw wrote:

before may 13th nanosleep was in the form

[pid   614] nanosleep({0, 34478},  unfinished ...

after the 12th there were nanosleeps in the form
[pid   614] nanosleep({9, 934211000},  unfinished ...
  


This looks like a blatant clue -- to someone who understands this. 
nanosleep is a *nix lib command 
(http://linux.about.com/library/cmd/blcmdl2_nanosleep.htm), which makes 
me think the problem may not be with AOLServer. Simple-mindedly grepping 
for nanosleep through the ad33.13 source and the aolserver includes etc 
returns nothing, so the code isn't calling nanosleep directly. I wonder 
if this is a TCL issue, though grepping through tcl source also returns 
nothing. Hmm...



one additional piece of information.  It appears that our logs stopped rolling
on may 13th.
  


My logs are rolling just fine -- both the error and access logs. Weirder 
and weirder.



--
AOLserver - http://www.aolserver.com/

To Remove yourself from this list, simply send an email to [EMAIL PROTECTED] 
with the
body of SIGNOFF AOLSERVER in the email message. You can leave the Subject: 
field of your email blank.


Re: [AOLSERVER] Something wrong after 2006-05-12 21:25 (was Re: Weird memory leak problem in AOLserver 3.4.2/3.x)

2006-05-17 Thread Stan Kaufman

Dossy Shiobara wrote:
No, I meant ns_info patchlevel -- to get the full version of 
AOLserverthat's running -- but yes, I should have asked for info 
patchleveltoo, to find out what version of Tcl is being used.


patchlevel appears not to be a switch to ns_info in 
AOLserver/3.3.1+ad13 (as Janine pointed out):


[17/May/2006:15:08:53][14372.11231392][-conn5-] Error: unknown command 
patchlevel:  should be address, argv0, builddate, callbacks, config, 
hostname, label, locks, log, name, pageroot, pid, platform, scheduled, 
server, sockcallbacks, tag, tcllib, threads, version, or winnt

   while executing
ns_info patchlevel

In any case, here is info about a system that appears *not* to have this 
trouble:


% info patchlevel
8.4.9

ns_info version:
3.3.1+ad13

uname:
Linux x 2.4.27 #1 Mon Jul 4 21:39:37 PDT 2005 i686 GNU/Linux
(Debian sarge)

glibc:
libstdc++2.10-glibc2.2


--
AOLserver - http://www.aolserver.com/

To Remove yourself from this list, simply send an email to [EMAIL PROTECTED] 
with the
body of SIGNOFF AOLSERVER in the email message. You can leave the Subject: 
field of your email blank.


Re: [AOLSERVER] Weird memory leak problem in AOLserver 3.4.2/3.x

2006-05-16 Thread Stan Kaufman
Dave Siktberg david.siktberg at WEBILITY.MD writes:

 restarting AOL Server last night around 11PM EDT.  Scheduled procs on both
 were working fine up until the restart -- then none.  Scheduled procs are

On a development box, I just started up clones of my
AOLserver/3.3.1+ad13/OpenACS 3.2.5 sites, and they appear to run fine --
including scheduled procs. I'm not seeing any VM problems either.

 I can also fiddle with my development server.  Could this be a date-related
 problem?  I think I'll set the system date back several days, restart, and
 see what happens.  Any other ideas?

Using the current date caused no problems for me; what was the effect of
back-dating?

 Any other information I can provide or things I can do to help debug this?
 As with Janine, upgrading AOLServer is not a feasible option in the short
 term.

Same here.

Could this be an OS problem?

Stan


--
AOLserver - http://www.aolserver.com/

To Remove yourself from this list, simply send an email to [EMAIL PROTECTED] 
with the
body of SIGNOFF AOLSERVER in the email message. You can leave the Subject: 
field of your email blank.