My advice would be to set up coreadm to capture whichever process is
doing this rather than guessing. [ This works on the assumption that the
process is being sent a SEGV, and hence will dump core if so enabled. ]
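For example (syntax from memory - check coreadm(1M) for the exact options; /var/cores is just a directory I'm assuming you create for this):

    # mkdir -p /var/cores
    # coreadm -g /var/cores/core.%f.%p -e global -e log

The %f/%p expand to the executable name and PID, and the log option makes syslog note each global core dump attempt - which is exactly the identification you're after.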
I would also advise against changing a global value for the sake of one
(misbehaving?) application. I would much rather identify the process,
then either log a bug (if this is errant behaviour), understand what
configuration is necessary to reduce the stack usage, or provide a
per-process change to the limit (e.g. a shell script wrapper to start
the process) rather than blindly setting a global variable.
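As a rough sketch of such a wrapper (the 20480 figure and the daemon path are made up - substitute whatever the application actually needs):

    #!/bin/sh
    # Raise the stack limit (in KB) for this one process only,
    # then replace the shell with the real program.
    ulimit -s 20480
    exec /path/to/the/daemon "$@"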
My suspicion is that there is a recursive function somewhere which is
consuming the stack segment, causing the limit to be reached and the
process to be terminated. Simply raising the limit will give it more
headroom, but won't actually move your problem any further forward - the
core dumps (if any) will simply be larger :-)
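If a core does land, running pstack over it should make any runaway recursion obvious - the same frame repeated over and over. Something like (the core file name here is purely hypothetical):

    # pstack /var/cores/core.Xorg.1234 | head -40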
Obviously this is speculation, but I would suggest enabling coreadm (with
global cores and core logging) at least temporarily to catch the
process. Searching the /var/svc/log files may also reveal a process
which died unexpectedly, perhaps with the service simply being restarted.
If you suspect Xorg, then start with the cde-login or gdm SMF services
and see whether anything was reported there.
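Something along these lines may turn it up (the grep pattern is only a guess at what the restarter writes, and the gdm log name is from memory - list /var/svc/log to find the exact file):

    # svcs -xv
    # grep -li core /var/svc/log/*.log
    # tail -50 /var/svc/log/application-graphical-login-gdm:default.log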
Regards,
Brian
Mike DeMarco wrote:
Thanks for your post Brian:
It is hard to tell which process is throwing this error, as it is only identified by its PID, and since the process dies and the svc watcher attempts to restart it, the PID changes each time.
From /var/adm/messages, around the time this error is generated Xorg is also trying to start, so my best guess is that it is Xorg that wants more stack size.
Is there a way to increase the global max-stack-size above its default of 10Meg?
/etc/project does not do this. I have found that /etc/project is not working
properly even under Solaris 10 u5 & u6; I am working this through Sun support now.
I'm guessing here, but: the limit probably comes from the shell-imposed
stack limit (default 10MB / 10240KB). This is normally put in place to
catch bad apps or recursive functions that don't stop recursing. It helps
stop one process quickly consuming vast amounts of RAM (OK, a simplistic
approach, I know).
The question is - what is the app, and why does it need more than 10MB
for stack?
If it really does need that much, then it may need to have the stack
limit increased (perhaps a shell script wrapper with the appropriate
"ulimit -s" command, or maybe there's a more clever way to do this now
:-) ).
One word of caution, especially for 32-bit processes: don't be too
liberal with the setting for the stack limit. Because of the location of
the stack and libraries within the process virtual address space, the
stack limit effectively removes that amount of memory from the process.
In a 32-bit process, only 4GB is available, and setting the stack limit
to 1GB would actually leave only approx 3GB for the process to use
(despite the stack only being a few KB in size).
I would suggest looking at what the process is, and seeing whether there
is a software fault there first, before fiddling with the stack limit.
coreadm should help you catch what the process is (as out of stack
generates SIGSEGV, I believe) if you don't already know.
Regards,
Brian
--
Brian Ruthven Sun Microsystems UK
Solaris Revenue Product Engineering Tel: +44 (0)1252 422 312
Sparc House, Guillemont Park, Camberley, GU17 9QG
_______________________________________________
opensolaris-discuss mailing list
opensolaris-discuss@opensolaris.org