Rodrick,

I have no idea. You¹ve shared no information on how your cluster is
configured or anything else, so you¹re asking us to speculate on the inner
workings of a black box.

Again, my advice to you is to run gexec through strace ‹ regardless of how
it¹s been implemented (C, C++, Java, whatever), it will follow the
application through the various library and system calls it makes, so if it
in fact makes a call to Œsetrlimit()¹, it will show up pretty clearly in the
output.

Anything else at this point from this side of the conversation would be pure
guesswork, which is not likely to be helpful to you.

good luck,
Klaus

On 8/25/08 1:38 PM, "Rodrick Brown" <[EMAIL PROTECTED]>did etch
on stone tablets:

> I highly suspect this is gexec the application in question is a java app which
> has no native method to set resources ie. Getrlimit()
>  
> Where does the following values come from?
>  
> [EMAIL PROTECTED] ~]$ gexec -n 0 bash -c 'ulimit -n'
> 1 1024
> 3 1024
> 2 1024
> 9 1024
> 8 1024
> 6 1024
> 7 1024
> 16 1024
> 4 1024
> 5 1024
> 10 1024
> 11 1024
> 14 1024
> 12 1024
> 15 1024
> 13 1024
>  
> If I ssh into any  nodes in this cluster and run ulimit ­n I see the following
>  
> [EMAIL PROTECTED] ~]$ for i in $GEXEC_SVRS; do ssh $i 'ulimit -n' ;done
> 65536
> 65536
> 65536 
> 65536
> 65536
> 65536
> 65536
> 65536
> 65536
> 65536
> 65536
> 65536
> 65536
> 65536
> 65536
> 65536
> 65536
>  
>  
> 
> From: Klaus Steden [mailto:[EMAIL PROTECTED]
> Sent: Monday, August 25, 2008 4:32 PM
> To: Rodrick Brown; [email protected]
> Subject: Re: [Ganglia-general] ulimit problems with gexec
>  
> 
> Hi Rodrick,
> 
> I¹d suggest using something like strace to follow the application through
> execution ... if you see an rlimit call in the output, then you know where
> it¹s coming from ... otherwise, I¹d still suspect the shell.
> 
> cordially,
> Klaus
> 
> 
> On 8/25/08 1:26 PM, "Rodrick Brown" <[EMAIL PROTECTED]>did etch on
> stone tablets:
> Steden, I¹m not trying to set the limit beyond the systems upper limit,
> currently the systems upper limit is 65636
>  
> I have no problem setting ulimit ­n to any value that¹s less than 65536 this
> works fine. 
>  
>  I have the following entry in /etc/security/limits.conf
>  
> *               -      nofile          65536
>  
> The system wide limit is set to 65536
>  
> When I log one of my clusters where gexec is called by the same user and run
> ulimit ­a I see the following:
>  
> [EMAIL PROTECTED] ~]$ ulimit -a
> core file size          (blocks, -c) 0
> data seg size           (kbytes, -d) unlimited
> max nice                       (-e) 0
> file size              (blocks, -f) unlimited
> pending signals                (-i) 139264
> max locked memory       (kbytes, -l) 32
> max memory size        (kbytes, -m) unlimited
> open files                     (-n) 60000
> pipe size            (512 bytes, -p) 8
> POSIX message queues     (bytes, -q) 819200
> max rt priority                (-r) 0
> stack size             (kbytes, -s) 10240
> cpu time              (seconds, -t) unlimited
> max user processes             (-u) 139264
> virtual memory          (kbytes, -v) unlimited
> file locks                     (-x) unlimited
>  
> when I start my app from gexec its using the default 1024 limit and not what I
> have set in ~/.bash{rc,_profile} for that user.
>  
> So I don¹t understand where this 1024 value is being called from.
>  
>  
> 
> From: Klaus Steden [mailto:[EMAIL PROTECTED]
> Sent: Monday, August 25, 2008 3:31 PM
> To: Rodrick Brown; [email protected]
> Subject: Re: [Ganglia-general] ulimit problems with gexec
> Importance: High
> 
> 
> Hello Rodrick,
> 
> Regular users are only able to ­lower­ their ulimits, never increase them. You
> need to either be the superuser, or to set higher resource limits by default.
> This has nothing to do with Ganglia, and it¹s not something they¹d be able to
> fix for you. 
> 
> cheers,
> Klaus
> 
> On 8/25/08 11:33 AM, "Rodrick Brown" <[EMAIL PROTECTED]>did etch
> on stone tablets:
> Can someone help me figure out what¹s limiting processes started from gexec to
> use the default 1024 open files limit Œulimit ­n¹?
> In my .bash_profile/.bashrc I have ulimit ­Sn 60000
>  
> Running my application manually w/o being called from gexec works fine,
> however if I call gexec my app is limited to 1024 FD limit which is causing
> major problems. 
>  
>  
> [EMAIL PROTECTED] ~]$ gexec -n 1 bash -c 'ulimit -Sn60000; ulimit -n'
> 0 /bin/bash: line 0: ulimit: open files: cannot modify limit: Invalid argument
> 0 1024
>  
> [EMAIL PROTECTED] scripts]$ ssh nycplx01a 'ulimit -Sn60000; ulimit -n'
> 60000
>  
> I¹ve looked at the sources for gexecd and don¹t see any calls to setrlimit()
> so I¹m not sure why this is happening any help would be appreciated.
> Thanks. 
>  
>  
> ---
> Rodrick R. Brown 
> Director, Systems Engineering
> Ballista Securities, LLC
> 120 Wall St. Suite 2400
> P: 646 307 4709
> C: 347 702 0012 
> F: 646 219-5872 
> 
> 
> -------------------------------------------------------------------------
> This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
> Build the coolest Linux based applications with Moblin SDK & win great prizes
> Grand prize is a trip for two to an Open Source event anywhere in the world
> http://moblin-contest.org/redirect.php?banner_id=100&url=/
> <http://moblin-contest.org/redirect.php?banner_id=100&amp;url=/>
> <http://moblin-contest.org/redirect.php?banner_id=100&amp;url=/>
> 
> 
> 
> _______________________________________________
> Ganglia-general mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/ganglia-general
> 
> 
>  
> 


-------------------------------------------------------------------------
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK & win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100&url=/
_______________________________________________
Ganglia-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/ganglia-general

Reply via email to