Re: [gridengine users] failing mem_free request

2012-09-18 Thread Dave Love
Joseph Farran jfar...@uci.edu writes:

 Dave,

 I am having the same/similar issues as Brian's but with 8.1.2.But for me, 
 it's even worse.

 There are only two resources I can request which are mem_total and 
 swap_total.   All others fail.

OK, thanks.  I'll have another look at the code sometime and try to
guess what might be wrong, and why I can't reproduce it.

-- 
Community Grid Engine:  http://arc.liv.ac.uk/SGE/
___
users mailing list
users@gridengine.org
https://gridengine.org/mailman/listinfo/users


Re: [gridengine users] failing mem_free request

2012-09-17 Thread Brian Smith

Hi, Dave,

I tested only mem_free in my environment.  We're on el6.3 running SGE 
8.1.1, x86_64.  My symptoms came from jobs not dispatching and turning 
schedd_jobinfo on to reveal that that mem_free was an unknown parameter 
on my hosts.  All I know is after adding the attribute to the 
complex_values list for my exec hosts, we successfully worked-around the 
issue.  I'm working on getting a virtualized development environment 
online to test against.  Once I do, I'll provide more output on the problem.


-Brian

Brian Smith
Assistant Director, Research Computing
Information Technology
University of South Florida
4202 E. Fowler Ave. SVC4010
Office Phone: +1 813 974-1467
Organization URL: http://rc.usf.edu

On 09/16/2012 08:44 AM, Dave Love wrote:

Brian Smith b...@usf.edu writes:


Hi, Joseph,

Wow.  Just ran into this one myself (mem_free).  Adding a complex value to
each host set to mem_free=(ram on host) made for a good work-around.  The
variable is still being reported, so it still reports the real mem_free
(not the value you set), but the problem seems to go away when I do this.
  Just FYI.


Is the report you get from qalter -w p the same as previously reported?
Is this specific to mem_free or more general, and can anyone reproduce
it in a basically default configuration?  It's difficult to address when
it's not reproducible, and doesn't seem worth spending much time on.


___
users mailing list
users@gridengine.org
https://gridengine.org/mailman/listinfo/users


Re: [gridengine users] failing mem_free request

2012-09-17 Thread Dave Love
Brian Smith b...@usf.edu writes:

 Hi, Dave,

 I tested only mem_free in my environment.  We're on el6.3 running SGE
 8.1.1, x86_64.  My symptoms came from jobs not dispatching and turning
 schedd_jobinfo on to reveal that that mem_free was an unknown
 parameter on my hosts.  All I know is after adding the attribute to
 the complex_values list for my exec hosts, we successfully
 worked-around the issue.  I'm working on getting a virtualized
 development environment online to test against.  Once I do, I'll
 provide more output on the problem.

I didn't mean to put anyone to a lot of trouble (unless it's worth it to
anyone).  It just might help to know if, for instance, you can request
swap_free or virtual_free, or whether it's just mem_free that's broken
for some reason.

-- 
Community Grid Engine:  http://arc.liv.ac.uk/SGE/
___
users mailing list
users@gridengine.org
https://gridengine.org/mailman/listinfo/users


Re: [gridengine users] failing mem_free request

2012-09-17 Thread Joseph Farran

Dave,

I am having the same/similar issues as Brian's but with 8.1.2.But for me, 
it's even worse.

There are only two resources I can request which are mem_total and 
swap_total.   All others fail.


$ qrsh -l mem_total=1M
Last login: Mon Sep 10 22:02:39 2012 from login-1-1.local

Kickstarted 10:30 16-Mar-2012
[me@compute-1-13 ~]$ exit

$ qrsh -l mem_free=1M
error: no suitable queues
$ qrsh -l mem_used=1M
error: no suitable queues



$ qrsh -l swap_total=1M
Last login: Mon Sep 17 09:49:50 2012 from login-1-1.local

[me@compute-1-13 ~]$ exit

$ qrsh -l swap_free=1M
error: no suitable queues
$ qrsh -l swap_used=1M
error: no suitable queues

$ qrsh -l swap_free=1M
$ qrsh -l swap_used=1M
error: no suitable queues


Anything else we can try to help narrow in on the issue?

Joseph



On 09/17/2012 09:33 AM, Dave Love wrote:

Brian Smithb...@usf.edu  writes:


Hi, Dave,

I tested only mem_free in my environment.  We're on el6.3 running SGE
8.1.1, x86_64.  My symptoms came from jobs not dispatching and turning
schedd_jobinfo on to reveal that that mem_free was an unknown
parameter on my hosts.  All I know is after adding the attribute to
the complex_values list for my exec hosts, we successfully
worked-around the issue.  I'm working on getting a virtualized
development environment online to test against.  Once I do, I'll
provide more output on the problem.

I didn't mean to put anyone to a lot of trouble (unless it's worth it to
anyone).  It just might help to know if, for instance, you can request
swap_free or virtual_free, or whether it's just mem_free that's broken
for some reason.



___
users mailing list
users@gridengine.org
https://gridengine.org/mailman/listinfo/users