Re: [gridengine users] Multi-GPU setup

2019-08-14 Thread Ian Kaufman
t; > > > > > > > ___ > > > users mailing list > > > users@gridengine.org > > > https://gridengine.org/mailman/listinfo/users > > > > -- > > > Andreas Haupt| E-Mail: andreas.ha...@des

Re: [gridengine users] Limiting each user's slots across all nodes

2019-03-12 Thread Ian Kaufman
And do you define host groups in the PE? On Tue, Mar 12, 2019 at 9:53 AM David Trimboli wrote: > > On 3/12/2019 12:05 PM, Ian Kaufman wrote: > > Are mynode{17-24} in a queue that is configured to use your "threads" PE? > > > Yes. If you disable the limit, the subm

Re: [gridengine users] Limiting each user's slots across all nodes

2019-03-12 Thread Ian Kaufman
-l vf=1G -l > h="mynode17|mynode18|mynode19|mynode20|mynode21|mynode22|mynode23|mynode24" > -pe threads 1 anyscript.sh > > It'll work if you remove "-pe threads 1". > _______ > users mailing list > users@gridengine

Re: [gridengine users] Grid Engine Sluggish

2019-01-26 Thread Ian Kaufman
IO issues? NFS server providing data and possibly jobs running over NDS shares as opposed to running on local disk? On Sat, Jan 26, 2019, 11:23 AM Joseph Farran Hi Daniel. > > Yes I do have large job-arrays around 7k tasks BUT I have had larger job > arrays of 500k without seeing this kind of

Re: [gridengine users] Installing man pages

2019-01-25 Thread Ian Kaufman
> unload/fix the apparmor profiles already loaded in the kernel. > > > > I haven't worked with apparmor, so you may need to search the internet > > for solutions. > > > I'll take a look. My brief perusal suggests that it's a nontrivial task > to do so. I'll see what I find. > > Thanks! > > ___ > users mailing list > users@gridengine.org > https://gridengine.org/mailman/listinfo/users > -- Ian Kaufman Research Systems Administrator UC San Diego, Jacobs School of Engineering ikaufman AT ucsd DOT edu ___ users mailing list users@gridengine.org https://gridengine.org/mailman/listinfo/users

Re: [gridengine users] mpirun without ssh

2018-03-22 Thread Ian Kaufman
> and it's not really fair for other user that use qsub. > > Then the simpliest way avoiding this is to forbiden the connection > > I'm not specialist of PAM and authentication have you got a link ? > > > > > ___ > users mailing

Re: [gridengine users] mpirun without ssh

2018-03-22 Thread Ian Kaufman
skyl...@u.washington.edu) > -- Genome Sciences Department, System Administrator > -- Foege Building S046, (206)-685-7354 > -- University of Washington School of Medicine > ___________ > users mailing list > users@gridengine.org > https://gridengi

Re: [gridengine users] Strange behavior with functional scheduling

2017-10-09 Thread Ian Kaufman
25 > weight_tickets_functional 1 > weight_tickets_share 0 > > Perhaps these settings might be causing our issue? Seems unlikely though, > as we're not taking project or department into account in our scheduling. > > Thanks, > > DR > > > On

Re: [gridengine users] Strange behavior with functional scheduling

2017-10-09 Thread Ian Kaufman
ng and man page > reading on the relevant topics and settings, but wasn't able to find a good > explanation for the behavior we're seeing. Any help greatly appreciated! > > Thanks, > > DR > ___ > users mailing list > users@grideng

Re: [gridengine users] mpirun noticed that job rank 0 with PID 27581 on node compute-0-9.local exited on signal 11 (Segmentation fault)

2017-10-06 Thread Ian Kaufman
> exited on signal 11 (Segmentation fault) > > > Thanks, > Subashini.K > > ___ > users mailing list > users@gridengine.org > https://gridengine.org/mailman/listinfo/users > > -- Ian Kaufman Research Systems Administrator UC San

Re: [gridengine users] Max jobs per user

2017-10-06 Thread Ian Kaufman
lease > notify the sender immediately and delete this email from your computer. > > ___ > users mailing list > users@gridengine.org > https://gridengine.org/mailman/listinfo/users > > -- Ian Kaufman Research Systems Administrator UC San Diego, Jacobs School of E

Re: [gridengine users] mpirun segmentation fault

2017-10-06 Thread Ian Kaufman
; [compute-0-4:09088] *** End of error message *** > > > /opt/gridengine/default/spool/compute-0-4/job_scripts/5020911: line 12: > 9088 Segmentation fault (core dumped) mpirun -np 4 gmx mdrun -ntmpi 1 > -ntomp 8 -v -deffnm eql2 > > > > What is the reason behind this? >

Re: [gridengine users] /opt/gridengine/default/spool/compute-0-3/job_scripts/XXXXXXX: line 25: 6425 Segmentation fault (core dumped)

2017-10-06 Thread Ian Kaufman
e more about this error? > > > Please help. > > Thanks, > Subashini.K > > ___ > users mailing list > users@gridengine.org > https://gridengine.org/mailman/listinfo/users > > -- Ian Kaufman Resea

Re: [gridengine users] RUNNING GROMACS SIMULATIONS THROUGH SCRIPT FILE

2017-08-11 Thread Ian Kaufman
ethod to set the path in the above submit.sh file? >>> >>> Executable: /usr/local/gromacs/bin/gmx >>> Library dir: /usr/local/gromacs/share/gromacs/top >>> >>> >>> Can anyone help me? >>> >>> Thanks, >>> Subash

Re: [gridengine users] new error I've never seen before! ("sge_shepherd won't run -- dynamic library missing?")

2017-08-09 Thread Ian Kaufman
y actually > be missing ... > > > Chris > > > ___ > users mailing list > users@gridengine.org > https://gridengine.org/mailman/listinfo/users > -- Ian Kaufman Research Systems Administrator UC San Diego, Jacobs School of Engineering ikaufman AT ucsd DOT edu

Re: [gridengine users] (resend) dealing with AD usernames that contain "@" character

2017-08-02 Thread Ian Kaufman
and enumerate users and groups who are in child domains like > NAFTA.COMPANY.COM and EAME.COMPANY.COM etc. > > -dag > > > > Ian Kaufman wrote: > >> If you support multiple domains, are you able to guarantee unique short >> names? It seems to me that could be a problem. If it is

Re: [gridengine users] (resend) dealing with AD usernames that contain "@" character

2017-08-02 Thread Ian Kaufman
; > > ___________ > users mailing list > users@gridengine.org > https://gridengine.org/mailman/listinfo/users > -- Ian Kaufman Research Systems Administrator UC San Diego, Jacobs School of Engineering ikaufman AT ucsd DOT edu ___ users mailing list users@gridengine.org https://gridengine.org/mailman/listinfo/users

Re: [gridengine users] DISPLAY problem in RHEL6.8

2017-07-18 Thread Ian Kaufman
se of the named > recipient(s) above. Any unauthorized use or disclosure of this email is > strictly prohibited. If you are not the intended recipient(s), please > notify the sender immediately and delete this email from your computer. > > ___ > users mailing list > u

Re: [gridengine users] Fwd: eqw for qsub jobs

2016-09-28 Thread Ian Kaufman
09/27/2016 23:29:50 >>> 1 >>> 1144482 0.55500 sas64 username Eqw 09/27/2016 23:30:40 >>> 1 >>> 1144484 0.55500 sas64 username Eqw 09/27/2016 23:31:30 >>>

Re: [gridengine users] Hardware thoughts?

2016-07-20 Thread Ian Kaufman
do some kind of visualization > on them, but I've never had complaints about them being under-powered yet. > > Any thoughts you might have are appreciated. > > Thanks > Biggles > > _______ > users mailing list > users@gridengine.

Re: [gridengine users] Reported memory usage too high

2016-06-02 Thread Ian Kaufman
ng a node. >>> >>> Thank you, >>> Nico >>> >>> >>> ___ >>> users mailing list >>> users@gridengine.org >>> https://gridengine.org/mailman/listinfo/users >>> >>> >> -- >> Alex Chekhol

Re: [gridengine users] How to set a minimum free memory limit for any task submission on SGE?

2016-06-01 Thread Ian Kaufman
llocated and then use qacct > to look at the actual max usage so they know what they should ask for next > time. We had some teething troubles with this for a few weeks after it was > introduced, but it's all been working smoothly for a long time now. > > -- Ian Kaufman Research System

Re: [gridengine users] All queues dropped because of overload or full

2016-05-25 Thread Ian Kaufman
sioning and service > management behind the scenes. > > Chris > > > > Pat Haley wrote: > > > > It looks similar but one big difference is when I run "qconf -sh" I > > see all my compute nodes listed along with my frontend. However > > "

Re: [gridengine users] How to set up h_vmem as a consumable resource

2015-02-25 Thread Ian Kaufman
users@gridengine.org https://gridengine.org/mailman/listinfo/users -- Ian Kaufman Research Systems Administrator UC San Diego, Jacobs School of Engineering ikaufman AT ucsd DOT edu ___ users mailing list users@gridengine.org https://gridengine.org

Re: [gridengine users] How to set up h_vmem as a consumable resource

2015-02-24 Thread Ian Kaufman
, but we've been unable to figure out how to set this up. We are using 6.1u3. thanks ___ users mailing list users@gridengine.org https://gridengine.org/mailman/listinfo/users -- Ian Kaufman Research Systems Administrator UC San Diego, Jacobs School

Re: [gridengine users] Epilog to print out usage summary?

2015-01-23 Thread Ian Kaufman
___ users mailing list users@gridengine.org https://gridengine.org/mailman/listinfo/users -- Ian Kaufman Research Systems Administrator UC San Diego, Jacobs School of Engineering ikaufman AT ucsd DOT edu

Re: [gridengine users] Cannot request resource if it is a load value of memory type: SGE reports it as unknown resource

2015-01-23 Thread Ian Kaufman
until a week ago. Ilya. Original Message Subject: Re: [gridengine users] Cannot request resource if it is a load value of memory type: SGE reports it as unknown resource From: Ian Kaufman ikauf...@eng.ucsd.edu To: Ilya M 4ilya.m+g...@gmail.com Date: 1/23/15, 11:38 AM

Re: [gridengine users] Enforce users to use specific amount of memory/slot

2014-06-30 Thread Ian Kaufman
for this reason, to get memory limits based on resident memory, if it seems worth it enough in the end. -M ___ users mailing list users@gridengine.org https://gridengine.org/mailman/listinfo/users -- Ian Kaufman Research Systems Administrator UC

Re: [gridengine users] Enforce users to use specific amount of memory/slot

2014-06-30 Thread Ian Kaufman
nodes. Maybe there is some sorts of resource equivalency between slot and memory can achieve that? Thanks D Sent from my iPad On 1 Jul 2014, at 5:57 am, Ian Kaufman ikauf...@eng.ucsd.edu wrote: I don't get the problem here. If a single core job (let's assume it cannot easily

Re: [gridengine users] schedd dies and error messages

2014-05-16 Thread Ian Kaufman
have, otherwise you will run into a backlog of jobs, and if an array job, this can even be more serious. Ian -- Ian Kaufman Research Systems Administrator UC San Diego, Jacobs School of Engineering ikaufman AT ucsd DOT edu ___ users mailing list users

Re: [gridengine users] Find out host a job is running on?

2014-04-23 Thread Ian Kaufman
___ users mailing list users@gridengine.org https://gridengine.org/mailman/listinfo/users -- Ian Kaufman Research Systems Administrator UC San Diego, Jacobs School of Engineering ikaufman AT ucsd DOT edu

Re: [gridengine users] array tasks memory usage

2014-04-15 Thread Ian Kaufman
. Is PhythonRender the one and only child of `sge_shepherd`? -- Reuti ___ users mailing list users@gridengine.org https://gridengine.org/mailman/listinfo/users -- Ian Kaufman Research Systems Administrator UC San Diego, Jacobs School of Engineering

Re: [gridengine users] SGE and GPU

2014-04-14 Thread Ian Kaufman
(and Phi). Not need to be complicated and powerful, just do basic work. Thanks, ___ users mailing list users@gridengine.org https://gridengine.org/mailman/listinfo/users -- Ian Kaufman Research Systems Administrator UC San Diego, Jacobs School

Re: [gridengine users] SGE and GPU

2014-04-14 Thread Ian Kaufman
may still happen to collide to each other on the same GPU on a multiple GPU node. If GE can have the memory to record the GPUs allocated to a job, then this can be perfect. On Mon, Apr 14, 2014 at 1:46 PM, Ian Kaufman ikauf...@eng.ucsd.edu wrote: I believe there already is support for GPUs

Re: [gridengine users] SGE and GPU

2014-04-14 Thread Ian Kaufman
of the queue name. -- Reuti On Mon, Apr 14, 2014 at 1:46 PM, Ian Kaufman ikauf...@eng.ucsd.edu wrote: I believe there already is support for GPUs - there is a GPU Load Sensor in Open Grid Engine. You may have to build it yourself, I haven't checked to see if it comes pre-packaged. Univa has Phi

Re: [gridengine users] SGE and GPU

2014-04-14 Thread Ian Kaufman
with PE=8. SGE allocate all the 3 nodes to me with 8 GPU slots. The problem is now: how my job knows what GPUs it can get on node1? Best On Mon, Apr 14, 2014 at 4:13 PM, Ian Kaufman ikauf...@eng.ucsd.edu wrote: Again, look into using it as a consumable resource as Gowtham posted above

Re: [gridengine users] SGE and GPU

2014-04-14 Thread Ian Kaufman
If everything is configured correctly, GridEngine will be aware that the GPU in node1 is in use, and schedule around it, ensuring that the 8 GPU job will get unused GPUs. Ian On Mon, Apr 14, 2014 at 1:38 PM, Ian Kaufman ikauf...@eng.ucsd.edu wrote: Look at the info presented here: http

Re: [gridengine users] SGE and GPU

2014-04-14 Thread Ian Kaufman
And here is some more info: http://serverfault.com/questions/322073/howto-set-up-sge-for-cuda-devices On Mon, Apr 14, 2014 at 1:39 PM, Ian Kaufman ikauf...@eng.ucsd.edu wrote: If everything is configured correctly, GridEngine will be aware that the GPU in node1 is in use, and schedule around

Re: [gridengine users] Is there any way to determine who submitted a jobID long after the job completed?

2014-04-14 Thread Ian Kaufman
Elliott Avenue 6th Floor, 6S139 Seattle, WA 98121 O: (206)-267-1097 ext 220 F: (206)-441-3033 ___ users mailing list users@gridengine.org https://gridengine.org/mailman/listinfo/users -- Ian Kaufman Research Systems Administrator UC San

Re: [gridengine users] problems with maxvmem

2014-03-03 Thread Ian Kaufman
. ___ users mailing list users@gridengine.org https://gridengine.org/mailman/listinfo/users -- Ian Kaufman Research Systems Administrator UC San Diego, Jacobs School of Engineering ikaufman AT ucsd DOT edu ___ users mailing list users

Re: [gridengine users] How to manage grid nodes

2013-10-03 Thread Ian Kaufman
/listinfo/users -- Ian Kaufman Research Systems Administrator UC San Diego, Jacobs School of Engineering ikaufman AT ucsd DOT edu ___ users mailing list users@gridengine.org https://gridengine.org/mailman/listinfo/users

Re: [gridengine users] How to manage grid nodes

2013-10-01 Thread Ian Kaufman
___ users mailing list users@gridengine.org https://gridengine.org/mailman/listinfo/users -- Ian Kaufman Research Systems Administrator UC San Diego, Jacobs School of Engineering ikaufman AT ucsd DOT edu ___ users mailing list users@gridengine.org

Re: [gridengine users] adaptive computing spam?

2013-09-06 Thread Ian Kaufman
://www.cape-horn-eng.com ___ users mailing list users@gridengine.org https://gridengine.org/mailman/listinfo/users -- Ian Kaufman Research Systems Administrator UC San Diego, Jacobs School of Engineering ikaufman AT ucsd DOT edu

Re: [gridengine users] h_vmem not honored at all?

2013-01-09 Thread Ian Kaufman
@gridengine.org https://gridengine.org/**mailman/listinfo/usershttps://gridengine.org/mailman/listinfo/users -- Ian Kaufman Research Systems Administrator UC San Diego, Jacobs School of Engineering ikaufman AT ucsd DOT edu ___ users mailing list users

Re: [gridengine users] vmem allocation

2012-12-19 Thread Ian Kaufman
/listinfo/users -- Ian Kaufman Research Systems Administrator UC San Diego, Jacobs School of Engineering ikaufman AT ucsd DOT edu ___ users mailing list users@gridengine.org https://gridengine.org/mailman/listinfo/users

Re: [gridengine users] vmem allocation

2012-12-19 Thread Ian Kaufman
-495-6914 Sending me a large file? Use my secure dropbox: https://cscb-filetransfer.wistar.upenn.edu/dropbox/btay...@wistar.org ** ** *From:* Ian Kaufman [mailto:ikauf...@eng.ucsd.edu] *Sent:* Wednesday, December 19, 2012 11:37 AM *To:* Brett Taylor *Cc:* users@gridengine.org

Re: [gridengine users] $'\r': command not found

2012-12-13 Thread Ian Kaufman
-- Ian Kaufman Research Systems Administrator UC San Diego, Jacobs School of Engineering ikaufman AT ucsd DOT edu ___ users mailing list users@gridengine.org https://gridengine.org/mailman/listinfo/users

Re: [gridengine users] $'\r': command not found

2012-12-11 Thread Ian Kaufman
/packages/Mathematica/7.0/Executables/math -run teller=$SGE_TASK_ID; ModelFotokatalyseTAT.m ___ users mailing list users@gridengine.org https://gridengine.org/mailman/listinfo/users -- Ian Kaufman Research Systems Administrator UC San Diego

Re: [gridengine users] vmem and maxvmem

2012-09-14 Thread Ian Kaufman
the qstat command. Ian -- Ian Kaufman Research Systems Administrator UC San Diego, Jacobs School of Engineering ikaufman AT ucsd DOT edu ___ users mailing list users@gridengine.org https://gridengine.org/mailman/listinfo/users

Re: [gridengine users] SGE and network switches and limitations?

2012-09-11 Thread Ian Kaufman
100K jobs in two months). Ian -- Ian Kaufman Research Systems Administrator UC San Diego, Jacobs School of Engineering ikaufman AT ucsd DOT edu ___ users mailing list users@gridengine.org https://gridengine.org/mailman/listinfo/users

Re: [gridengine users] Linux OOM killer oom_adj

2012-08-30 Thread Ian Kaufman
and 48GB of RAM, I can run nearly 12 jobs keeping the desired 1 per core. Ian -- Ian Kaufman Research Systems Administrator UC San Diego, Jacobs School of Engineering ikaufman AT ucsd DOT edu ___ users mailing list users@gridengine.org https

Re: [gridengine users] Linux OOM killer oom_adj

2012-08-30 Thread Ian Kaufman
. Additionally, the crew I work with use a workflow/job management tool that isn't thread aware, so every child Java app that gets launched gets its own JVM. That is high on the list of things to fix! Ian -- Ian Kaufman Research Systems Administrator UC San Diego, Jacobs School of Engineering ikaufman