Re: [gridengine users] Memory allocation woes. Any thoughts?

2012-12-12 Thread Reuti
Am 12.12.2012 um 02:17 schrieb Jake Carroll: Cool. Thanks for the response guys. See in line: On 12/12/12 6:45 AM, Reuti re...@staff.uni-marburg.de wrote: Am 11.12.2012 um 21:32 schrieb Gowtham: I second Alex's thoughts. In all our clusters, we only use h_vmem The difference is

Re: [gridengine users] qstat reports a job's priority as '-nan'

2012-12-12 Thread Gowtham
Thanks, Luca and Reuti. From my cluster notes, I noticed that during ArcGIS 10.1 server installation, it had prompted us to include the following lines in '/etc/security/limits.conf' root soft nofile 65535 root hard nofile 65535 root soft nproc 25059 root hard nproc 25059 The '-nan'

Re: [gridengine users] qstat reports a job's priority as '-nan'

2012-12-12 Thread Reuti
Am 12.12.2012 um 14:04 schrieb Gowtham: From my cluster notes, I noticed that during ArcGIS 10.1 server installation, it had prompted us to include the following lines in '/etc/security/limits.conf' root soft nofile 65535 root hard nofile 65535 root soft nproc 25059 root hard nproc

[gridengine users] Rescheduling parallel/OpenMPI jobs

2012-12-12 Thread William Hay
I've been working on adding BLCR checkpointing for OpenMPI jobs on our cluster. Although the checkpoint and restart themselves seem to work in the process I encountered a few issues if I reschedule a multi-node job via qmod -rq or qmod -rj. 1)I get errors in the messages file of nodes running

Re: [gridengine users] qstat reports a job's priority as '-nan'

2012-12-12 Thread Gowtham
No sir, These lines were added only to the front end; I have checked again and none of the compute nodes had them. Best regards, g -- Gowtham Information Technology Services Michigan Technological University (906) 487/3593 http://www.it.mtu.edu/ On Wed, 12 Dec 2012, Reuti wrote: | Am

Re: [gridengine users] Some generic questions: binding, parallel, over-subscription

2012-12-12 Thread Arnau Bria
On Tue, 11 Dec 2012 22:19:19 +0100 Reuti Reuti wrote: Hi, Hi Reuti, [...] NB: default requests for consumables I would put in the complex definition. Could you please explain this a little more? What differences are there between defining it in the complex or in the sge_request? [...]

Re: [gridengine users] Some generic questions: binding, parallel, over-subscription

2012-12-12 Thread Reuti
Am 12.12.2012 um 15:46 schrieb Arnau Bria: On Tue, 11 Dec 2012 22:19:19 +0100 Reuti Reuti wrote: Hi, Hi Reuti, [...] NB: default requests for consumables I would put in the complex definition. Could you please explain this a little more? What differences are there between

[gridengine users] MPI jobs on a multi-architecture cluster?

2012-12-12 Thread bergman
I've got a question that's very similar to Joseph Farran's query How do I request the CPU type in qrsh / qsub with SGE 8.1.2? [1], but which is a problem specifically with MPI jobs. I think we're running into a chipset-architecture issue (AMD vs Intel) in OpenMPI jobs. We're using SGE 6.2u5 and

Re: [gridengine users] MPI jobs on a multi-architecture cluster?

2012-12-12 Thread Brendan Moloney
We've got a locally-written program that dynamically links against a package that's compiled with optimizations for different chipsets (ATLAS[2]). We've built ATLAS with multiple versions, optimized for each architecture in our cluster. This is fine for serial jobs--the login environment