Re: [SGE-discuss] SGE systemd integration - can I contribute my patches?

2019-08-30 Thread Ondrej Valousek
If anyone is willing to try this out, here are my patches: https://extranet.adestotech.com/soge-8.1.9-patch.tar Let me know From: Daniel Povey Sent: Saturday, August 24, 2019 6:59 PM To: Ondrej Valousek Cc: sge-disc...@liverpool.ac.uk Subject: Re: [SGE-discuss] SGE systemd integration - can

[SGE-discuss] SGE systemd integration - can I contribute my patches?

2019-08-23 Thread Ondrej Valousek
an I send my patches somewhere so it can be possibly merged with the SoGE main repo? Thanks, Ondrej From: Ondrej Valousek Sent: Friday, August 9, 2019 1:40 PM To: 'us...@gridengine.org' mailto:us...@gridengine.org>> Subject: SGE & systemd integration Hi all, I am thinking of making

Re: [SGE-discuss] download links not working

2017-12-01 Thread Ondrej Valousek
mean for this email list and other resources hosted at > liverpool? > > Chris > > > > On Thu, 30 Nov 2017 15:43:51 + > Ondrej Valousek <ondrej.valou...@s3group.com> wrote: > > > Hello, > > I have noticed most of the download links at: > > > >

[SGE-discuss] download links not working

2017-11-30 Thread Ondrej Valousek
Hello, I have noticed most of the download links at: https://arc.liv.ac.uk/trac/SGE are broken. Can we have that fixed? Thx, Ondrej - The information contained in this e-mail and in any attachments is confidential and is designated solely for the attention of the intended recipient(s). If

Re: [SGE-discuss] CGROUP support in sgeexecd

2017-11-21 Thread Ondrej Valousek
Using 8.1.8. Will try 8.1.9, thanks... O. -Original Message- From: William Hay [mailto:w@ucl.ac.uk] Sent: Tuesday, November 21, 2017 12:56 PM To: Ondrej Valousek <ondrej.valou...@s3group.com> Cc: SGE-discuss@liv.ac.uk <sge-disc...@liverpool.ac.uk> Subject: Re: [SGE-dis

[SGE-discuss] Problem with sgemaster & higher number of clients

2017-08-23 Thread Ondrej Valousek
Hi List, When running qstat, I am sometimes receiving messages like: ''ERROR: failed receiving gdi request response for mid=1 (got syncron message receive timeout error)". Also, qping - info shows warning/error and high number of qmaster clients (> 40) at times when I receive messages like

Re: [SGE-discuss] Another QRSH problem

2017-06-01 Thread Ondrej Valousek
By default it's not consumable so therefore it is not enforced. >-Original Message- >From: juanesteban.jime...@mdc-berlin.de >[mailto:juanesteban.jime...@mdc-berlin.de] >Sent: Thursday, June 01, 2017 11:50 AM >To: Ondrej Valousek <ondrej.valou...@s3group.com>; Re

Re: [SGE-discuss] Another QRSH problem

2017-06-01 Thread Ondrej Valousek
Makes a sense to me. By setting h_vmem to 0 you define shell limit for address space to 0 - which means nothing can actually start as every malloc() will return E_NOMEM. Simple. >-Original Message- >From: SGE-discuss [mailto:sge-discuss-boun...@liverpool.ac.uk] On Behalf Of

[SGE-discuss] qstat error

2017-04-05 Thread Ondrej Valousek
Hi List, When running qstat -j -xml, qstat always return 0 - even in case job_id is incorrect. I guess we should return 1 in that case, right? Ondrej - The information contained in this e-mail and in any attachments is confidential and is designated solely for the attention of the

Re: [SGE-discuss] Customizing emails send from GridEngine (was: RE: Enforcing memory limit for job)

2017-03-23 Thread Ondrej Valousek
much? This way I believe we would get GE to show more accurate job exit status it case it was killed due to the lack of memory. Many thanks, Ondrej From: Reuti [mailto:re...@staff.uni-marburg.de] Sent: Tuesday, March 21, 2017 6:30 PM To: Ondrej Valousek <ondrej.valou...@s3group.com> Cc: W

Re: [SGE-discuss] Enforcing memory limit for job

2017-03-21 Thread Ondrej Valousek
> Sent: Tuesday, March 21, 2017 2:27 PM > To: Ondrej Valousek <ondrej.valou...@s3group.com> > Cc: William Hay <w@ucl.ac.uk>; sge-discuss@liv.ac.uk disc...@liverpool.ac.uk> > Subject: Re: [SGE-discuss] Enforcing memory limit for job > > > > Am 21.03.2017 um

Re: [SGE-discuss] Enforcing memory limit for job

2017-03-21 Thread Ondrej Valousek
age- > From: Reuti [mailto:re...@staff.uni-marburg.de] > Sent: Tuesday, March 21, 2017 2:05 PM > To: Ondrej Valousek <ondrej.valou...@s3group.com> > Cc: William Hay <w@ucl.ac.uk>; sge-discuss@liv.ac.uk disc...@liverpool.ac.uk> > Subject: Re: [SGE-discuss]

Re: [SGE-discuss] Enforcing memory limit for job

2017-03-21 Thread Ondrej Valousek
--Original Message- > From: William Hay [mailto:w@ucl.ac.uk] > Sent: Tuesday, March 14, 2017 4:18 PM > To: Ondrej Valousek <ondrej.valou...@s3group.com> > Cc: sge-discuss@liv.ac.uk <sge-disc...@liverpool.ac.uk> > Subject: Re: [SGE-discuss] Enforcing memory limit for job &

[SGE-discuss] Enforcing memory limit for job

2017-03-10 Thread Ondrej Valousek
Hi List, I need a help with setting default limits for jobs. I would need something that would limit job memory consumption to say 20Gb but was not consumable unless explicitly specified by a user. I thought setting h_vmem attribute in GE complex configuration would do the trick, but it is

Re: [SGE-discuss] Your "qrsh" request could not be scheduled, try again later.

2017-02-17 Thread Ondrej Valousek
> So you expect that the jobs should go to the BATCH queue (due to the setting > in sge_request) and wait indefinitely to be scheduled. > But you observe that they kick out soon with "Your "qrsh" request could not > be scheduled, try again later.". > > Do you have a personal ~/.sge_request? > >

Re: [SGE-discuss] Your "qrsh" request could not be scheduled, try again later.

2017-02-17 Thread Ondrej Valousek
I mean "man sge_request" -> so the outcome is what? Qrsh ignores sge_request file? Could you clarify? Thanks, Ondrej -Original Message- From: Reuti [mailto:re...@staff.uni-marburg.de] Sent: Friday, February 17, 2017 12:55 PM To: Ondrej Valousek <ondrej.valou...@s3gr

Re: [SGE-discuss] Your "qrsh" request could not be scheduled, try again later.

2017-02-17 Thread Ondrej Valousek
... -Original Message- From: Reuti [mailto:re...@staff.uni-marburg.de] Sent: Friday, February 17, 2017 12:25 PM To: Ondrej Valousek <ondrej.valou...@s3group.com> Cc: sge-disc...@liverpool.ac.uk Subject: Re: [SGE-discuss] Your "qrsh" request could not be scheduled, try again later.

Re: [SGE-discuss] Trouble installing SGE 8 on Centos 7

2017-01-12 Thread Ondrej Valousek
It's probably firewalld service running on qmaster. Try to stop it or add exception Otherwise GE has no problem running on Centos 7. -Original Message- From: SGE-discuss [mailto:sge-discuss-boun...@liverpool.ac.uk] On Behalf Of Maximilian Friedersdorff Sent: Thursday, January 12, 2017

[SGE-discuss] ghost jobs in GE

2017-01-06 Thread Ondrej Valousek
Hi List, I have a problem with a ghost jobs - these are (as per qstat) usually in state 'r' or 'dr' - so running, but in fact nothing is running on the specified node. I can even reboot the node, but they do not vanish - still there. They can be deleted, but it's still worrying as they are

[SGE-discuss] USE_CGROUPS

2016-12-20 Thread Ondrej Valousek
Hi List, I just enabled USE_CGROUPS execd parameters and I observe that - Relevant job cgroup is created in /dev/cpuset/sge - Task PIDs can not be found in /dev/cpuset/sge//tasks What could be wrong? I also use ENABLE_ADDGRP_KILL=true and USE_QSUB_GID=false, son of GE version

[SGE-discuss] Core files generation

2016-01-25 Thread Ondrej Valousek
Hi list, I have a strange problem. Jobs submitted to GE tend to crash and generate core files. I want avoid generation of core files as they tend to hog our shared storage, so I defined system-wide policy in /etc/security/limits.conf My problem - if I submit the job normally via qrsh, core

Re: [SGE-discuss] SGE 8.1.8 CGROUP question

2015-12-08 Thread Ondrej Valousek
...@liverpool.ac.uk] Sent: Tuesday, December 08, 2015 4:10 PM To: Ondrej Valousek <ondrej.valou...@s3group.com> Cc: sge-discuss@liv.ac.uk <sge-disc...@liverpool.ac.uk> Subject: Re: [SGE-discuss] SGE 8.1.8 CGROUP question Ondrej Valousek <ondrej.valou...@s3group.com> writes: > Hi List, >

Re: [SGE-discuss] SGE 8.1.8 CGROUP question

2015-12-08 Thread Ondrej Valousek
) would naturally do the job for me. Ondrej -Original Message- From: Ondrej Valousek Sent: Tuesday, December 08, 2015 4:19 PM To: 'Dave Love,,,' <d.l...@liverpool.ac.uk> Cc: sge-discuss@liv.ac.uk <sge-disc...@liverpool.ac.uk> Subject: RE: [SGE-discuss] SGE 8.1.8 C

Re: [SGE-discuss] SGE 8.1.8 CGROUP question

2015-12-07 Thread Ondrej Valousek
/2012/05/grid-engine-cgroups-integration.html Ondrej -Original Message- From: Reuti [mailto:re...@staff.uni-marburg.de] Sent: Monday, December 07, 2015 11:35 AM To: Ondrej Valousek <ondrej.valou...@s3group.com> Cc: sge-discuss@liv.ac.uk <sge-disc...@liverpool.ac.uk> Subject: Re:

Re: [SGE-discuss] SGE 8.1.8 CGROUP question

2015-12-07 Thread Ondrej Valousek
ortunately as it would be the cleanest solution). Ondrej -Original Message- From: Mark Dixon [mailto:m.c.di...@leeds.ac.uk] Sent: Monday, December 07, 2015 5:50 PM To: Ondrej Valousek <ondrej.valou...@s3group.com> Cc: sge-disc...@liverpool.ac.uk Subject: Re: [SGE-discuss] SGE 8.1.8

[SGE-discuss] SGE 8.1.8 CGROUP question

2015-12-04 Thread Ondrej Valousek
Hi List, We have just started using SGE 8.1.8 with the CGROUP support - works well, but I spotted that in spite of being CGROUP enabled, jobs are still submitted with the additional GID attached to them. My understanding that CGROUP will take care of tracking the job processes - hence no need