Am 05.03.2013 um 14:08 schrieb Ian Johnson: > Reuti, > > Yes. I can ssh from the submit host and the execution host.
Good. Is there any firewall configured on the submit host? -- Reuti > Thanks, > > Ian > > On Tue, 05 Mar 2013 13:07:34 -0000, Reuti <[email protected]> wrote: > >> Am 05.03.2013 um 10:39 schrieb Ian Johnson: >> >>> Reuti, >>> >>> I was wondering about both exit status being 0, of qrsh, and the error >>> being set on the queue. The output of qacct is: >>> >>> $ qacct -j 152 >>> <snip> >>> qrsh works, however, from the master host, which is both a submit and >>> administration host: as is the host I ran the "failing" qrsh process. >> >> For `qrsh` to work it needs a direct connection between the submit host and >> the exec host (or some TCP/IP forwarding on the master host). Does the >> machine where you initiated the failing `qrsh` have a direct connection to >> the exechost? >> >> -- Reuti >> >> >>> Thanks, >>> >>> Ian >>> >>> On Mon, 04 Mar 2013 17:36:47 -0000, Reuti <[email protected]> >>> wrote: >>> >>>> Am 04.03.2013 um 14:27 schrieb Ian Johnson: >>>> >>>>> Dear All, >>>>> >>>>> I built release 2011.11p1 of Open Grid Engine and I'm having a problem >>>>> with qrsh not scheduling an interactive job on an execution host. >>>>> Invoking: >>>>> >>>>> $ qrsh -q all.q -verbose >>>>> local configuration broker not defined - using global configuration >>>>> Your job 152 ("QRLOGIN") has been submitted >>>>> waiting for interactive job to be scheduled ... >>>>> $ echo $? >>>>> 0 >>>>> >>>>> And the exit status is 0! >>>>> >>>>> However, the queue is left in an error state: >>>>> >>>>> --------------------------------------------------------------------------------- >>>>> all.q@exec_1 BIP 0/0/4 0.00 linux-x64 >>>>> E >>>>> queue all.q marked QERROR as result of job 152's failure at host >>>>> exec_1 >>>>> --------------------------------------------------------------------------------- >>>>> >>>>> Would anyone know what's going on here, or has anyone seen this behaviour >>>>> before? >>>> >>>> What created the error? >>>> >>>> Are you know wondering about the exit code being zero, or the queue being >>>> in error state for unknown reason? There might be something in the >>>> messages file of the qmaster or the node specific one. >>>> >>>> What was recorded in: >>>> >>>> $ qacct -j 152 >>>> >>>> -- Reuti >>>> >>>> >>>> >>>>> -- >>>>> Thank you, >>>>> >>>>> Ian Johnson >>>>> Software Engineer >>>>> >>>>> Capita Translation and Interpreting >>>>> Riverside Court, Huddersfield Road, Delph, Oldham, OL3 5FZ | Tel (UK): >>>>> +44 845 367 7000 | Tel (US): +1 (800) 579-5010 >>>>> | [email protected] | Skype ID: ian.johnson_als >>>>> www.capitatranslationinterpreting.com >>>>> _______________________________________________ >>>>> users mailing list >>>>> [email protected] >>>>> https://gridengine.org/mailman/listinfo/users >>>> >>> >>> >>> -- >>> Kind regards, >>> >>> Ian Johnson >>> Software Engineer >>> >>> Capita Translation and Interpreting >>> Riverside Court, Huddersfield Road, Delph, Oldham, OL3 5FZ | Tel (UK): +44 >>> 845 367 7000 | Tel (US): +1 (800) 579-5010 >>> | [email protected] | Skype ID: ian.johnson_als >>> www.capitatranslationinterpreting.com >> > > > -- > Kind regards, > > Ian Johnson > Software Engineer > > Capita Translation and Interpreting > Riverside Court, Huddersfield Road, Delph, Oldham, OL3 5FZ | Tel (UK): +44 > 845 367 7000 | Tel (US): +1 (800) 579-5010 > | [email protected] | Skype ID: ian.johnson_als > www.capitatranslationinterpreting.com _______________________________________________ users mailing list [email protected] https://gridengine.org/mailman/listinfo/users
