Re: [galaxy-dev] Job handler : crash [SOLVED]

2013-02-06 Thread Christophe Caron


Thanks for your reply !

The patch has fixed the problem !

Christophe

Le 30/01/2013 00:58, Derrick Lin a écrit :

Hi, I had the similar issue a while ago, and it's fixed in

https://bitbucket.org/galaxy/galaxy-central/commits/c015b82b3944f967e2c859d5552c00e3e38a2da0

Hope this help
D


On Wed, Jan 30, 2013 at 9:39 AM, Christophe Caron 
christophe.ca...@sb-roscoff.fr wrote:


Hello,

We run Galaxy (2013 January version) with load balancing mode (5 x web
manager, 5 job handler) with Apache/Sun Grid Engine 6.0u4/CentOS 6.3

- Since 2 weeks, some handler job process crash during the Galaxy startup
with this error message in handlerx.log

.
Starting server in PID 13634.
serving on http://127.0.0.1:8091
galaxy.jobs.handler DEBUG 2013-01-29 20:06:48,902 Stopping job 22842:
galaxy.jobs.handler DEBUG 2013-01-29 20:06:48,902 stopping job 22842 in
drmaa runner


  - The system log files report a segfault with libdrmaa

kernel: python[13977]: segfault at 0 ip 7f2811805dc5 sp
7f27f4aac0a0 error 4 in libdrmaa.so.1.0[7f28116dd000+**185000]



Thanks for your help !

Christophe

--

Christophe Caron

Station Biologique / Service Informatique et Bio-informatique
Place Georges Teissier - CS 90074
29688 Roscoff Cedex

Analysis and Bioinformatics for Marine Science
http://abims.sb-roscoff.fr/

christophe.ca...@sb-roscoff.fr

tél: +33 (0)2 98 29 25 43 / +33 (0)6 07 83 54 77






__**_
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/





--

Christophe Caron

Station Biologique / Service Informatique et Bio-informatique
Place Georges Teissier - CS 90074
29688 Roscoff Cedex

Analysis and Bioinformatics for Marine Science
   http://abims.sb-roscoff.fr/

christophe.ca...@sb-roscoff.fr

tél: +33 (0)2 98 29 25 43 / +33 (0)6 07 83 54 77






___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

 http://lists.bx.psu.edu/

Re: [galaxy-dev] Job handler : crash

2013-01-29 Thread Derrick Lin
Hi, I had the similar issue a while ago, and it's fixed in

https://bitbucket.org/galaxy/galaxy-central/commits/c015b82b3944f967e2c859d5552c00e3e38a2da0

Hope this help
D


On Wed, Jan 30, 2013 at 9:39 AM, Christophe Caron 
christophe.ca...@sb-roscoff.fr wrote:

 Hello,

 We run Galaxy (2013 January version) with load balancing mode (5 x web
 manager, 5 job handler) with Apache/Sun Grid Engine 6.0u4/CentOS 6.3

 - Since 2 weeks, some handler job process crash during the Galaxy startup
 with this error message in handlerx.log

 .
 Starting server in PID 13634.
 serving on http://127.0.0.1:8091
 galaxy.jobs.handler DEBUG 2013-01-29 20:06:48,902 Stopping job 22842:
 galaxy.jobs.handler DEBUG 2013-01-29 20:06:48,902 stopping job 22842 in
 drmaa runner


  - The system log files report a segfault with libdrmaa

 kernel: python[13977]: segfault at 0 ip 7f2811805dc5 sp
 7f27f4aac0a0 error 4 in libdrmaa.so.1.0[7f28116dd000+**185000]



 Thanks for your help !

 Christophe

 --

 Christophe Caron

 Station Biologique / Service Informatique et Bio-informatique
 Place Georges Teissier - CS 90074
 29688 Roscoff Cedex

 Analysis and Bioinformatics for Marine Science
http://abims.sb-roscoff.fr/

 christophe.ca...@sb-roscoff.fr

 tél: +33 (0)2 98 29 25 43 / +33 (0)6 07 83 54 77






 __**_
 Please keep all replies on the list by using reply all
 in your mail client.  To manage your subscriptions to this
 and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/