Hello,

We run Galaxy (January 2013 release) in load-balancing mode (5 web managers, 5 job handlers) with Apache, Sun Grid Engine 6.0u4 and CentOS 6.3.


- For the past 2 weeks, some job handler processes have been crashing during Galaxy startup, with the following output in handlerx.log:

.....
Starting server in PID 13634.
serving on http://127.0.0.1:8091
galaxy.jobs.handler DEBUG 2013-01-29 20:06:48,902 Stopping job 22842:
galaxy.jobs.handler DEBUG 2013-01-29 20:06:48,902 stopping job 22842 in drmaa runner


- The system log files report a segfault in libdrmaa:

kernel: python[13977]: segfault at 0 ip 00007f2811805dc5 sp 00007f27f4aac0a0 error 4 in libdrmaa.so.1.0[7f28116dd000+185000]
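
In case it helps to locate the crash, the offset of the faulting instruction inside libdrmaa can be derived from that kernel line (instruction pointer minus the load address of the library). Below is a minimal sketch in Python. The function name parse_segfault and the regular expression are only illustrative; they are not part of Galaxy or of the DRMAA library.

```python
import re

# Kernel line copied from the system log above.
LINE = ("kernel: python[13977]: segfault at 0 ip 00007f2811805dc5 "
        "sp 00007f27f4aac0a0 error 4 in libdrmaa.so.1.0[7f28116dd000+185000]")

# All numeric fields in this kernel message are hexadecimal.
PATTERN = re.compile(
    r"segfault at (?P<addr>[0-9a-f]+) ip (?P<ip>[0-9a-f]+) "
    r"sp (?P<sp>[0-9a-f]+) error (?P<err>[0-9a-f]+) "
    r"in (?P<lib>[^\[\s]+)\[(?P<base>[0-9a-f]+)\+(?P<size>[0-9a-f]+)\]"
)


def parse_segfault(line):
    """Extract the library name, faulting address and code offset (hypothetical helper)."""
    m = PATTERN.search(line)
    if m is None:
        raise ValueError("not a recognised segfault line")
    ip = int(m.group("ip"), 16)
    base = int(m.group("base"), 16)
    return {
        "library": m.group("lib"),
        # Address the process tried to access (0 here, i.e. a NULL pointer).
        "fault_address": int(m.group("addr"), 16),
        # Position of the faulting instruction relative to the library base.
        "offset": ip - base,
        "error_code": int(m.group("err"), 16),
    }


info = parse_segfault(LINE)
print(info["library"], hex(info["offset"]), info["fault_address"])
```

The resulting offset could then be given to a tool such as addr2line, assuming a libdrmaa build with debug symbols is available.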



Thanks for your help!

Christophe

--

Christophe Caron                        

Station Biologique / Service Informatique et Bio-informatique
Place Georges Teissier - CS 90074
29688 Roscoff Cedex

Analysis and Bioinformatics for Marine Science
   http://abims.sb-roscoff.fr/

christophe.ca...@sb-roscoff.fr

tél: +33 (0)2 98 29 25 43 / +33 (0)6 07 83 54 77





