We are faced with some incomprehensible troubles with our Galaxy
instance (newly upgraded to 16.07, using SGE and PostgreSQL database).
Since two weeks, it started to suddenly give different kind of error
messages randomly, sometimes it gives "failure preparing job",
sometimes "The cluster DRM system terminated this job", sometimes it
finishes without error, even when relaunching the same wrapper with the
same input datasets.
In parallel, we have a dev instance for which we do not have these
troubles. The config files are substancially the same, except the
connection to the database which is obviously different.
We suspected an issue from PostgreSQL database. So we did some tests and
changed the connection with an empty postgresql databse and the troubles
seem to disappear.
Is there any scripts to check the integrity of the database? Any
recommendations to face this kind of troubles? It seems that there is
inconsistencies in the database that makes the system crash.
Thanks a lot for your help.
UMR IPME (IRD-UM2-CIRAD) Interactions Plantes Microorganismes Environnement
IRD - Institut de Recherche pour le Développement
911, avenue Agropolis
34394 MONTPELLIER CEDEX 5
tel IRD : + 33 (0)4 67 41 61 88
tel CIRAD : + 33 (0)4 67 61 57 21
Please keep all replies on the list by using "reply all"
in your mail client. To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
To search Galaxy mailing lists use the unified search at: