Hi,

What I remember is that while the lastz had died the python wrapper was still hanging around for these 2 days. Maybe it's related to this builtin scheduler of the lastz wrapper.
Unfortunately I don't have time now to reproduce the error conditions.

regards, Andreas

On 26.11.2012 15:47, Bob Harris wrote:
Howdy, Andreas,

Lastz memory requirements are dependent on the size of the input sequences 
(mainly on the size of
reference sequence) and, to a lesser extent, the genome's repeat content.

I'm a little confused/concerned by how this failure was indicated.  When run 
from a console, if
lastz has a memory allocation failure, a message is written to stderr (e.g. 
"call to realloc failed
to allocate 1,000,000,000 bytes") and the program exits to the shell with the 
status EXIT_FAILURE
(an ISO C definition which I presume corresponds to a standard shell error 
code).

Usually, if lastz isn't going to have enough memory, an allocation failure will 
occur early in the
run as all the long term data structures are being built.  This would normally 
be within the first
five minutes.  A later failure would (probably) mean that the long term 
structures were close to the
memory limit and then alignments for one of the query sequences required enough 
additional memory to
push us over the limit.

I assume galaxy's "Job did not produce output" message must be based on a lack 
of any output to
stdout.  What is strange is that fact that it took 2 days to get this message.  
Lack of output to
stdout suggests that the failure occurred before any queries were processed 
(strictly speaking,
before any alignments were output).  This should have occurred in the first few 
minutes of the run.

Would it be possible for you to point me at the input sequences for this run, 
so that I can try
running this via the console, and see if there's something happening in lastz 
that I don't understand?

Bob H


On Nov 26, 2012, at 3:35 AM, Andreas Kuntzagk wrote:

Hi,

I just noticed that this was another case of a tool that needs more then the 
1GB memory that is
default on our cluster. After adjusting the job_runner settings everything 
seems fine.

regards, Andreas

On 15.11.2012 09:08, Andreas Kuntzagk wrote:
Hi,

when I start a lastz job it submits a job to GridEngine. But this job seems to 
hang.
The process running is

python /data/galaxy/galaxy-dist/tools/sr_mapping/lastz_wrapper.py --ref_source= 
...

This wrapper seems to not start a lastz but lastz is in the path.
After 2 days then Galaxy shows "Job did not produce output"

Here is the paster.log from the start of the job.


galaxy.jobs DEBUG 2012-11-15 09:03:07,539 (254) Working directory for job is:
/data/galaxy/galaxy-dist/database/job_working_directory/000/254
galaxy.jobs.handler DEBUG 2012-11-15 09:03:07,540 dispatching job 254 to drmaa 
runner
galaxy.jobs.handler INFO 2012-11-15 09:03:07,688 (254) Job dispatched
galaxy.tools DEBUG 2012-11-15 09:03:07,980 Building dependency shell command 
for dependency 'lastz'
galaxy.tools WARNING 2012-11-15 09:03:07,981 Failed to resolve dependency on 
'lastz', ignoring
galaxy.jobs.runners.drmaa DEBUG 2012-11-15 09:03:08,669 (254) submitting file
/data/galaxy/galaxy-dist/database/pbs/galaxy_254.sh
galaxy.jobs.runners.drmaa DEBUG 2012-11-15 09:03:08,670 (254) command is: python
/data/galaxy/galaxy-dist/tools/sr_mapping/lastz_wrapper.py       
--ref_source=cached
--source_select=pre_set       --out_format=sam
--input2=/data/galaxy/galaxy-dist/database/files/027/dataset_27795.dat
--input1="/data/galaxy/galaxy-dist/tool-data/shared/ucsc/hg18/seq/hg18.2bit"
--ref_sequences="None"          --pre_set_options=yasra98       --identity_min=0
--identity_max=100       --coverage=0
--output=/data/galaxy/galaxy-dist/database/job_working_directory/000/254/galaxy_dataset_27800.dat
    --unmask=yes       --lastzSeqsFileDir=/data/galaxy/galaxy-dist/tool-data; cd
/data/galaxy/galaxy-dist; /data/galaxy/galaxy-dist/set_metadata.sh 
./database/files
/data/galaxy/galaxy-dist/database/job_working_directory/000/254 .
/data/galaxy/galaxy-dist/universe_wsgi.ini 
/data/galaxy/galaxy-dist/database/tmp/tmpL4MMJV
/data/galaxy/galaxy-dist/database/job_working_directory/000/254/galaxy.json
/data/galaxy/galaxy-dist/database/job_working_directory/000/254/metadata_in_HistoryDatasetAssociation_288_9ynzEN,/data/galaxy/galaxy-dist/database/job_working_directory/000/254/metadata_kwds_HistoryDatasetAssociation_288_AEIseO,/data/galaxy/galaxy-dist/database/job_working_directory/000/254/metadata_out_HistoryDatasetAssociation_288_RhsufT,/data/galaxy/galaxy-dist/database/job_working_directory/000/254/metadata_results_HistoryDatasetAssociation_288_86DYsR,/data/galaxy/galaxy-dist/database/job_working_directory/000/254/galaxy_dataset_27800.dat,/data/galaxy/galaxy-dist/database/job_working_directory/000/254/metadata_override_HistoryDatasetAssociation_288_HMRNEG


galaxy.jobs.runners.drmaa DEBUG 2012-11-15 09:03:08,676 run as user 
['kuntzagk', '600']
galaxy.jobs DEBUG 2012-11-15 09:03:08,676 (254) Changing ownership of working 
directory with:
/usr/bin/sudo -E scripts/external_chown_script.py
/data/galaxy/galaxy-dist/database/job_working_directory/000/254 kuntzagk 600
galaxy.jobs.runners.drmaa DEBUG 2012-11-15 09:03:09,148 (254) Job script for 
external submission is:
/data/galaxy/galaxy-dist/database/pbs/254.jt_json
141.80.188.178 - - [15/Nov/2012:09:03:10 +0200] "POST 
/galaxy/root/history_item_updates HTTP/1.1"
200 - "http://bbc.mdc-berlin.de/galaxy/history"; "Mozilla/5.0 (X11; Ubuntu; 
Linux x86_64; rv:16.0)
Gecko/20100101 Firefox/16.0"
galaxy.jobs.runners.drmaa INFO 2012-11-15 09:03:11,553 (254) queued as 1282605
galaxy.jobs.runners.drmaa DEBUG 2012-11-15 09:03:11,767 (254/1282605) state 
change: job is running


--
Andreas Kuntzagk

SystemAdministrator

Berlin Institute for Medical Systems Biology at the
Max-Delbrueck-Center for Molecular Medicine
Robert-Roessle-Str. 10, 13125 Berlin, Germany

http://www.mdc-berlin.de/en/bimsb/BIMSB_groups/Dieterich
___________________________________________________________
Please keep all replies on the list by using "reply all"
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

http://lists.bx.psu.edu/


--
Andreas Kuntzagk

SystemAdministrator

Berlin Institute for Medical Systems Biology at the
Max-Delbrueck-Center for Molecular Medicine
Robert-Roessle-Str. 10, 13125 Berlin, Germany

http://www.mdc-berlin.de/en/bimsb/BIMSB_groups/Dieterich
___________________________________________________________
Please keep all replies on the list by using "reply all"
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

 http://lists.bx.psu.edu/

Reply via email to