Recently I tried to upload some annotation/reference files into Galaxy.  After 
couple failed tries using the upload method, I was able to do it via the using 
the full file path.  However, after these tries/retries, I believe Galaxy 
database got confused and was corrupted.  Existing workflows with existing 
reference were used when datasets that previously were processed without any 
problems, now have funky result set names and parameters to tool wrappers 
reference funky paths.  

Here’s an example of the error
I’m encountering.  The highlighted part references the dbSNP file:
 
python
/home/svcbioinfo/galaxy/galaxy-reproduce/tools/gatkV2/gatk_wrapper.py
--max_jvm_heap "16g" --max_jvm_heap_fraction "1" --stdout
"/mnt/ngs/analysis/svcbioinfo/DATA_reproduce/files/024/dataset_24218.dat"
-d "-I"
"/mnt/ngs/analysis/svcbioinfo/DATA_reproduce/files/024/dataset_24216.dat"
"bam" "gatk_input_first" -d "" 
"/mnt/ngs/analysis/svcbioinfo/DATA_reproduce/files/_metadata_files/001/metadata_1491.dat"
"bam_index" "gatk_input_first" -p 'java -jar
"/home/svcbioinfo/galaxy/galaxy-reproduce/tool-data/shared/jars/gatk/GenomeAnalysisTKv2.jar"
-T "RealignerTargetCreator" --num_threads 4 -o 
"/mnt/ngs/analysis/svcbioinfo/DATA_reproduce/files/024/dataset_24217.dat"
-et "NO_ET" -K
"/home/svcbioinfo/galaxy/galaxy-reproduce/tool-data/shared/jars/genomichealth.com.key"
-R
"/mnt/ngs/analysis/refData/processed/bwa/hg19_decoy_rcrs8001sp.fasta"
' -d "-known:dbsnp,%(file_type)s" 
"/mnt/ngs/analysis/tchu/Workflow_RWC/SEQUENCER99/140201_SNTEST_0021_TESTFC21/analysis_31/primary/fastq/99_21_3_ACAGTG_R2.fastq"
"fastq" "input_dbsnp_0" -d
"-known:indels,%(file_type)s"
"/mnt/ngs/analysis/svcbioinfo/DATA_reproduce/files/016/dataset_16596.dat"
"vcf" "input_indels_0" -p '--pedigreeValidationType
"STRICT"' -d "--intervals"
"/mnt/ngs/analysis/svcbioinfo/DATA_reproduce/files/022/dataset_22821.dat"
"bed" "input_intervals_0" -p '--interval_padding
"400"' -p '--interval_set_rule "UNION"' -p
'--downsampling_type "NONE"' -p ' --validation_strictness
"SILENT" --interval_merging "ALL" ' -p ' --unsafe
"LENIENT_VCF_PROCESSING"
 
The highlighted part should be
the dbSNP file which typically has a Galaxy based name and type of “vcf”
 such as below:
 
python
/home/svcbioinfo/galaxy/galaxy-reproduce/tools/gatkV2/gatk_wrapper.py
--max_jvm_heap "16g" --max_jvm_heap_fraction "1" --stdout
"/mnt/ngs/analysis/svcbioinfo/DATA_reproduce/files/024/dataset_24314.dat"
-d "-I" 
"/mnt/ngs/analysis/svcbioinfo/DATA_reproduce/files/024/dataset_24312.dat"
"bam" "gatk_input_first" -d ""
"/mnt/ngs/analysis/svcbioinfo/DATA_reproduce/files/_metadata_files/001/metadata_1492.dat"
"bam_index" "gatk_input_first" -p 'java -jar
"/home/svcbioinfo/galaxy/galaxy-reproduce/tool-data/shared/jars/gatk/GenomeAnalysisTKv2.jar"
-T "RealignerTargetCreator" --num_threads 4 -o
"/mnt/ngs/analysis/svcbioinfo/DATA_reproduce/files/024/dataset_24313.dat"
-et "NO_ET" -K
"/home/svcbioinfo/galaxy/galaxy-reproduce/tool-data/shared/jars/genomichealth.com.key"
-R "/mnt/ngs/analysis/refData/processed/bwa/hg19_decoy_rcrs.fasta" '
-d "-known:dbsnp,%(file_type)s" 
"/mnt/ngs/analysis/svcbioinfo/DATA_reproduce/files/016/dataset_16598.dat"
"vcf" "input_dbsnp_0" -d
"-known:indels,%(file_type)s"
"/mnt/ngs/analysis/svcbioinfo/DATA_reproduce/files/016/dataset_16596.dat"
"vcf" "input_indels_0" -p '--pedigreeValidationType
"STRICT"' -d "--intervals"
"/mnt/ngs/analysis/svcbioinfo/DATA_reproduce/files/016/dataset_16617.dat"
"bed" "input_intervals_0" -p '--interval_padding
"400"' -p '--interval_set_rule "UNION"' -p
'--downsampling_type "NONE"' -p ' --validation_strictness
"SILENT" --interval_merging "ALL" ' -p ' --unsafe
"LENIENT_VCF_PROCESSING"
 
I wonder if there's a way to recover from this or I'll have to reinstall the 
database.  Thanks for your help.


Thuy-Linh Chu
___________________________________________________________
Please keep all replies on the list by using "reply all"
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/

Reply via email to