We have a sample containing several bacterial species and we want to uniquely
map RNA-seq reads to the genomes of each of our organisms to get the expression
patterns of each organism separately. We tried to use BWA in Galaxy with the
“edit distance” (aln -n in the command line version) set to 0 but none of the
reads were mapped (all had the SAM tag set to “4’). This is an artifact since
running BLAST with some of the sequences showed that they have 100% identity to
one of our genomes and not any others, so they should map uniquely.
When running BWA with the number of mismatches set to between 1-5 >90% of our
reads were mapped, and the number of mapped reads increased with the mismatch
number so that seems to be working OK.
Does the "aln -n" option really determine the number of mismatches? Any ideas
why BWA will not run well in Galaxy using –n=0?
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org. Please keep all replies on the list by
using "reply all" in your mail client. For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:
To manage your subscriptions to this and other Galaxy lists,
please use the interface at: