Re: [galaxy-user] Using BWA to map without any mismathces

Jennifer Jackson Thu, 21 Mar 2013 12:41:53 -0700

Hi Daniel,

Thanks for reporting this issue - we have done some testing here and canduplicate your results.

We are still reviewing options about how to address this, includingGalaxy wrapper modifications and possibly more. The issue has to do withhow certain variables are passed in the wrapper and interpreted in thecommand string. If you are interested, write me back to let me know andI can send you a link to the development ticket you can track (oncecreated).

Back to how to do this sort of analysis - yes, "aln -n" is a type ofmismatch parameter. And ideally setting this would result in amismatch-free alignment. Instead it returns no results:


Maximum edit distance (aln -n):  "0"
and
Fraction of missing alignments given 2% uniform base error rate (aln -n): "0"

As a work-around, we have found that making the the Fraction variablevery small achieves a close approximation of mismatch-free alignments onour test sets. This is by no means a guarantee, but pending futurechanges, these are the recommended form settings:


Maximum edit distance (aln -n):  "0"
and
Fraction of missing alignments given 2% uniform base error rate (aln -n): 
"0.00001"

Thank you for your patience while we worked out exactly what was goingon. Hopefully the temporary work-around will allow you to continue withyour research,


Jen
Galaxy team

On 3/2/13 11:44 AM, Daniel Sher wrote:

Hello,
We have a sample containing several bacterial species and we want touniquely map RNA-seq reads to the genomes of each of our organisms toget the expression patterns of each organism separately. We tried touse BWA in Galaxy with the "edit distance" (aln -n in the command lineversion) set to 0 but none of the reads were mapped (all had the SAMtag set to "4'). This is an artifact since running BLAST with some ofthe sequences showed that they have 100% identity to one of ourgenomes and not any others, so they should map uniquely.
When running BWA with the number of mismatches set to between 1-5 >90%of our reads were mapped, and the number of mapped reads increasedwith the mismatch number so that seems to be working OK.
Does the "aln -n" option really determine the number of mismatches?Any ideas why BWA will not run well in Galaxy using --n=0?
Thanks
Daniel

--



___________________________________________________________
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using "reply all" in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

   http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

   http://lists.bx.psu.edu/


--
Jennifer Hillman-Jackson
Galaxy Support and Training
http://galaxyproject.org

___________________________________________________________
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using "reply all" in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/

Re: [galaxy-user] Using BWA to map without any mismathces

Reply via email to