On Jan 4 2006, Alex Golubev wrote:

Hi,

I'm trying to align amino acids to nucleic acids. I'm using gapped sequences both for the protein and for the DNA. I have several problems and I would very appreciate if someone could help. 1. How can I parse DNA nucleic acids and get codons. I would like to start with DNA that look like this "ATGTAT" and get a protein that look like this "MY". I'm using "Alphabet alpha = DNATools.getCodonAlphabet();" but I can't find tokenization to parse the DNA string (does this make any sense?).

You can convert a SymbolList in the DNA alphabet into the equivalent symbol list in the codon alphabet (DNAxDNAxDNA) by using SymbolListViews.orderNSymbolList(...).



2. My other problem is that there are frame shifts and my gapped DNA look actually like this "AT-G-TAT". Is there any way to get/translate locations from the codon symbols list to/from the DNA symbols list?

Ouch.  What do you really want to do here?

Regards,
David Huen
_______________________________________________
Biojava-l mailing list  -  Biojava-l@biojava.org
http://biojava.org/mailman/listinfo/biojava-l

Reply via email to