Re: [EMBOSS] Problem with protein caracters

Peter Rice Sat, 11 Jul 2009 04:04:53 -0700

Radwen ANIBA wrote:

I'm trying to use some programs that comes with emboss package to analyze
some protein sequences but I have sometimes this message :


Error: ajSeqTypeCheckIn: Sequence must be protein sequence without BZ U X or
*: found bad character 'X'

Is there any manner to force the program considering these types of residues

EMBOSS uses the type attribute of the input sequence (or seqset orseqall) to identify the type of the input sequence (nucleotide, protein,or any) and the characters that are allowed (gaps, stops, non-standardresidies and ambiguity characters).

Your application is expecting "pureprotein". This is only used byapplications unable to handle the ambiguity codes (it can be difficultto define what an algorithm should do with them).


The alternative are:

protein - accepts all characters, converts stops to X
proteinstandard - converts U,O and J to 'X'
stopproteinstandard - converts stops, U, O, J to X

"protein" is probably what you want. You need to be able to do somethingwith the ambiguity codes X, B, Z and J and with the non-standard aminoacids U (selenocysteine) and O (pyrrolysine)


Hope this helps

Peter Rice
_______________________________________________
EMBOSS mailing list
[email protected]
http://lists.open-bio.org/mailman/listinfo/emboss

Re: [EMBOSS] Problem with protein caracters

Reply via email to