[R] Removing 99% similar sequence help

Nick Jeffery Wed, 04 Feb 2015 12:12:22 -0800

Dear R users,

I am having trouble finding a package and function to remove DNA sequences
from a fasta file that are >99% similar and/or create an output of the
remaining "unique" sequences. I found the uniquefasta function in phytools
but R can't find this function and also doesn't allow me to set the 99%
parameter. This is because I'm building a phylogeny with tons of nearly
identical sequences so I want to reduce the number of individuals.



Thanks for any help and suggestions,
Nick

-- 
Nick Jeffery, PhD Candidate
Integrative Biology
SCIE 1453
University of Guelph
Guelph, Ontario, Canada

        [[alternative HTML version deleted]]

______________________________________________
R-help@r-project.org mailing list -- To UNSUBSCRIBE and more, see
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] Removing 99% similar sequence help

Reply via email to