On Apr 14, 2010, at 11:19 PM, Bogdan Tanasa wrote:

Dear all,

please could you suggest any R functions or packages (or external programs),
that

a. take as input a large number (> 10 000) of short 20-30 nt sequences, and
do
sequence assembly, to reconstruct larger (extended) 30-50 sequences ?

b. take as input a larger number of sequences (100 000 - 1 mil) and cluster
these
sequences in distinct classes based on the sequence similarity  ?

Most of the discussion about genetics/omics applications occurs on the BioConductor mailing list. You should definitely seek it out, get the base installed and review their available online resources (before sending your next message to the correct mailing list.

http://www.bioconductor.org/docs

--

David Winsemius, MD
West Hartford, CT

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to