Hi Tobias --

Tobias Straub <[email protected]> writes:

> hi
>
> i would like to remove all non-unique sequences from a AlignedRead
> object. i thought that the uniqueFilter would help me to do so. in
> fact, the filter removes a considerable amount of reads, but when i
> call tables on the result object i still have lots of sequences
> occuring more than once.
> did i miss something?

The challenge is in defining what 'unique' is. From the help page
?uniqueFilter

     uniqueFilter(withSread=TRUE, .name="UniqueFilter")

and

withSread: A 'logical(1)' indicating whether uniqueness includes the
          read sequence ('withSread=TRUE') or is based only on
          chromosome, position, and strand ('withSread=FALSE').
     
so uniqueFilter by default looks for reads that are identical in terms
of the actual sequence, and are also identical in terms of chromosome,
position, and strand of alignment. 'tables' is based on just the
reads. If you wanted to make the reads unique, based only on sequence
identity, you could do something like

  aln[!srduplicated(aln)]

Martin

> thanks in advance
> Tobias
>
> ----------------------------------------------------------------------
> Tobias Straub   ++4989218075439   Adolf-Butenandt-Institute, München D
>
> _______________________________________________
> Bioc-sig-sequencing mailing list
> [email protected]
> https://stat.ethz.ch/mailman/listinfo/bioc-sig-sequencing

-- 
Martin Morgan
Computational Biology / Fred Hutchinson Cancer Research Center
1100 Fairview Ave. N.
PO Box 19024 Seattle, WA 98109

Location: Arnold Building M1 B861
Phone: (206) 667-2793

_______________________________________________
Bioc-sig-sequencing mailing list
[email protected]
https://stat.ethz.ch/mailman/listinfo/bioc-sig-sequencing

Reply via email to