Allowing 2 errors, the first two letters of Rpattern3 match the last two letters of the subject.

Maybe you want max.Rmismatch=2, which is effectively c(rep(-1,11),2), or something else to prevent
unintended trimming.

On Oct 9, 2009, at 3:53 PM, joseph wrote:

Hello
In the example below, Rpattern2 (2 mismatches) trims the subject correctly, while Rpattern3 (3 mismatches) which is not supposed to do any trimming, it trims the last 2 nucleotides. Can you explain why?


subject <- DNAStringSet("GCTGGAACCCAGGGTGTTGTACCTGTAGGCACCA")
Rpattern2 <- "CTGaAGGCACgA" # 2 mismatches
Rpattern3 <- "tTGaAGGCACgA" # 3 mismatches
trimLRPatterns(Rpattern = Rpattern2, subject = subject, max.Rmismatch = rep(2, 12))
  A DNAStringSet instance of length 1
    width seq
[1]    22 GCTGGAACCCAGGGTGTTGTAC
trimLRPatterns(Rpattern = Rpattern3, subject = subject, max.Rmismatch = rep(2, 12))
  A DNAStringSet instance of length 1
    width seq
[1]    32 GCTGGAACCCAGGGTGTTGTACCTGTAGGCAC


sessionInfo()
R version 2.10.0 alpha (2009-10-01 r49909)
x86_64-apple-darwin9.8.0

locale:
[1] en_US.UTF-8/en_US.UTF-8/C/C/en_US.UTF-8/en_US.UTF-8

attached base packages:
[1] stats     graphics  grDevices utils     datasets  methods   base

other attached packages:
[1] rtracklayer_1.5.17 RCurl_1.2-0 bitops_1.0-4.1 ShortRead_1.3.36 [5] lattice_0.17-25 BSgenome_1.13.16 Biostrings_2.13.48 IRanges_1.3.87

loaded via a namespace (and not attached):
[1] Biobase_2.5.8 grid_2.10.0   hwriter_1.1   tools_2.10.0  XML_2.6-0




        [[alternative HTML version deleted]]

_______________________________________________
Bioc-sig-sequencing mailing list
[email protected]
https://stat.ethz.ch/mailman/listinfo/bioc-sig-sequencing

_______________________________________________
Bioc-sig-sequencing mailing list
[email protected]
https://stat.ethz.ch/mailman/listinfo/bioc-sig-sequencing

Reply via email to