Dear all,
I have a mapping file from Bowtie from a RNA-seq run against human genome. I
created RangedDataList to represent the mapping coordinates from different
chromosomes, strands and lanes. Now I would like to eliminate the RangedData
entries which have the same IRanges start and end, chromosome number and strand
orientation.
In the following example, entry 1, 3 and 4 have the same chromosome, strand,
start and end, and after the procedure, they should be reduced to one entry. Is
there a function I can use? Or is there some other better ways to represent the
mapping info which include chromosome, strand, star t and end, rather than
RangedData? Thanks.
-Kunbin
> head(sLane[["s_1"]][3])
RangedData with 6 rows and 2 value columns across 1 space
space ranges | strand index
<character> <IRanges> | <factor> <integer>
1 chr1 [223780005, 223780055] | + 6
2 chr1 [ 89018675, 89018725] | - 55
3 chr1 [223780005, 223780055] | + 68
4 chr1 [223780005, 223780055] | + 69
5 chr1 [107921032, 107921082] | - 75
6 chr1 [243086472, 243086522] | - 86
> class(sLane[["s_1"]][3])
[1] "RangedData"
attr(,"package")
[1] "IRanges"
> class(sLane[["s_1"]])
[1] "RangedData"
attr(,"package")
[1] "IRanges"
> class(sLane)
[1] "RangedDataList"
attr(,"package")
[1] "IRanges"
> sessionInfo()
R version 2.11.0 (2010-04-22)
x86_64-unknown-linux-gnu
locale:
[1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C
[3] LC_TIME=en_US.UTF-8 LC_COLLATE=en_US.UTF-8
[5] LC_MONETARY=C LC_MESSAGES=en_US.UTF-8
[7] LC_PAPER=en_US.UTF-8 LC_NAME=C
[9] LC_ADDRESS=C LC_TELEPHONE=C
[11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
attached base packages:
[1] stats graphics grDevices utils datasets methods base
other attached packages:
[1] ShortRead_1.6.2 Rsamtools_1.0.1 lattice_0.19-11
[4] Biostrings_2.16.7 GenomicRanges_1.0.1 IRanges_1.6.8
loaded via a namespace (and not attached):
[1] Biobase_2.8.0 grid_2.11.0 hwriter_1.2 tools_2.11.0
>
______________________________________________________________________
The contents of this electronic message, including any attachments, are
intended only for the use of the individual or entity to which they are
addressed and may contain confidential information. If you are not the intended
recipient, you are hereby notified that any use, dissemination, distribution,
or copying of this message or any attachment is strictly prohibited. If you
have received this transmission in error, please send an e-mail to
[email protected] and delete this message, along with any
attachments, from your computer.
[[alternative HTML version deleted]]
_______________________________________________
Bioc-sig-sequencing mailing list
[email protected]
https://stat.ethz.ch/mailman/listinfo/bioc-sig-sequencing