I just skimmed through the documentation for genomeIntervals and IRanges. They both seem to implement the first step of a comparison --- basic set operations like intersection, union, "overlap," etc. --- but it doesn't look like they have tests of significance. Are there packages that implement permutation tests for sets of genome intervals? A simple permutation test could be done by selecting random sets of intervals "matching" the query intervals and counting the number of overlaps with the reference intervals. Each random set of intervals could be picked so that the number and size of the intervals was the same as the query. A general implementation of the method would need to know the length of each chromosome. The implementation should also give the user the ability to specify excluded regions of the genome. For example, if the coordinates are derived from aligning reads to the genome, intervals intersecting sequence gaps would not be admissible.

Of course, if the null hypothesis for this permutation test (the sets intervals are not related) is rejected, then you have to think about the next questions: To what degree are the set related? Where do they differ and where are they the same?


Ivan Gregoretti wrote:
Hello Steve, Nicolas and Michael,

I agree with all of you: it is not a trivial question.

I asked the bioc-sig-seq listers because I thought, --Hey, this must
be the everyday's question of the genome analyst.

Say you ran your chipseq under condition A and then you ran it under
condition B. Then you have to decide whether A and B made any
difference. It doesn't get any simpler than that!

I can't compare the two means or the two dispersions. I have to
compare pairs. The problem is that it is not trivial to unambiguously
determine which spot in B must be paired with each spot in A. To start
with, A and B may have different numbers of loci (ie 15000 versus
18000).

I'll take a look at genomeIntervals and IRanges.

_______________________________________________
Bioc-sig-sequencing mailing list
[email protected]
https://stat.ethz.ch/mailman/listinfo/bioc-sig-sequencing

Reply via email to