Folks,
I have a question about MISO output and you folks are experts.
I have a dataset analyzed and now I am vetting the respective output.
For a candidate gene with three transcripts, I see from the *bf file:
Sample1_counts
S1_assigned_counts
Sample1_counts
S2_assigned_counts
(0,0,0):3117,(0,0,1):19,(0,1,0):543,(1,0,0):2,(1,0,1):303,(1,1,1):135
0:261,1:639,2:102
(0,0,0):3049,(0,0,1):21,(0,1,0):640,(1,0,0):2,(1,0,1):241,(1,1,1):127
0:195,1:734,2:102
The assigned_counts per transcript seem to make sense
However, for the assigned counts, the largest numbers are seen for the
“(0,0,0): 3117” and “(0,0,0): 3049” cases.
From the on-line manual
“The field (0,0):N, if outputted, indicates the number of reads N that mapped
to the region but were thrown out (e.g. because they are not consistent with
the isoforms in the annotation, violate overhang constraints, or other related
reasons.)”
1) Have you seen this before?
2) How can I evaluate whether or not that result is correct? That is, am I
really throwing out 3K reads per the manual? Remember this is for a single
event/Gene. If so I am losing a lot of data.
3) How do you handle replicates? Miso works on pairs.
Cheers,
Starr
-------------------------------------------------------------------------
This message was secured via TLS by MUSC.
_______________________________________________
miso-users mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/miso-users