Re: [aroma.affymetrix] callAB gives Error in bootstrapTCNandDHByRegion.PairedPSCBS

Henrik Bengtsson Mon, 22 Apr 2013 18:01:03 -0700

Hi Ingrid.

[cleaning up by dropping everything but the
bootstrapTCNandDHByRegion() error; will reply to the other stuff
separately.]


On Sun, Apr 21, 2013 at 8:33 PM, Ingrid Lönnstedt
<ingrid.lonnst...@gmail.com> wrote:
>
> Hi Henrik,
> Thank you for your answers! You do a great job, indeed! I am now back to
> this. See a couple of remaining issues below.
> /Ingrid
>
>
>
> On Wednesday, 27 March 2013 18:16:07 UTC+11, Henrik Bengtsson wrote:
>>
>> On Tue, Mar 26, 2013 at 10:03 PM, Ingrid Lönnstedt
>> <ingrid.l...@gmail.com> wrote:
>> >
>> > Thank you Henrik!
>> > OK, I followed your instructions, and the same problem (the error in
>> > callAB()) remains, or rather, it now occurs already in sample one. Also,

[snip]

> So, my main problem that remains is insanity check failures in callAB(). I
> have followed my sample 1 into bootstrapTCNandDHByRegion.PairedPSCBS with
> debug() and concluded that I have 592 segments, and the first failure occurs
> with the 233rd segment. I have printed the CBS info for segments 220 to 233,
> in case that would help, as well as the current error message and session
> Info. If there is a failure with segment 233, is there a reason not to
> report the callAB  outcomes of the other segments? What does the sanity
> check failure tell me, and could I do any fix to get round it or at least
> get something for the other segments?
>
> Thanks,
> Ingrid
>
>
> Browse[3]> segs[220:233,]
>
>     chromosome tcnId dhId  tcnStart    tcnEnd tcnNbrOfLoci   tcnMean
> 220          9    11    1  73187382 125406672        38303 2.1977136
> 221          9    12    1 125406672 141091395         8913 2.1506696
> 222         NA    NA   NA        NA        NA            0        NA
> 223         10     1    1     72760  17932882        15166 2.0003566
> 224         10     2    1  17932882  17956182            2 4.5480683
> 225         10     3    1  17956182  18179544           18 1.9775509
> 226         10     4    1  18179544  18202780            3 0.6127961
> 227         10     5    1  18202780  24376470         4403 2.0286301
> 228         10     6    1  24376470  24378482           11 0.6265435
> 229         10     7    1  24378482  39076222        10124 2.0268394
> 230         10     8    1  39076223  42433539            0       NaN
> 231         10     9    1  42433540  86411230        28982 2.0375534
> 232         10    10    1  86411230  89029244         1697 2.0558821
> 233         10    11    1  89029244  89115851           21 2.5505440
>     tcnNbrOfSNPs tcnNbrOfHets dhNbrOfLoci   dhStart     dhEnd     dhMean
> 220        18928         5290        5290  73187382 125406672 0.07756713
> 221         4002         1101        1101 125406672 141091395 0.07943612
> 222           NA           NA           0        NA        NA         NA
> 223         8456         2087        2087     72760  17932882 0.07487577
> 224            0            0           0  17932882  17956182         NA
> 225            4            1           1  17956182  18179544 0.04524338
> 226            0            0           0  18179544  18202780         NA
> 227         2212          543         543  18202780  24376470 0.07859021
> 228            0            0           0  24376470  24378482         NA
> 229         4971         1086        1086  24378482  39076222 0.07712847
> 230            0            0           0  39076223  42433539         NA
> 231        13983         3576        3576  42433540  86411230 0.06990635
> 232          858          192         192  86411230  89029244 0.06439847
> 233            4            1           1  89029244  89115851 0.36474615
>        c1Mean   c2Mean
> 220 1.0136216 1.184092
> 221 0.9899144 1.160755
> 222        NA       NA
> 223 0.9252892 1.075067
> 224        NA       NA
> 225 0.9440399 1.033511
> 226        NA       NA
> 227 0.9345998 1.094030
> 228        NA       NA
> 229 0.9352562 1.091583
> 230        NA      NaN
> 231 0.9475577 1.089996
> 232 0.9617432 1.094139
> 233 0.8101214 1.740423
>
> deltaCN <- estimateDeltaCN(fit, scale=1) #0.2601609 for my sample
> fit = callAB(fit, delta = deltaCN, verbose = -10)
> List of 7
>  $ nbrOfTCNs      : int 21
>  $ tcnNbrOfLoci   : int 21
>  $ mu             : num 2.58
>  $ tcnMean        : num 2.55
>  $ dMu            : num 0.0248
>  $ abs(dMu)       : num 0.0248
>  $ range(x[units]): num [1:2] 89029244 89114401
>
> Error in bootstrapTCNandDHByRegion.PairedPSCBS(fit, statsFcn = statsFcn,
> :
>   INTERNAL ERROR: Incorrect TCN mean!

This is an internal error that the *implementation* of the PSCBS
algorithm is correct and handles all "edge cases"; apparently there is
still another case that was not considered.  These "edge cases" are
for instance when two SNPs share the exact same location but the
internal CBS segmentation method wants to put a change point in
between them because that's what the data happens to say.  In other
words, it's nothing you're doing wrong and probably nothing you should
try to work around either. (I'll think about this a bit more since it
happens once in a while and it would be nice to be able to keep going.
 On the other hand, if it would be just a warning you would have never
told me and the error would never be fixed.)

First, update to the latest PSCBS v0.34.8 (on CRAN since yday) and
verify that the error remains.  If it does, could you save the
segmentation result;

saveObject(fit, "problem.Rdata")

and make 'problem.Rdata' available to me for download somewhere?  Then
I can troubleshoot.

/Henrik

>
>
> > sessionInfo()
> R version 2.15.2 Patched (2013-02-09 r61879)
> Platform: x86_64-unknown-linux-gnu (64-bit)
>
> locale:
>  [1] LC_CTYPE=en_US.UTF-8       LC_NUMERIC=C
>  [3] LC_TIME=en_US.UTF-8        LC_COLLATE=en_US.UTF-8
>  [5] LC_MONETARY=en_US.UTF-8    LC_MESSAGES=en_US.UTF-8
>  [7] LC_PAPER=C                 LC_NAME=C
>  [9] LC_ADDRESS=C               LC_TELEPHONE=C
> [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
>
> attached base packages:
> [1] stats     graphics  grDevices utils     datasets  methods   base
>
> other attached packages:
>  [1] DNAcopy_1.32.0         PSCBS_0.34.1           aroma.affymetrix_2.8.0
>  [4] affxparser_1.30.2      aroma.apd_0.2.3        R.huge_0.4.1
>  [7] aroma.light_1.28.0     aroma.core_2.8.0       matrixStats_0.7.0
> [10] R.rsp_0.8.2            R.devices_2.1.3        R.cache_0.6.5
> [13] R.filesets_2.0.0       R.utils_1.23.2         R.oo_1.13.0
> [16] R.methodsS3_1.4.2
>
> loaded via a namespace (and not attached):
> [1] calmate_0.10.0 digest_0.6.3   MASS_7.3-23
>
>
>>
>> > All the best,
>> > Ingrid
>> >
>> >
>> > On Saturday, 23 March 2013 05:03:50 UTC+11, Henrik Bengtsson wrote:
>> >>
>> >> Hi Ingrid.
>> >>
>> >> On Wed, Mar 20, 2013 at 4:45 PM, Ingrid Lönnstedt
>> >> <ingrid.l...@gmail.com> wrote:
>> >> >
>> >> > Hi!
>> >> >
>> >> > I have segmented 56 SNP 6 tumour samples with
>> >> > segmentByNonPairedPSCBS(),
>> >> > and like to call allelic balance segments with callAB() as required,
>> >> > I
>> >> > understand, before I can call neutral TCNs with callNTCN().
>> >>

[snip]

>> >>
>> >> source("http://aroma-project.org/hbLite.R";);
>> >> hbInstall("PSCBS");
>> >>
>> >>
>> >>
>> >> > For the first
>> >> > tumour sample, both callAB() and callNTCN() seem to work fine, but
>> >> > then on
>> >> > the second sample I get errors with callAB(), see below.
>> >>
>> >> See comment after the error.
>> >>
>> >> > The plotTracks()
>> >> > image looks pretty much the same between sample one and two, and I
>> >> > have no
>> >> > reason to believe that sample two has a major problem in the
>> >> > segmentation. 4
>> >> > of the 56 samples get very noisy segmentations, with lots of short
>> >> > regions
>> >> > scattered (has anybody seen that before?)

[snip]

>> >> >   betaT = dataN[,'fracB',samp]
>> >> >   ugp = getAromaUgpFile(dsNList$total)
>> >> >   chromosome = ugp[,1,drop = T]
>> >> >   x = ugp[, 2, drop = T]
>> >> >   df = data.frame(chromosome = chromosome, x=x, CT=CT, betaT =
>> >> > betaT)
>> >> >   df = dropSegmentationOutliers(df)
>> >> >         #This gives warnings:
>> >> >         #24: In DNAcopy::CNA(genomdat = yKK, chrom = chromosomeKK,
>> >> > ... :
>> >> >         #array has repeated maploc positions
>> >> >   gaps = findLargeGaps(df, minLength = 1e+06)
>> >> >   knownSegments = gapsToSegments(gaps)
>> >> >
>> >> >   fit = segmentByNonPairedPSCBS(df, knownSegments = knownSegments,
>> >> > avgDH="median",
>> >> >                                                      tauA = .33,
>> >> > seed =
>> >> > 48879, verbose = -10)
>> >> >       #This gives warnings that there are multipe maploc positions
>> >> >   fit = callAB(fit, verbose = -10)
>> >> >
>> >> > Calling segments of allelic balance from one-sided DH bootstrap
>> >> > confidence
>> >> > intervals...
>> >> >  delta (offset adjusting for bias in DH): 0.211106073856354
>> >> >  alpha (CI quantile; significance level): 0.05
>> >> > List of 7
>> >> >  $ nbrOfDHs       : int 70
>> >> >  $ dhNbrOfLoci    : int 70
>> >> >  $ mu             : num 0.183
>> >> >  $ dhMean         : num 0.203
>> >> >  $ dMu            : num -0.02
>> >> >  $ abs(dMu)       : num 0.02
>> >> >  $ range(x[units]): num [1:2] 990417 2792801
>> >> > Error in bootstrapTCNandDHByRegion.PairedPSCBS(fit, statsFcn =
>> >> > statsFcn,
>> >> > :
>> >> >   INTERNAL ERROR: Incorrect DH mean!
>> >> >  Calling segments of allelic balance from one-sided DH bootstrap
>> >> > confidence intervals...done
>> >>
>> >> This is a internal sanity check throwing an error.   As a start, I
>> >> recommend to retry with (i) a proper CT calculate, and (ii) after
>> >> updated PSCBS.
>> >>
>> >> Hope this helps
>> >>
>> >> Henrik
>> >>
>> >> >
>> >> >
>> >> > >
>> >> > > head(getSegments(fit))
>> >> >   chromosome tcnId dhId   tcnStart     tcnEnd tcnNbrOfLoci   tcnMean
>> >> > tcnNbrOfSNPs tcnNbrOfHets dhNbrOfLoci    dhStart      dhEnd
>> >> > 1          1     1    1    61736.0   811403.5           73 1.4735545
>> >> > 15            2           2    61736.0   811403.5
>> >> > 2          1     2    1   811403.5  2821394.0          527 0.9201369
>> >> > 183           70          70   811403.5  2821394.0
>> >> > 3          1     3    1  2821394.0  2827143.5            2 5.0273015
>> >> > 2            0           0  2821394.0  2827143.5
>> >> > 4          1     4    1  2827143.5 11694138.0         5531 0.9780667
>> >> > 2940          983         983  2827143.5 11694138.0
>> >> > 5          1     5    1 11694138.0 11709225.0            5 3.3949294
>> >> > 5            3           3 11694138.0 11709225.0
>> >> > 6          1     6    1 11709225.0 16923050.0         2918 1.0072282
>> >> > 1485          550         550 11709225.0 16923050.0
>> >> >      dhMean    c1Mean    c2Mean
>> >> > 1 0.1695309 0.6118708 0.8616838
>> >> > 2 0.2033105 0.3665317 0.5536052
>> >> > 3        NA        NA        NA
>> >> > 4 0.2033807 0.3895734 0.5884933
>> >> > 5 0.1465555 1.4486918 1.9462376
>> >> > 6 0.2036166 0.4010699 0.6061583
>> >> >
>> >> >
>> >> > > traceback()
>> >> > 7: stop("INTERNAL ERROR: Incorrect DH mean!")
>> >> > 6: bootstrapTCNandDHByRegion.PairedPSCBS(fit, statsFcn = statsFcn,
>> >> >        ..., verbose = less(verbose, 50))
>> >> > 5: bootstrapTCNandDHByRegion(fit, statsFcn = statsFcn, ..., verbose
>> >> > =
>> >> > less(verbose,
>> >> >        50))
>> >> > 4: callAllelicBalanceByDH.PairedPSCBS(fit, ...)
>> >> > 3: callAllelicBalanceByDH(fit, ...)
>> >> > 2: callAB.PairedPSCBS(fit, verbose = -10)
>> >> > 1: callAB(fit, verbose = -10)
>> >> >
>> >> >
>> >> >
>> >> >
>> >> > > sessionInfo()
>> >> >
>> >> > R version 2.15.2 Patched (2013-02-09 r61879)
>> >> > Platform: x86_64-unknown-linux-gnu (64-bit)
>> >> >
>> >> > locale:
>> >> >  [1] LC_CTYPE=en_US.UTF-8       LC_NUMERIC=C
>> >> > LC_TIME=en_US.UTF-8        LC_COLLATE=en_US.UTF-8
>> >> >  [5] LC_MONETARY=en_US.UTF-8    LC_MESSAGES=en_US.UTF-8
>> >> > LC_PAPER=C
>> >> > LC_NAME=C
>> >> >  [9] LC_ADDRESS=C               LC_TELEPHONE=C
>> >> > LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
>> >> >
>> >> > attached base packages:
>> >> > [1] splines   stats     graphics  grDevices utils     datasets
>> >> > methods
>> >> > base
>> >> >
>> >> > other attached packages:
>> >> >  [1] Hmisc_3.10-1           survival_2.37-4        DNAcopy_1.32.0
>> >> > PSCBS_0.30.0           aroma.affymetrix_2.8.0
>> >> >  [6] affxparser_1.30.2      aroma.apd_0.2.3        R.huge_0.4.1
>> >> > aroma.light_1.28.0     aroma.core_2.8.0
>> >> > [11] matrixStats_0.6.2      R.rsp_0.8.2            R.devices_2.1.3
>> >> > R.cache_0.6.5          R.filesets_2.0.0
>> >> > [16] crlmm_1.16.9           preprocessCore_1.20.0
>> >> > oligoClasses_1.20.0
>> >> > BiocGenerics_0.4.0     R.utils_1.19.5
>> >> > [21] R.oo_1.11.7            R.methodsS3_1.4.2
>> >> >
>> >> > loaded via a namespace (and not attached):
>> >> >  [1] affyio_1.26.0        annotate_1.36.0      AnnotationDbi_1.20.5
>> >> > Biobase_2.18.0       BiocInstaller_1.8.3  Biostrings_2.26.3
>> >> >  [7] bit_1.1-9            cluster_1.14.3       codetools_0.2-8
>> >> > DBI_0.2-5            digest_0.6.3         ellipse_0.3-7
>> >> > [13] ff_2.2-10            foreach_1.4.0        genefilter_1.40.0
>> >> > GenomicRanges_1.10.7 grid_2.15.2          IRanges_1.16.6
>> >> > [19] iterators_1.0.6      lattice_0.20-13      Matrix_1.0-11
>> >> > mvtnorm_0.9-9994     parallel_2.15.2      RcppEigen_0.3.1.2.1
>> >> > [25] RSQLite_0.11.2       stats4_2.15.2        tools_2.15.2
>> >> > XML_3.95-0.1         xtable_1.7-1         zlibbioc_1.4.0
>> >> >
>> >> >
>> >> > >
>> >> >
>> >> >
>> >> >
>> >> > --
>> >> > --
>> >> > When reporting problems on aroma.affymetrix, make sure 1) to run the
>> >> > latest version of the package, 2) to report the output of
>> >> > sessionInfo() and
>> >> > traceback(), and 3) to post a complete code example.
>> >> >
>> >> >
>> >> > You received this message because you are subscribed to the Google
>> >> > Groups
>> >> > "aroma.affymetrix" group with website http://www.aroma-project.org/.
>> >> > To post to this group, send email to aroma-af...@googlegroups.com
>> >> > To unsubscribe and other options, go to
>> >> > http://www.aroma-project.org/forum/
>> >> >
>> >> > ---
>> >> > You received this message because you are subscribed to the Google
>> >> > Groups
>> >> > "aroma.affymetrix" group.
>> >> > To unsubscribe from this group and stop receiving emails from it,
>> >> > send an
>> >> > email to aroma-affymetr...@googlegroups.com.
>> >> > For more options, visit https://groups.google.com/groups/opt_out.
>> >> >
>> >> >
>> >
>> > --
>> > --
>> > When reporting problems on aroma.affymetrix, make sure 1) to run the
>> > latest version of the package, 2) to report the output of sessionInfo() and
>> > traceback(), and 3) to post a complete code example.
>> >
>> >
>> > You received this message because you are subscribed to the Google
>> > Groups "aroma.affymetrix" group with website http://www.aroma-project.org/.
>> > To post to this group, send email to aroma-af...@googlegroups.com
>> > To unsubscribe and other options, go to
>> > http://www.aroma-project.org/forum/
>> >
>> > ---
>> > You received this message because you are subscribed to the Google
>> > Groups "aroma.affymetrix" group.
>> > To unsubscribe from this group and stop receiving emails from it, send
>> > an email to aroma-affymetr...@googlegroups.com.
>> > For more options, visit https://groups.google.com/groups/opt_out.
>> >
>> >

-- 
-- 
When reporting problems on aroma.affymetrix, make sure 1) to run the latest 
version of the package, 2) to report the output of sessionInfo() and 
traceback(), and 3) to post a complete code example.


You received this message because you are subscribed to the Google Groups 
"aroma.affymetrix" group with website http://www.aroma-project.org/.
To post to this group, send email to aroma-affymetrix@googlegroups.com
To unsubscribe and other options, go to http://www.aroma-project.org/forum/

--- 
You received this message because you are subscribed to the Google Groups 
"aroma.affymetrix" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to aroma-affymetrix+unsubscr...@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

Re: [aroma.affymetrix] callAB gives Error in bootstrapTCNandDHByRegion.PairedPSCBS

Reply via email to