On Sat, Oct 18, 2008 at 8:20 AM, Edward d'Auvergne
<[EMAIL PROTECTED]> wrote:
> Hi,
>
> Before you sent this message, I was talking to Ben Frank (a PhD
> student in Griesinger's lab) about this exact problem - baseplane RMSD
> noise to volume error. The formula of Nicholson et al., 1992 you
> mentioned makes perfect sense as that's what we came up with too.
> Volume integration over a given area is the sum of the heights of all
> the discrete points in the frequency domain spectrum within that box.
> So the error of a single point is the same as that of the peak height.
> We just have n*m points within this box. And as variances add, not
> standard deviations, then the variance (sigma^2) of the volume is:
>
> sigma_vol^2 = sigma_i^2 * n * m,
>
> where sigma_vol is the standard deviation of the volume, sigma_i is
> the standard deviation of a single point assumed to be equal to the
> RMSD of the baseplane noise, and n and m are the dimensions of the
> box. Taking the square root of this gives the Nicholson et al.
> formula:
>
> sigma_vol = sigma_i * sqrt(n*m).
>
This is the strategy I have used to try and get precision estimates
from peak volumes. As I said earlier, in my hands it does not perform
well. Uncertainties from this method will systematically over-estimate
the precision of strong peaks and underestimate the precision of weak
ones as compared to estimates from duplicate spectra (or perhaps its
the other way around, I don't remember). This may not be evident for
proteins like ubiquitin, where virtually all amides give uniformly
strong peaks in the HSQC, but for proteins with more varied relaxation
behaviour, this can be a major issue. Its important to keep in mind
just how much signal processing goes on between a raw fid (in which
the noise in adjacent points is independent and uncorrelated) and the
spectrum that we integrate (in which, apparently, noise in adjacent
points is not always independent and uncorrelated).
Even apart from this issue, I have always found peak height to give
better results for fitting relaxation data. Heights would be expected
to be less sensitive to all sorts of experimental complications like
imperfect baselines, peak overlap, phase errors, etc. In my hands this
always seems to outweigh the greater precision afforded by peak
volumes.
> However I doubt if many people in the field use this volume
> integration method. I know that this is only one method of 3 in
> Sparky (http://www.cgl.ucsf.edu/home/sparky/manual/peaks.html#Integration).
> The program peakint
> (http://hugin.ethz.ch/wuthrich/software/xeasy/xeasy_m15.html) doesn't
> use all points in the box. And Cara does things differently again
> (http://www.cara.ethz.ch/Wiki/Integration). So even if the user has
> the baseplane RMSD measured, I have no idea if they can work out how
> many points in the spectra were use in the integration. And if a
> Gaussian or Lorentzian fit is used, then this formula definitely
> cannot be used. Maybe support for the different methods of the
> different programs can be added one by one to relax, or maybe ask the
> user which type of fit was used and how many points were integrated,
> but this is going to be difficult to do.
>
> Regards,
>
> Edward
>
>
> On Fri, Oct 17, 2008 at 6:15 PM, Sébastien Morin
> <[EMAIL PROTECTED]> wrote:
>> Hi all,
>>
>> Here is a reference some people in our lab have been using to extract
>> errors on NOE calculated from volumes.
>>
>> Nicholson, Kay, Baldisseri, Arango, Young, Bax, and Torchia (1992)
>> Biochemistry, 31: 5253-5263.
>>
>> And here is the interesting part...
>>
>> ==
>> To estimate the error in NOE values, the standard deviation in baseline
>> noise (a) was determined for each spectrum and expanded into a standard
>> deviation in volume given by
>>
>>Delta V = sigma sqrt(nk)
>>
>> where n is the number of points in the f1 dimension and k is the number
>> of points in the f2 dimension of a typical peak at the baseline. The
>> error associated with the NOE value for peak j is then given by
>>
>> error_j = (V_A_j / V_B_j) * sqrt[(Delta_V_A / V_A_j)^2 + (Delta_V_B /
>> V_B_j)^2]
>>
>> where V_A and V_B denote the volume of peak j in the presence and
>> absense of NOE enhancement, respectively, and Delta_V_A and Delta_V_B
>> denote the standard deviations in volume for the spectra recorded in the
>> presence and absense of NOE enhancement, respectively.
>> ==
>>
>> I'm not an expert in statistics, so won't judge the value of this
>> approach. However, this is an approach some people have been using...
>>
>> Ciao !
>>
>>
>> Séb
>>
>>
>>
>>
>> Edward d'Auvergne wrote:
>>> Oh, I forgot about the std error formula. Is where the sqrt(2) comes
>>> from? Doh, that would be retarded. Then I know someone who would
>>> require sqrt(3) for the NOE spectra! Is that really what Palmer
>>> meant, that std error is the same as "the standard deviation of the
>>> differences between t