***  For details on how to be removed from this list visit the  ***
***          CCP4 home page http://www.ccp4.ac.uk         ***


Dear all,

maybe I'm missing the point completely, but aren't we comparing apples
and pears here? What we have in this type of argument is

1. "RMSD on bond lengths"

   I guess this is what we have in the PDB headers as

     RMS DEVIATIONS FROM IDEAL VALUES

   and what is defined in mmCIF as

     _refine_ls_restr.dev_ideal

     For the given parameter type, the root-mean-square deviation
     between the ideal values used as restraints in the least-squares
     refinement and the values obtained by refinement. For instance,
     bond distances may deviate by 0.018 A (r.m.s.) from ideal values
     in the current model.

   See:

     
http://mmcif.pdb.org/dictionaries/mmcif_std.dic/Items/_refine_ls_restr.dev_ideal.html

2. the Engh & Huber set of parameters

   These are given as a distribution (mean and sigma) for different
   types of bonds (Table 2 in paper):

     "Averages, standard deviations and standard errors were
     calculated, with and without elimination of values outside four
     standard deviations from the mean."

   See:

     http://journals.iucr.org/a/issues/1991/04/00/li0061/li0061.pdf

These two quantities are something completely different - at least as
far as I understand it. The rms deviation from the mean of a
distribution is only identical to the standard deviation of that
*same* distribution.

I don't understand why the RMSD on bond lengths should/can be compared
to the standard deviation of the Engh&Huber distributions to see if a
structure is well restrained?

If we would calculate the distribution of bond lengths within a
refined structure, we would also get a mean and a sigma. And this
could be compared to the Engh&Huber parameters, since it measures the
same thing.

However, there is some sense in arguing about well restrained
structures while looking at the RMSD on bond lengths:

  - high resolution => more observations than parameters => the data
    tells us what the bond-lengths should be => we can deviate from
    the Engh&Huber mean values => larger rmsd(bond)

  - low resolution => few observations => the data doesn't tell us
    what the bond-lengths should be => we can't really deviate from
    the Engh&Huber mean values (prior) => smaller rmsd(bond)

But what should be those values? I guess at 1A resolution I'd expect a
rmsd(bond) of 0.02 or larger. And at 3.5A I'd expect a rmsd(bond) of
0.005 or smaller. But these are personal choices (same as I/sig(I)
or Rmerge cut-offs in data-reduction).

But maybe I'm completely wrong here ... correct me please?

Cheers

Clemens


On Wed, Aug 23, 2006 at 09:50:04AM +0200, Fred. Vellieux wrote:
> ***  For details on how to be removed from this list visit the  ***
> ***          CCP4 home page http://www.ccp4.ac.uk         ***
> 
> 
> On Tue, 22 Aug 2006, Bernhard Rupp wrote:
> 
> > ***  For details on how to be removed from this list visit the  ***
> > ***          CCP4 home page http://www.ccp4.ac.uk         ***
> > 
> > 
> > > at the resolution you're working at the RMSD on bond 
> > > lengths should be ca. 0.012, not 0.02 A.
> > 
> > Interesting statement. I agree that for 
> > very well determined small molecule structures, the
> > overall bond length variation is about 0.014 A or so.
> > 
> > That fits quite well with what you say. It just means 
> > that the structure should be restrained to reflect
> > reality.
> > 
> > But: What does coordinate rmsd exactly have to do with 
> > resolution? If the 3.5 A structure has 0.012 A
> > rmsd, does that mean that a 1.2 A structure
> > should have 0.005? I have a feeling that would
> > be a serious case of over-restraining.
> > 
> > Should not all structures reflect the same 
> > 'real world' rmsd (plus minus some minor individuality) 
> > if the restraint weights are properly selected?
> > 
> > Maybe the small molecule/shelxl fellows may chime in here:
> > What is the expected rmsd for a high res protein 
> > structure vs. a low res? I had a feeling that
> > super low rmsd is partly abused as a sign of crystallographic
> > prowess, like building fantasy stuff into nonexistent density
> > just to have no missing residues &c &c &c... 
> > 
> > Thx, br
> 
> Hi Bernhard,
> 
> What I wrote means that unless you have superhigh resolution (in which
> case you can simply forget about library  restraining --- you can
> even count the number of electrons for one atom to assign its type) the
> model should reflect reality.
> 
> You state that reality is RMSD = 0.014, I thought it was 0.012 A. Small
> mistake from my part. Anyway the message was simply to state that
> targetting an RMSD of 0.02 meant that the geometry was not restrained
> tightly enough to fit what one expects at a resolution where unrestrained
> refinement cannot be done (who would attempt unrestrained refinement at
> 3.4 A refinement?).
> 
> Fred.
> 
> -- 
> 
> s-mail: F.M.D. Vellieux (B.Sc., Ph.D.)
>         Institut de Biologie Structurale J.-P. Ebel CEA CNRS UJF
>         41 rue Jules Horowitz
>         38027 Grenoble Cedex 01
>         France
> Tel:    (+33) (0) 438789605
> Fax:    (+33) (0) 438785494
> e-mail: [EMAIL PROTECTED]
> 

-- 

***************************************************************
* Clemens Vonrhein, Ph.D.     vonrhein AT GlobalPhasing DOT com
*
*  Global Phasing Ltd.
*  Sheraton House, Castle Park 
*  Cambridge CB3 0AX, UK
*--------------------------------------------------------------
* BUSTER Development Group      (http://www.globalphasing.com)
***************************************************************

Reply via email to