Re: [ccp4bb] an over refined structure

Phil Jeffrey Thu, 07 Feb 2008 12:11:26 -0800

Here I will disagree. R-free rewards you for putting in atom in densitywhich an atom belongs in. It doesn't necessarily reward you for puttingthe *right* atom in that density, but it does become difficult to dothat under normal circumstances unless you have approximately the rightstructure.

However in the case of multi-copy refinement at low resolution, therefinement is perfectly capable of shoving any old atom in densitycorresponding to any other old atom if you give it enough leeway.Remember that there's a big difference between R-free for a single copy(45%) and a 16-fold multicopy (38%) in MsbA's P1 form, and almost thesame amount (41% vs 33%) with MsbA's P21 form. (These are E.coli andV.cholerae respectively). Both single copy and multicopy refinementswere NCS-restrained, as far as I know.

So there's evidence, w/o simulation, that the 12-fold or 16-foldmulticopy refinements are worth 7-8% in R-free, and I'm doubtful thatNCS can generate that sort of gain in either crystal form. I'vecertainly never seen that in my own experience at low resolution.

I've been meaning to put online the Powerpoint from the CCP4 talk withall these numbers in it, but I regret it's sitting on my iBook at homeas of writing.


Phil Jeffrey

Dean Madden wrote:

It is true that multicopy refinement was essential for the suppressionof Rwork. However, the whole point of the Rfree is that it is supposedto be independent of the number of parameters you're refining. Simplythrowing multiple copies of the model into the refinement shouldn't haveaffected Rfree, IF IT WERE TRULY "FREE".
It was almost certainly NCS-mediated spillover that allowed themulticopy, parameter-driven reduction in Rwork to pull down the Rfreevalues as well. The experiment is probably not worth the time it wouldtake to do, but I suspect that if MsbA and EmrE test sets had beenchosen in thin shells, then Rfree wouldn't have shown nearly the"improvement" it did.
Dean


Phil Jeffrey wrote:
While NCS probably played a role in the first crystal form of MsbA(P1, 8 monomers), this is also the one that showed the greatestimprovement in R-free once the structure was correctly redetermined(7% or 14% depending on which refinement protocols you compare).
The other crystal form of MsbA and the crystal forms of EmrE didn'thave particularly high-copy NCS (2 dimers, 4 monomers, dimer, 2tetramers) and the R-frees were somewhat comparable in all cases(31-36% for the redetermined structures).
The *major* source of the R-free suppression in all these cases withthe inappropriate use of multi-copy refinement at low resolution.
Phil Jeffrey
Princeton


Dean Madden wrote:
Hi Dirk,
I disagree with your final sentence. Even if you don't apply NCSrestraints/constraints during refinement, there is a serious risk ofNCS "contaminating" your Rfree. Consider the limiting case in whichthe "NCS" is produced simply by working in an artificially lowsymmetry space-group (e.g. P1, when the true symmetry is P2): in thiscase, putting one symmetry mate in the Rfree set, and one in theRwork set will guarantee that Rfree tracks Rwork. The same effectapplies to a large extent even if the NCS is not crystallographic.
Bottom line: thin shells are not a perfect solution, but if NCS ispresent, choosing the free set randomly is *never* a better choice,and almost always significantly worse. Together with multicopyrefinement, randomly chosen test sets were almost certainly a majorcontributor to the spuriously good Rfree values associated with theretracted MsbA and EmrE structures.
Best wishes,
Dean

Dirk Kostrewa wrote:
Dear CCP4ers,
I'm not convinced, that thin shells are sufficient: I think, inprinciple, one should omit thick shells (greater than the diameterof the G-function of the molecule/assembly that is used to describeNCS-interactions in reciprocal space), and use the inner thin layerof these thick shells, because only those should be completelyindependent of any working set reflections. But this would be too"expensive" given the low number of observed reflections that oneusually has ...However, if you don't apply NCS restraints/constraints, there is noneed for any such precautions.
Best regards,

Dirk.

Am 07.02.2008 um 16:35 schrieb Doug Ohlendorf:
It is important when using NCS that the Rfree reflections beselected isdistributed thin resolution shells. That way application of NCSshould not
mix Rwork and Rfree sets.  Normal random selection or Rfree + NCS
(especially 4x or higher) will drive Rfree down unfairly.

Doug Ohlendorf

-----Original Message-----
From: CCP4 bulletin board [mailto:[EMAIL PROTECTED] On Behalf Of
Eleanor Dodson
Sent: Tuesday, February 05, 2008 3:38 AM
To: [email protected] <mailto:[email protected]>
Subject: Re: [ccp4bb] an over refined structure
I agree that the difference in Rwork to Rfree is quite acceptableat your resolution. You cannot/ should not use Rfactors as acriteria for structure correctness.As Ian points out - choosing a different Rfree set of reflectionscan change Rfree a good deal.certain NCS operators can relate reflections exactly making it hardto get a truly independent Free R set, and there are other reasonsto make it a blunt edged tool.
The map is the best validator - are there blobs still not fitted?(maybe side chains you have placed wrongly..) Are there manypositive or negative peaks in the difference map? How well does theNCS match the 2 molecules?
etc etc.
Eleanor

George M. Sheldrick wrote:
Dear Sun,
If we take Ian's formula for the ratio of R(free) to R(work) fromhis paper Acta D56 (2000) 442-450 and make some reasonableapproximations,
we can reformulate it as:

R(free)/R(work) = sqrt[(1+Q)/(1-Q)]  with  Q = 0.025pd^3(1-s)

where s is the fractional solvent content, d is the resolution, p is
the effective number of parameters refined per atom after allowingfor
the restraints applied, d^3 means d cubed and sqrt means square root.
The difficult number to estimate is p. It would be 4 for anisotropic refinement without any restraints. I guess that p=1.5might be an appropriate value for a typical protein refinement(giving an R-factorratio of about 1.4 for s=0.6 and d=2.8). In that case, yourR-factor ratio of 0.277/0.215 = 1.29 is well within the allowedrange!
However it should be added that this formula is almost aself-fulfilling prophesy. If we relax the geometric restraints we
increase p, which then leads to a larger 'allowed' R-factor ratio!

Best wishes, George


Prof. George M. Sheldrick FRS
Dept. Structural Chemistry,
University of Goettingen,
Tammannstr. 4,
D37077 Goettingen, Germany
Tel. +49-551-39-3021 or -3068
Fax. +49-551-39-2582
*******************************************************
Dirk Kostrewa
Gene Center, A 5.07
Ludwig-Maximilians-University
Feodor-Lynen-Str. 25
81377 Munich
Germany
Phone:  +49-89-2180-76845
Fax:  +49-89-2180-76999
E-mail: [EMAIL PROTECTED]<mailto:[EMAIL PROTECTED]>
*******************************************************

Re: [ccp4bb] an over refined structure

Reply via email to