Not that rules of thumb always have to have a rationale, nor that they're
always correct - but it would seem that noise in the data (of which Rmerge
is an indicator) should have a significant relationship with the
Rfree-Rwork difference, since Rfree is not (or should not be, if the test
set is selected correctly)
subject to noise fitting. This rule is easily broken if one refines against
very noisy data (e.g. "that last shell with Rmerge of 55% and <I/sigmaI>
ratio of 1.3 is still good, right?") or if the structure is overfit. The
rule is only indicative (i.e. one should get really worried if
Rfree-Rwork looks very different from Rmerge), and it breaks down at very
high and very low resolution (a more complete picture is given by GK and
shown in BR's book).
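
To put the rule of thumb in concrete terms, here is a minimal Python sketch
that compares the Rfree-Rwork gap with Rmerge. The function name and the
tolerance of 3 percentage points are assumptions made purely for
illustration; nothing in this thread defines a cutoff, and the result is a
rough sanity check at best, not a validation criterion.

```python
def r_gap_check(r_work, r_free, r_merge, tolerance=0.03):
    """Compare the Rfree - Rwork gap with Rmerge.

    All R values are fractions (0.21 means 21%). The tolerance is an
    arbitrary, hypothetical choice and should be adjusted to taste.
    """
    gap = r_free - r_work        # the Rfree - Rwork difference
    deviation = gap - r_merge    # how far the gap sits from Rmerge
    return {
        "gap": gap,
        "deviation": deviation,
        "suspicious": abs(deviation) > tolerance,
    }


# Gap of about 8% against an Rmerge of 7%: roughly in line with the rule.
consistent = r_gap_check(r_work=0.21, r_free=0.29, r_merge=0.07)

# Gap of about 12% against an Rmerge of 5%: worth a closer look
# (overfitting, or noisy data kept in the refinement).
divergent = r_gap_check(r_work=0.20, r_free=0.32, r_merge=0.05)
```

A large positive deviation is the case to worry about, and, as noted above,
the comparison means little at very high or very low resolution.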

Since the selection of data and refinement procedures are subject to the
decisions of the practitioner, I suspect that the extreme divergence shown
in the figures that you refer to is probably the result of our own
collective decisions. I have no proof, but I suspect that if a large enough
section of the PDB were to be re-refined using the same methods and the same
data trimming practices, the spread would be considerably narrower.
That'd be somewhat hard to do - but may be doable now given the abundance of
auto-building and auto-correcting algorithms.

Artem

On Mon, Oct 25, 2010 at 9:07 PM, Bernhard Rupp (Hofkristallrat a.D.) <
[email protected]> wrote:

> And the rationale for that rule being exactly what?
>
>
>
> For stats, see figures 12-23, 12-24
>
> http://www.ruppweb.org/garland/gallery/Ch12/index_2.htm
>
>
>
> br
>
>
>
> *From:* CCP4 bulletin board [mailto:[email protected]] *On Behalf Of*
> Artem Evdokimov
> *Sent:* Monday, October 25, 2010 6:36 PM
> *To:* [email protected]
> *Subject:* Re: [ccp4bb] diverging Rcryst and Rfree
>
>
>
> http://www.mail-archive.com/[email protected]/msg04677.html
>
> as well as some notes in the older posts :)
>
> As a very basic rule of thumb, Rfree-Rwork tends to be around Rmerge for
> the dataset for refinements that are not overfitted.
>
> Artem
>
> On Mon, Oct 25, 2010 at 4:10 PM, Rakesh Joshi <[email protected]> wrote:
>
> Hi all,
>
> Can anyone comment, in general, on diverging Rcryst and Rfree values
> (say, >7%) for structures at fairly low resolution (2.5-2.9 angstroms)?
>
> Thanks
> RJ
>
>
>
