On 1/13/2022 11:14 AM, Tristan Croll wrote:
(please don’t actually do this)
Too late! I've been doing that for years. What happens, of course, is
the "geometry" improves, but the R factors go through the roof. This I
expect comes as no surprise to anyone who has played with the "weight"
parameters in refinement, but maybe it should? What is it about our
knowledge of chemical bond lengths, angles, and radii that is
inconsistent with the electron density of macromolecules, but not small
molecules? Why do macro-models have a burning desire to leap away from
the configuration we know they adopt in reality? If you zoom in on
those "bad clashes" individually, they don't look like something that is
supposed to happen. There is a LOT of energy stored up in those little
springs. I have a hard time thinking that's for real. The molecule is
no doubt doing something else and we're just not capturing it properly.
There is information to be had here, a lot of information.
This is why I too am looking for an all-encompassing "geometry score".
Right now I'm multiplying other scores together:
score = (1+Clashscore)*sin(worst_omega)*1./(1+worst_rama)*1/(1+worst_rota)
*Cbetadev*worst_nonbond*worst_bond*worst_angle*worst_dihedral*worst_chir*worst_plane
where things like worst_rama is the "%score" given to the worst
Ramachandran angle by phenix.ramalyze, and worst_bond is the largest
"residual" reported among all the bonds in the structure by molprobity
or phenix.geometry_minimization. For "worst_nonbond" I'm plugging the
observed and ideal distances into a Leonard-Jones6-12 potential to
convert it into an "energy" that is always positive.
With x-ray data in hand, I've been multiplying this whole thing by Rwork
and trying to find clever ways to minimize the product. Rfree is then,
as always, the cross-check.
Or does someone have a better idea?
-James Holton
MAD Scientist
On 1/13/2022 11:14 AM, Tristan Croll wrote:
Hard but not impossible - even when you *are* fitting to low-res
density. See
https://twitter.com/crolltristan/status/1381258326223290373?s=21 for
example - no Ramachandran outliers, 1.3% sidechain outliers,
clashscore of 2... yet multiple regions out of register by anywhere up
to 15 residues! I never publicly named the structure (although I did
share my rebuilt model with the authors), but the videos and images in
that thread should be enough to illustrate the scale of the problem.
And that was *with* a map to fit! Take away the map, and run some MD
energy minimisation (perhaps with added Ramachandran and rotamer
restraints), and I think it would be easy to get your model to fool
most “simple” validation metrics (please don’t actually do this). The
upshot is that I still think validation of predicted models in the
absence of at least moderate-resolution experimental data is still a
major challenge requiring very careful thought.
— Tristan
On 13 Jan 2022, at 18:41, James Holton <jmhol...@lbl.gov> wrote:
Agree with Pavel.
Something I think worth adding is a reminder that the MolProbity
score only looks at bad clashes, ramachandran and rotamer outliers.
MPscore=0.426∗ln(1+clashscore)+0.33∗ln(1+max(0,rota_out−1))+0.25∗ln(1+max(0,rama_iffy−2))+0.5
It pays no attention whatsoever to twisted peptide bonds, C-beta
deviations, and, for that matter, bond lengths and bond angles. If
you tweak your weights right you can get excellent MP scores, but
horrible "geometry" in the traditional bonds-and-angles sense. The
logic behind this kind of validation is that normally nonbonds and
torsions are much softer than bond and angle restraints and therefore
fertile ground for detecting problems. Thus far, I am not aware of
any "Grand Unified Score" that combines all geometric considerations,
but perhaps it is time for one?
Tristan's trivial solution aside, it is actually very hard to make
all the "geometry" ideal for a real-world fold, and especially
difficult to do without also screwing up the agreement with density
(R factor). I would argue that if you don't have an R factor then
you should get one, but I am interested in opinions about alternatives.
I.E. What if we could train an AI to predict Rfree by looking at the
coordinates?
-James Holton
MAD Scientist
On 12/21/2021 9:25 AM, Pavel Afonine wrote:
Hi Reza,
If you think about it this way... Validation is making sure that the
model makes sense, data make sense and model-to-data fit make sense,
then the answer to your question is obvious: in your case you do not
have experimental data (at least in a way we used to think of it)
and so then of these three validation items you only have one,
which, for example, means you don’t have to report things like
R-factors or completeness in high-resolution shell.
Really, the geometry of an alpha helix does not depend on how you
determined it: using X-rays or cryo-EM or something else! So, most
(if not all) model validation tools still apply.
Pavel
On Mon, Dec 20, 2021 at 8:10 AM Reza Khayat <rkha...@ccny.cuny.edu>
wrote:
Hi,
Can anyone suggest how to validate a predicted structure?
Something similar to wwPDB validation without the need for
refinement statistics. I realize this is a strange question
given that the geometry of the model is anticipated to be fine
if the structure was predicted by a server that minimizes the
geometry to improve its statistics. Nonetheless, the journal has
asked me for such a report. Thanks.
Best wishes,
Reza
Reza Khayat, PhD
Associate Professor
City College of New York
Department of Chemistry and Biochemistry
New York, NY 10031
------------------------------------------------------------------------
To unsubscribe from the CCP4BB list, click the following link:
https://www.jiscmail.ac.uk/cgi-bin/WA-JISC.exe?SUBED1=CCP4BB&A=1
<https://www.jiscmail.ac.uk/cgi-bin/WA-JISC.exe?SUBED1=CCP4BB&A=1>
------------------------------------------------------------------------
To unsubscribe from the CCP4BB list, click the following link:
https://www.jiscmail.ac.uk/cgi-bin/WA-JISC.exe?SUBED1=CCP4BB&A=1
<https://www.jiscmail.ac.uk/cgi-bin/WA-JISC.exe?SUBED1=CCP4BB&A=1>
------------------------------------------------------------------------
To unsubscribe from the CCP4BB list, click the following link:
https://www.jiscmail.ac.uk/cgi-bin/WA-JISC.exe?SUBED1=CCP4BB&A=1
<https://www.jiscmail.ac.uk/cgi-bin/WA-JISC.exe?SUBED1=CCP4BB&A=1>
########################################################################
To unsubscribe from the CCP4BB list, click the following link:
https://www.jiscmail.ac.uk/cgi-bin/WA-JISC.exe?SUBED1=CCP4BB&A=1
This message was issued to members of www.jiscmail.ac.uk/CCP4BB, a mailing list
hosted by www.jiscmail.ac.uk, terms & conditions are available at
https://www.jiscmail.ac.uk/policyandsecurity/