Re: [ccp4bb] REFMAC5 residues with bad geometry

James Holton Tue, 27 Mar 2012 12:34:45 -0700

Try this:

1) take your favorite PDB file and set all the B factors to ~80 (reducesseries-termination errors)

2) use sfall/fft in CCP4 to calculate structure factors to 4A resolution
3) use sftools to add a "SIGF" column (0.1 will do) to make refmac5 happy

4) refine the "perfect" model against these fake data for ~5 cycles(with "solvent no")

5) load this up in coot and contour at 1 sigma
6) repeat the refinement with a PDB file containing only main chain.

7) repeat the refinement after putting all the side chains in their mostlikely (Ponder-Richards) rotamers.


Ask yourself these questions:
1) can you "see" the side chains?
2) can you "see" the waters?
3) what are the R factors from these refinements?

Answers: 1) no, 2) no, 3) ~3% for "perfect", ~50% for "main chain", and~36% for "likely rotamer"

Now ask yourself: even though there is "no density" for side chains andwaters, is there really "no evidence" that they exist?

The point I am trying to make here is that you EXPECT side chains topoke out of density at low resolution, even under ideal conditions(perfect phases). For example, the C-deltas of Leu will "breach" the1-sigma contour at around 2.8A resolution and worse. You can see thisin my old movie:

http://bl831.als.lbl.gov/~jamesh/movies/index.html#reso

When it comes to building, yes, once an atom dips below the 1-sigmacontour it gets harder and harder to know exactly where it is, but itdoes have to be somewhere. Somewhere nearby. Formally, there is "priorknowledge" of bond lengths, etc. at play. And if you know that there isone copy of a given atom in every unit cell of the crystal, thenoccupancy < 1 is inappropriate. Much better to use B = 999, whichmodels the atom as a Gaussian with the electrons spread over an areaabout 3.5 A wide. This is roughly the range your average side chainatom has available to it, given that it is attached to the main chain bycovalent bonds.

Of course, a more "Bayesian" model for the "I don't know what therotamer is" situation would be to build in ALL possible rotamers, withoccupancies equal to their Ponder-Richards probabilities. Someimprovement to this initial "guess" would no doubt be made by usingconstrained occupancy refinement of rigid-body side chains.Unfortunately, this is impossible with any refinement program I knowabout, since refmac, phenix.refine, etc. don't support more than 3 or 4alternate conformers.

Building in all possible conformers and using the occupancy as a"p-value" would also help solve the problem of the careless and/oruneducated over-interpreting PDB files. Which is the "right one"? Goodquestion! I think its time we started dispelling the myth of thesingle-conformer protein anyway.


-James Holton
MAD Scientist

On 3/26/2012 7:40 AM, Ed Pozharski wrote:

On Mon, 2012-03-26 at 10:17 -0400, Gregory Bowman wrote:

But what about the issue of resolution? As was previously pointed out,
at say 3.2 Å resolution, many side chains will fail to fit, but it
doesn't seem appropriate to trim them all down.

Why is it inappropriate to trim them down?  Sometimes at low resolution
all one can be confident about is the backbone trace.

Just to be clear, I am talking about atoms whose positions are not
supported by electron density, i.e. where difference map in the absence
of the side chain is featureless.  I assume that is the likely situation
when one would set occupancy to zero.

Cheers,

Ed.

Re: [ccp4bb] REFMAC5 residues with bad geometry

Reply via email to