Re: [ccp4bb] Disorder or poor phases?

James Holton Fri, 13 Apr 2012 11:32:52 -0700

Francis,

I think in the cases you describe the region in question is disordered.Time and time again I have users coming to my beamline wanting to cleanup a "questionable region" by getting experimental phases. Ahh! Ifonly I had a nickle for each one. Oh wait, I suppose I kind of do? Itake that back! Go MAD everyone!

Much as I hate to discourage people from using my favorite technique,Tim is right: phases are not region-specific in electron density maps.Dale does make a good point that there is such a thing as "model bias"and one can argue that experimental phases don't have it. But, this isonly true if you have not yet applied solvent flattening. How long hasit been since you looked at a "raw" experimentally-phased map (beforesolvent flattening)? I'm willing to bet a while. With very fewexceptions, raw experimental phases are lousy. We have actually becomequite dependent on density modification to clean them up. In fact,solvent flattening is the only reason why SAD works at all.

However, you CAN use anomalous differences to clear up disorderedregions in a different way. Something I started calling "SeMetscanning" a number of years ago. A few of my users have done this, anda good example of it is Figure 3 of Huang et al. 2004(doi:10.1038/nsmb826). Basically, you mutate residues in the disorderedregion one at a time to SeMet, and look at phased anomalous differenceFourier (PADF) maps. These maps are surprisingly clear, even when theanomalous difference signal is so weak as to make experimental phasinghopeless. Yes, the best phases to use for PADF maps are model phases,but, as always, it is prudent to refine the model after omitting thething you are looking for before calculating such phases.

Another way to get residue-specific labeling for low-resolution chaintracing is radiation damage. If you expose for the right amount oftime, Asp and Glu side chains will be specifically "burnt off", but notAsn and Gln. You will also see Met loosing its head, etc. So, as longas you have read Burmeister (2000), an Fo-Fo map of damaged vs undamagedcan be used to guide sequence assignment, even at 4.5 A and worse.

Anyway, when it comes to the question of "is it disordered or is itmodel bias?", I think it is usually the former. It is very difficult tomake "model bias" suppress a region that is actually well-ordered. Tryit! After all, this is the whole reason why we bother looking at fo-fcmaps. Then again, it is always possible to have a model so bad that thephase error is enough to squash anything. An excellent example of thiscan be found in the Book of Fourier. Taking amplitudes from the imageof a cat, you can see what happens when you use the phases of a duck:

http://www.ysbl.york.ac.uk/~cowtan/fourier/picduckcatfft.gif
as opposed to what happens if you use the phases of a manx:
http://www.ysbl.york.ac.uk/~cowtan/fourier/piccatmanx2.gif

A manx is a species of cat that doesn't have a tail, so no animals wereharmed in obtaining these phases. My point here is that the cat's tailcan be seen quite readily in the 2fo-fc map if most of the structure isalready "right", but if your model is completely unrelated to the truestructure (fitting a duck into a cat-shaped hole), then everything is"in the noise".

Real structures are usually somewhere between these two extremes, and Ithink an important shortcoming in modern crystallography is that wedon't have a good quantitative description of this middle-ground. Weall like to think we know what "model bias" is, but we don't exactlyhave "units" for it. Should we be using a scale of 0 to 1? Or perhaps"duck" to "cat"? Yes, I know we have "figure of merit", but FOM is notregion-specific.

In my experience, as long as you have ~50% of the electrons in the"right" place (and none of them in the "wrong" place), then you cangenerally trust that the biggest difference feature in the fo-fc map is"real", and build from there. As the model becomes more complete, thephases should continue to get better, not worse. Eventually, this doesbreak down, although I'm not really sure why. With small molecules, themaximum fo-fc peak keeps getting bigger (on a sigma scale) as you addmore and more atoms, and the biggest one you will ever see is the lastone. For macromolecules, the difference features keep getting smallerand smaller as you build. Perhaps small errors (like non-Gaussianatomic displacement distributions being modeled as Gaussians) slowlyaccumulate? Perhaps there are other sources of systematic error that wedon't yet fully understand? Eventually, for whatever reason, you stopbuilding. Having electrons in the wrong place is about twice as bad asnot having them at all, which I think is why we trim models soaggressively for molecular replacement, and also why we are so reticentto model in things that we are not "sure" about. Disordered regions, ofcourse, will always lie at a lower level of electrons/A^3 than orderedregions, and therefore will be the last things to show up as the "toppeak" in the fo-fc map. They will also be the last things to poke theirheads above the 1-sigma contour in a 2fo-fc map, but that does not meanthey are "not there". You can lower the map contour and see them easilyenough (even in 3bcl). The trick is having some kind ofstatistically-sound rule for "being sure". Otherwise, we might startseeing "map contour creep", just as we currently see "R factor creep" inhigh-profile journals.


-James Holton
MAD Scientist


On 4/10/2012 1:04 PM, Francis E Reyes wrote:

Dale

Thank you for the case study. I will certainly remember it when I next see:

"I don't see density for these atoms therefore they must be
disordered."

You do mention though, that when you were able to assign the sequence to the 
beta sheets, that the loop regions became clear.

I consider the case (which a majority of cases seem to be), where the author 
has built and sequence assigned 95% of the ASU, but is unable to model a loop 
region. One possibility is that the loop is truly disordered (95% of the ASU is 
built and is presumably right), the other possibility is that there's an 
inherent error in the existing structure that is affecting the interpretation 
of the loop region. The errors are probably extremely subtle and distributed 
throughout the model (think of the improvements DEN refinement gave for the 
rerefinement of p97).

I guess in either case, because of the dependency of the map on the existing 
set of phases it's difficult to determine whether it's truly disordered or not.

P.S.: The author should not look at an 2fofc-map but a
sigma-A-weighted map to reduce model bias.

Tim,

I assume a sigmaA weighted 2Fo-Fc map (which I believe is the default for most 
crystallographic refinement packages).

F

---------------------------------------------
Francis E. Reyes M.Sc.
215 UCB
University of Colorado at Boulder

Re: [ccp4bb] Disorder or poor phases?

Reply via email to