Re: [ccp4bb]: Modelling disordered side-chains

Eric Bennett Wed, 10 Jan 2007 22:41:56 -0800




M. Schiltz wrote:

> If this approach was pushed to the extreme, it would imply that bulk solvent atoms should also be explicitly included in the PDB file, because, clearly "the atoms are in the crystal", e.g. refine hundreds of bulk solvent atoms with occupancy = 1.0 and let the B factor reflect the disorder....

I have to agree for the most part. Someone argued against this by saying that side chain atoms are different because you know they're anchored to the protein, so you don't have distance restraints. But if you randomly place a water at position XYZ far away from the protein you're probably going to be closer to the correct average real position of a water oxygen than you would be to the correct average position for the terminal nitrogen of a highly disordered lysine side chain.

And covalent attachment is not a good criterion to use, because the object that is covalently attached is sometimes an entire protein domain. If there is no supporting electron density, hopefully nobody would try to model an entire missing N-terminal domain of a protein into their x-ray structure of the entire protein based on (a) the knowledge that the other domain was there, and (b) someone else's x-ray structure of the N-terminal domain by itself.




Mischa Machius wrote:

> If C-beta is well defined, we have a pretty good idea about where C-gamma is and so on. Contour the 2Fo-Fc map at 0.3 sigma and you will likely see some density. Whether this is noise or signal is a matter of discussion (ask people in Tom Alber's lab), but at least refinement programs have something to base their B values on.

If C-beta is fuzzy and C-gamma is guesstimated based on the position of C-beta, then what do you do with 0.3 sigma density near your guesstimated C-gamma? Do you call it C-delta? Even if that 0.3 sigma density really is signal and not noise, it could be solvent density if your guesstimated C-gamma position is off.




Frances Bernstein wrote:

    I would suggest that the people on this discussion list,
who are all basically crystallographers or sophisticated users,
who are advocating including atoms with occupancy 0.0 should
talk to a biologist PDB user down the hall from them and see if
they even understand what a B value is or what an occupancy
of 0.0 means.

Poor software design contributes to this problem, because even though you might explain what these things are, the biologists are going to forget the B factor is there or forget what it is if the viewer they are using doesn't keep reminding them the data is there. Crystallographers may not realize the full extent of this problem because they are using software designed for crystallography. Biologists, chemists, and even molecular modelers tend not to be using such software. Having personally made the change from crystallography to modeling, it's very apparent to me that crystallographers have had close to zero input into the design of many molecular modeling packages.

Ideally, as Dirk Kostrewa said, people would deposit their electron density maps. Tassos wrote, "i admit i have limited sympathy for PDB users that ignore B values and also cant be bothered to use the EDS". But biologists are not going to learn "O" or even Coot, nor should they have to. If you are a molecular modeler using Schrodinger's suite, you can't easily view density. If you're using Accelrys, are you going to pay the extra money for extra program modules to see electron density, when really IMO that is a key function of a competent PDB viewing module?

I can still drag out "O" when I need to look at density, but none of the other packages I use can view density, and trying to remember how to use "O" when I only run it once or twice a year is a real pain (sure, I could learn Coot, but people like me who only look at maps infrequently aren't going to want to invest the time to learn another software package just to see a map). If I didn't have my x-ray background, I would probably be tempted to conclude it's way too much trouble and give up, which would be a mistake, because I see an awful lot of errors in active sites, and once you suspect you're looking at a sloppy x-ray refinement you really need the density maps. Even so, I am usually too busy to bother with a manual map calculation if the EDS server failed for some reason. Yes, that makes me lazy, but no lazier than crystallographers who don't ensure the EDS server can calculate their maps.

Should I have to read through the text of a PDB file to identify residues with missing atoms or multiple conformations? No. The whole point of a PDB file is to provide input to a program that gives us a graphical view of a chemical structure. To the maximum extent possible all the information in the PDB file should be readily visualized by a good software program. One of the commercial molecular modeling packages does some helpful color coding when it imports a PDB file. Gray atoms have nothing funky going on, orange ones may have incorrect bonding, residues with missing side chains are red, those with multiple conformations are green, etc. This is much better for a biologist/chemist to interpret as opposed to trying to teach them to scan a PDB file for alternate conformations, but even this program with the color coding can't actually access the alternate conformations or display a density map.
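As a rough illustration of the kind of check a viewer could run on import, here is a minimal sketch that scans ATOM/HETATM records using the fixed-column PDB layout and flags residues with alternate conformations, zero-occupancy atoms, or high B factors. The function name `flag_residues`, the flag labels, and the B-factor cutoff of 60 are assumptions made for this example; they do not describe how any particular modeling package works, and detecting missing atoms (which needs a per-residue template) is left out.

```python
from collections import defaultdict


def flag_residues(pdb_lines, high_b=60.0):
    """Map (chain, resseq, icode, resname) -> set of flags for suspicious residues.

    Assumes well-formed ATOM/HETATM records that include the occupancy and
    B-factor columns. The high_b cutoff is an arbitrary illustrative value.
    """
    flags = defaultdict(set)
    for line in pdb_lines:
        if not line.startswith(("ATOM  ", "HETATM")):
            continue
        # Fixed-column fields of the PDB format (0-based slices).
        altloc = line[16]                  # column 17
        resname = line[17:20].strip()      # columns 18-20
        chain = line[21]                   # column 22
        resseq = int(line[22:26])          # columns 23-26
        icode = line[26].strip()           # column 27
        occupancy = float(line[54:60])     # columns 55-60
        b_factor = float(line[60:66])      # columns 61-66

        key = (chain, resseq, icode, resname)
        if altloc != " ":
            flags[key].add("altloc")
        if occupancy == 0.0:
            flags[key].add("zero_occupancy")
        if b_factor > high_b:
            flags[key].add("high_b")
    return dict(flags)
```

A viewer could pass an open file object to `flag_residues` and map each flag to a color, so the user never has to read the file text.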

Suggesting that a biologist/chemist should have to learn the PDB or CIF file format is the wrong approach: it contributes to keeping crystallography inaccessible to non-experts. Remember, you want your biologist and chemist friends to understand and enjoy x-ray structures. They will be more likely to fund your grant proposals if they do. :-)






Ethan Merritt wrote:

We are. At least, those who correctly use available options in refmac or shelx are.
The best results (R, Rfree, geometry) are obtained by explicit inclusion
of hydrogens via the riding-hydrogen model. This is also a basis for the
Molprobity validation tools.

But you still don't include them explicitly in the final model, which is what this discussion is really about.

Has anyone done a large-scale study of whether modeling all geometrically reasonable common rotamers improves R, Rfree, and geometry for various possible definitions of a "disordered side chain"? It sounds from Kevin's comments like no large study has been done. But in the end, that is probably the only way to conclusively resolve this question: by looking at whether making educated guesses for disordered side chains (you'd have to carefully define "disordered") improves the model's agreement with the experimental data. All of our theoretical arguments in this thread wouldn't mean that much in the face of some conclusive evidence one way or the other.
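For reference, the comparison in such a study comes down to the standard crystallographic R factor, R = sum(| |Fobs| - |Fcalc| |) / sum(|Fobs|), evaluated separately on the working reflections (R) and on the held-out test reflections (Rfree). The sketch below assumes the calculated amplitudes are already on the same scale as the observed ones; the function names and the `is_free` flag list are illustrative and not taken from any refinement program.

```python
def r_factor(f_obs, f_calc):
    """Standard R factor over paired amplitudes (assumes a common scale)."""
    numerator = sum(abs(abs(o) - abs(c)) for o, c in zip(f_obs, f_calc))
    denominator = sum(abs(o) for o in f_obs)
    return numerator / denominator


def r_work_and_free(f_obs, f_calc, is_free):
    """Return (R_work, R_free), splitting reflections by the test-set flag.

    Assumes both the working and the test set contain at least one reflection.
    """
    work = [(o, c) for o, c, free in zip(f_obs, f_calc, is_free) if not free]
    test = [(o, c) for o, c, free in zip(f_obs, f_calc, is_free) if free]
    return r_factor(*zip(*work)), r_factor(*zip(*test))
```

Running `r_work_and_free` on the same data for a model with and without the guessed side chains, after re-refining each, gives the pair of numbers such a study would tabulate for every structure.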




--
Eric Bennett
Assistant Director
Center for Drug Design
University of Minnesota
