Re: [ccp4bb] phenix.refine with ligand with ambiguous electron density

Dale Tronrud Thu, 03 Dec 2020 00:11:04 -0800

Hi,

Dr Nicholls brings up many interesting points, but doesn't touch onthe major point I had hoped to make in my letter. Whenever you startmaking multiple tests of your hypothesis you have to evaluate each ofthose tests with a higher standard than you would if you only appliedone. If you take a survey of the amount of fat people eat along withtheir history of heart disease you can calculate a correlation and findit significant with a p value of 0.05. If, instead, you perform asurvey asking for twenty different dietary behaviors and twenty healthoutcomes and find a correlation between eating fat and heart disease youneed a much higher "signal" to determine its significance. You justmade 400 comparisons and a p of 0.05 allows 20 spurious correlations toappear significant.

If you are exploring your data set to decide if a compound hasbound, and your try several different refinement programs and calculateseveral different map types based on the results of those refinements,and then adjust the blur of each map, and pick the map with thestrongest peak in the putative binding site, you have to consider thesignificance of that peak height to be less than if you had justcalculated one map and got that same height.

Ignoring this counterintuitive fact has resulted in a huge number ofstudies in many fields to be published that ultimately turned out to notbe reproducible. It likely has also resulted in the deposition of a lotof "complex" models in the PDB that aren't correct.

Yes, I am arguing for an ideal, hoping to pull some of you overtoward my side a bit. I certainly understand that one has to beflexible when solving a difficult problem, but you can't ignore thatthis "flexibility" has significant consequences for understanding theresults of your work.

Dr Nicholls' letter brings up a related topic which I'd like toexplore. His letter repeatedly mentions the importance of "intuition"when interpreting a map. Yes, the power of human intuition, and ourinability to replicate it in silico is the reason we are still staringat maps in Coot. Intuition is a remarkable tool which, by its nature,is difficult to describe.

Yet, no one is born with an innate intuition for interpretingelectron density maps. Intuition is acquired thru practice. Practiceis not simple repetition, however. You can't become proficient inshooting basketball hoops by simply repeatedly throwing a basketball onthe roof of your garage. You have to have a proper backboard and ahoop. Now, after repeatedly throwing the ball and "feeling" thedifference between it going through the hoop and not, you will developthe ability to make a basket w/o really thinking about it. You willhave developed an intuition for achieving that task.

There are two caveats. First, you have to actually watch the ballgo through the hoop. If you close your eyes right after your throw youwill never develop a useful skill. It is the feedback from the successor failure of each attempt that makes it practice. Second, no matterhow much time you spend shooting baskets, you will never get better atdribbling the ball. Good practice allows you to develop intuition, butonly intuition about that task.

Let's say you are working on a project, but having difficultyinterpreting your map at some critical location. You ask around andlearn of some spiffy new map calculation and you want to try it. Whileyou certainly can calculate the map, you have no intuition on how tointerpret it. You have not practiced with that type of map.

It may look similar to the maps you've looked at before, but thatsimilarity can be a trap. By now a large number of us here on the BBhave had the experience of looking at a high resolution electrostaticpotential (ESP) map and "feeling" that something is wrong with it. Thecarbonyl oxygen bumps are too small and the acid groups are oddly weak.Wow, those magnesium ions really stand out -- Maybe they're potassiuminstead? No, there is nothing wrong with the ESP map. The fault iswith our intuition which was based on many, many hours of looking at EDmaps. To interpret ESP maps you have to practice with a bunch of ESPmaps first.

You cannot develop intuition for the spiffy map calculated from yourproject's data since you don't know its correct interpretation -- Itcannot give you feedback. Before you calculate this map for your datayou should calculate versions for many other *completed* projects andget a "feel" for what that kind of map shows under differentcircumstances. Practice, practice, practice, then you will be ready toreturn to your little mystery and be able to apply your, newly acquired,intuition.

Yes, I try new refinement programs - But first I run refinement withthem on familiar proteins. Yes, I try new styles of map calculations -But first I calculate those maps for cases where I know the answer.I've refined a fair number of structures, probably not as many as mostof you, but at the end of a refinement I take the answer and go back tothe original maps. Looking at those maps in light of the answer is whatimproves my map interpretation skills, such as they are, the most.

All of my practice has been with ED (and some ESP) maps of betterthan 3 A resolution. Despite all the intuition I can bring to bear onthem, when it comes to a 4 A resolution map I'm no better than anundergrad.

Your first experience with a new technique should never be with yourcurrent project's data. You should work to add that technique to yourtool box, and then move back to your data. Practice, and more practicewill build that squishy neural network in your head.


Descending from soapbox,
Dale Tronrud


On 12/1/2020 8:31 AM, Robert Nicholls wrote:

Dear all,
I feel the need to respond following last week’s critique of the use ofCoot’s map blurring tool for providing diagnostic insight and aidingligand identification…
On 24 Nov 2020, at 16:02, Dale Tronrud <[email protected]<mailto:[email protected]>> wrote:
To me, this sounds like a very dangerous way to use this tool decideif a ligand has bound. I would be very reluctant to modify my mapwith a range of arbitrary parameters until it looked like what Iwanted to see. The sharpening and blurring of this tool is not guidedor limited by theory or data.
I disagree with this, subject to the important qualification that careis needed with interpretation. Blurring isn't a crime - it merelyinvolves adjusting the weighting given to lower versus higher resolutionreflections, and thus allows relaxation of the choice of high-resolutionlimit, and facilitates local investigation of regions that exhibit apoor signal-to-noise ratio. This is particularly pertinent to ligandedcompounds, which are typically present with sub-unitary occupancies.
Coot's blurring merely involves convolution of the whole map with anisotropic 3D Gaussian, with a parameter (B-factor) to control thestandard deviation of the Gaussian. This corresponds to reweighting thestructure factors in order to give higher weight to lower-resolutionreflections. This approach is guided by a very simple theory: higherresolution structure factors (SFs) are typically noisier, with aworse signal-to-noise ratio than lower resolution SFs (due to increasederrors in both observed higher-resolution reflections and calculatedphases). Consequently, increasing the blurring B-factor reduces theeffect of the noisier higher-resolution SFs. This results in a map thatshould be more reliable, but at the expense of reduced structural detaildue to artificially reducing the effective resolution.
It should be noted that this does assume that lower resolutionreflections are more reliable than higher resolution ones. So, goodlow-resolution data quality and completeness is important.
Unfortunately, determination of an optimal B-factor parameter is notpresently automated. Consequently, users are currently expected to trialdifferent values in the Coot slider tool in order to maximiseinformation and gain, for want of a better word, intuition.Furthermore, due to the spatially heterogeneous nature of atomicpositional uncertainty in macromolecular complexes, it can be thatdifferent B-factor parameters are of optimal usefulness indifferent local regions of the map that exhibitdifferent signal-to-noise ratios. Such issues are on-going areas ofresearch.
The main problem is that interpretation is subjective. In difficultcases, it is necessary to obtain as much information and insight aspossible in order to gain a good intuition. If you can't see a ligand inthe "standard" maps, but you can see evidence for a ligand inblurred density (or difference density) maps of the various types, thenit means that careful exploration of those avenues is required.Any "evidence" from viewing such maps and map types should serve toguide intuition, and should be digested along with all otheravailable information. Such complementary maps should be seen asdiagnostics to gain intuition, rather than something that can be used asan unequivocal argument for ligand binding.
Ultimately, the presence of significant density in a blurred map meansthat there is something substantial present. Or in a blurred differencedensity that there is something missing from the current model. Thiscould be a missing ligand, or it could be a mismodelled region ofthe macromolecule, or it could be mismodelled solvent (in whichcase re-evaluating any solvent mask may be worthwhile). Ultimately it isdown to the practitioner to explore all potential explanations for anysuch behaviour, in order to maximise intuition and convincethemselves of the crystal's structural composition.
In some cases the presence of density in a blurred map might besufficient to convince the practitioner that it is worth pursinginvestigation of binding. This may take various forms: hypothesising anapproximate pose for the ligand; the nature of interactions in thestructural environment of the macromolecule; re-evaluation aftermodelling and refinement; or simply stating that there may be evidenceof binding. In many cases, the latter is the appropriate action, and, asRobbie quite rightly pointed out: "in a scientific setting this diggingis not to come to a strong conclusion, but only to see if you shouldpursue the project and do additional experiments".
On 24 Nov 2020, at 16:02, Dale Tronrud <[email protected]<mailto:[email protected]>> wrote:[...] to avoid bias in the interpretation of the results, all of thestatistical procedures are decided upon BEFORE the study is evenbegan. This protocol is written down and peer reviewed at the start.Then the study is performed and the protocol is followed exactly.[...] I would recommend that you decide what sort of map you think isthe best at showing features of your active site, based on theresolution of your data set and other qualities of your project,before you calculate your first Fourier transform. If you think aPolder map is the bee's knees then calculate a Polder map and livewith it. If you are convinced of the value of a FEM, or a Buster map,or a SA omit map, or whatever, calculate that map instead and livewith it.
I agree that such an approach would be more scientific, and I certainlyfind this idea very appealing. Whilst I hesitate to speak against such aphilosophy, I feel it is necessary to temper/balance this view bypitching a counterargument in the interests of pragmatism - in generalit's just not that practical. And perhaps propositions for revolution ofbest-practice policies within the field should be distinct from currentpractical recommendation, in the interests of avoidingpotential confusion for the student/user who simply wants a solutionthat they can apply to today's problems.Whilst it sounds like a nice ideal, in general it is difficult to knowwhich pathologies might be encountered (e.g. ambiguous density in thebinding site; twinning; modelling difficulties around a symmetry axis;multiple conformations; semi-disorder; post-translationalchemical modifications; radiation damage… the list goes on). It'scompletely acceptable for someone encountering a problem for thefirst time to explore what tools are available to guideany decision-making, in the hope of achieving the best model possible. Atypical user cannot be expected to outline a strategy for everyeventuality a priori - that sounds more like the design of an automatedpipeline, not advice that users should be expected follow.
In summary, it's unadvisable to put all eggs in one basket (of one typeof map, Polder or otherwise). If an experienced user likes a particulartool because it's worked well for them in the past, it doesn't mean thatthey shouldn't try other tools now (in this case: view other types ofmaps) the next time they encounter a problem. Especially given thattools in our field are still very much evolving over time. Differentapproaches may have more value and provide more insight in differentcircumstances.
Best regards,
Rob







------------------------------------------------------------------------

To unsubscribe from the CCP4BB list, click the following link:
https://www.jiscmail.ac.uk/cgi-bin/WA-JISC.exe?SUBED1=CCP4BB&A=1<https://www.jiscmail.ac.uk/cgi-bin/WA-JISC.exe?SUBED1=CCP4BB&A=1>


########################################################################

To unsubscribe from the CCP4BB list, click the following link:
https://www.jiscmail.ac.uk/cgi-bin/WA-JISC.exe?SUBED1=CCP4BB&A=1

This message was issued to members of www.jiscmail.ac.uk/CCP4BB, a mailing list 
hosted by www.jiscmail.ac.uk, terms & conditions are available at 
https://www.jiscmail.ac.uk/policyandsecurity/

Re: [ccp4bb] phenix.refine with ligand with ambiguous electron density

Reply via email to