Thanks, I will try this.
Also, on the CASP website there are scores such as RMS_ALL (visible in the
tables) and GDC_SC (for side-chains, not visible in the tables for some reason).
RMS_ALL presumably includes side-chains and seems good for AlphaFold2
models, between 1 and 2 Angstrom (apart from the same outliers as
RMS_CA), although that is not quite at the experimental level.
Were any side-chain-aware scores included in the ranking/evaluation (we
hear mostly about GDT_TS)?
If not, how can "experimental level" precision be claimed?
Thanks,
Leonid
On 11.12.20 13:56, Tristan Croll wrote:
I agree the website can be quite cryptic!
You can get all the targets as a tarball from
https://predictioncenter.org/download_area/CASP14/targets/. For the
predictions, you can either get them as PDB files on a case-by-case
basis from the results section, or as tarballs of all predictions for a
given target from
https://predictioncenter.org/download_area/CASP14/predictions_trimmed_to_domains/.
In the latter case, each file is essentially a PDB file without the
.pdb extension, except with 4 lines added to the front looking
something like:
PFRMAT TS
TARGET T1049
MODEL 2
PARENT N/A
Depending on your choice of viewer, you may need to remove these lines
before attempting to open it.
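If you'd rather strip them in bulk, a few lines of Python suffice (a minimal
sketch of my own, not an official tool; it simply keeps the standard PDB
coordinate/terminator records and drops everything else, including those
four header lines):

```python
def strip_casp_header(lines):
    """Keep only standard PDB coordinate/terminator records, dropping the
    CASP TS header lines (PFRMAT, TARGET, MODEL n, PARENT ...)."""
    # "END" also matches ENDMDL, which is a legitimate PDB record
    keep = ("ATOM", "HETATM", "TER", "END")
    return [ln for ln in lines if ln.startswith(keep)]
```

Run it over each downloaded file and write the result out with a .pdb
extension; any standard viewer should then open it without complaint.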
The GDT_TS score only considers alpha carbons, so in principle it /is/
possible to get a high score on it while still having a model that's
rubbish in every other respect. It's certainly worth complementing it
with other scores - e.g. good old MolProbity, or SphereGrinder. The
latter is quite good in principle - essentially, it places a 6 A
radius sphere at each CA atom of the target, finds all heavy atoms in
the sphere, and measures their RMSD to the corresponding atoms in the
prediction. The actual implementation for CASP is a bit broad-brush,
though - the score is just the fraction of spheres whose RMSD is under
2 A.
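The idea is simple enough to sketch in a few lines of NumPy (a simplified
illustration of the principle, not the CASP code; it assumes the target and
model heavy atoms are already matched one-to-one and superposed, and skips
implementation details such as how each sphere is fitted):

```python
import numpy as np

def sphere_grinder(target_xyz, model_xyz, ca_idx, radius=6.0, rmsd_cut=2.0):
    """SphereGrinder-style score: fraction of 6 A spheres (centred on
    target CA atoms) whose local heavy-atom RMSD is below 2 A.

    target_xyz, model_xyz : (N, 3) arrays of corresponding heavy atoms
    ca_idx : indices into those arrays marking the target CA atoms
    """
    ok = 0
    for i in ca_idx:
        # heavy atoms inside the sphere around this target CA
        d = np.linalg.norm(target_xyz - target_xyz[i], axis=1)
        sel = d <= radius
        diff = target_xyz[sel] - model_xyz[sel]
        rmsd = np.sqrt((diff ** 2).sum(axis=1).mean())
        ok += rmsd < rmsd_cut
    return ok / len(ca_idx)
```

Because each sphere is scored pass/fail against a single 2 A threshold, the
final number is indeed rather broad-brush, as noted above.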
In the last CASP round I pushed for the need to start adding metrics
that directly compared the models in torsion space - far from the
first time that's been suggested, but it's arguably only in the past
few rounds that models have gotten good enough for this to be a useful
discriminating measure. It doesn't appear that this has been added to
the standard measures for CASP14, but if it had I can see that
AlphaFold2 would have done extremely well - I only showed the ribbon
representation for T1049 in my last email, but the sidechains in the
core show pretty amazing agreement with the target.
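A torsion-space comparison needs little more than the standard four-point
dihedral formula plus a wrap-aware angle difference; a minimal sketch
(function names are mine, not from any CASP tool):

```python
import numpy as np

def dihedral(p0, p1, p2, p3):
    """Torsion angle in degrees defined by four points (atan2 form of the
    standard dihedral formula)."""
    b1, b2, b3 = p1 - p0, p2 - p1, p3 - p2
    n1, n2 = np.cross(b1, b2), np.cross(b2, b3)
    m = np.cross(n1, b2 / np.linalg.norm(b2))
    return np.degrees(np.arctan2(np.dot(m, n2), np.dot(n1, n2)))

def torsion_diff(a, b):
    """Smallest absolute difference between two angles in degrees,
    correctly wrapping across the +/-180 boundary."""
    return abs((a - b + 180.0) % 360.0 - 180.0)
```

Applied per chi angle over matched side-chains, the distribution of
torsion_diff values would give exactly the kind of discriminating measure
argued for above.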
Best regards,
Tristan
------------------------------------------------------------------------
*From:* Leonid Sazanov <saza...@ist.ac.at>
*Sent:* 11 December 2020 12:32
*To:* Tristan Croll <ti...@cam.ac.uk>; CCP4BB@JISCMAIL.AC.UK
<CCP4BB@JISCMAIL.AC.UK>
*Subject:* Re: [ccp4bb] External: Re: [ccp4bb] AlphaFold: more
thinking and less pipetting (?)
I see, thanks, that looks good.
Where can one download predicted_model+exp_model PDBs together?
I could easily find the predicted models but not the experimental ones -
the CASP website seems very cryptic.
Also, can you comment on how much GDT_TS depends on CA and how much on
side chains positioning?
E.g. if it is >90, can one be sure that most side-chains are in the
right place?
Thanks.
Leonid
On 11.12.20 13:12, Tristan Croll wrote:
I'm not Randy, but I do have an answer: like this. This is T1049-D1.
AlphaFold prediction in red, experimental structure (6y4f) in green.
Agreement is close to perfect, apart from the C-terminal tail which
is way off - but clearly flexible and only resolved in this
conformation in the crystal due to packing interactions. GDT_TS is
93.1; RMS_CA is 3.68 - but if you exclude those tail residues, it's
0.79. With an alignment cutoff of 1 A, you can align 109 of 134 CAs
with an RMSD of 0.46 A.
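Those numbers illustrate why GDT_TS and plain RMSD can disagree so sharply:
GDT_TS averages the fractions of CA atoms within 1, 2, 4 and 8 A of the
target, so a few wildly displaced tail residues cost a few percent rather
than dominating the score the way their squared deviations dominate an RMSD.
A sketch on pre-superposed CA coordinates (a simplification: the real GDT
algorithm searches over many superpositions to maximise each fraction, so
this single-superposition value is only a lower bound):

```python
import numpy as np

def gdt_ts(target_ca, model_ca):
    """GDT_TS-style score on pre-superposed (N, 3) CA coordinate arrays:
    the mean, over the cutoffs 1/2/4/8 A, of the percentage of CA atoms
    within that cutoff of their target positions."""
    d = np.linalg.norm(target_ca - model_ca, axis=1)
    return 100.0 * np.mean([(d <= c).mean() for c in (1.0, 2.0, 4.0, 8.0)])
```

With one residue in ten displaced by 50 A, this still scores 90 - while the
corresponding RMSD would be about 16 A.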
------------------------------------------------------------------------
*From:* CCP4 bulletin board <CCP4BB@JISCMAIL.AC.UK> on behalf of
Leonid Sazanov <saza...@ist.ac.at>
*Sent:* 11 December 2020 10:36
*To:* CCP4BB@JISCMAIL.AC.UK
*Subject:* Re: [ccp4bb] External: Re: [ccp4bb] AlphaFold: more
thinking and less pipetting (?)
Dear Randy,
Can you comment on why, for some of the AlphaFold2 models with GDT_TS > 90
(supposedly as good as an experimental model), the RMS_CA (backbone) is >
3.0 Angstrom? Such a deviation can hardly be described as being as good as
experimental. Could it be that GDT_TS is designed to evaluate how well the
general sub-domain-level fold is predicted, rather than the overall detail?
Thanks,
Leonid
>>>>>
Several people have mentioned lack of peer review as a reason to
doubt the significance of the AlphaFold2 results. There are
different routes to peer review and, while the results have not been
published in a peer-reviewed journal, I would have to say (as someone
who has been an assessor for two CASPs, as well as having editorial
responsibilities for a peer-reviewed journal), the peer review at
CASP is much more rigorous than the peer review that most papers
undergo. The targets are selected from structures that have recently
been solved but not published or disseminated, and even just tweeting
a C-alpha trace is probably enough to get a target cancelled. In
some cases (as we’ve heard here) the people determining the structure
are overly optimistic about when their structure solution will be
finished, so even they may not know the structure at the time it is
predicted. The assessors are blinded to the identities of the
predictors, and they carry out months of calculations and inspections
of the models, computing ranking scores before they find out who made
the predictions. Most assessors try to bring something new to the
assessment, because the criteria should get more stringent as the
predictions get better, and they have new ideas of what to look for,
but there’s always some overlap with “traditional” measures such as
GDT-TS, GDT-HA (more stringent high-accuracy version of GDT) and lDDT.
Of course we’d all like to know the details of how AlphaFold2 works,
and the DeepMind people could have been (and should be) much more
forthcoming, but their results are real. They didn’t have any way of
cheating, being selective about what they reported, or gaming the
system in any other way that the other groups couldn’t do. (And yes,
when we learned that DeepMind was behind the exceptionally good
results two years ago at CASP13, we made the same half-jokes about
whether Gmail had been in the database they were mining!)
Best wishes,
Randy Read
########################################################################
To unsubscribe from the CCP4BB list, click the following link:
https://www.jiscmail.ac.uk/cgi-bin/WA-JISC.exe?SUBED1=CCP4BB&A=1
This message was issued to members of www.jiscmail.ac.uk/CCP4BB, a mailing
list hosted by www.jiscmail.ac.uk; terms & conditions are available at
https://www.jiscmail.ac.uk/policyandsecurity/
--
Prof. Leonid Sazanov FRS
IST Austria
Am Campus 1
A-3400 Klosterneuburg
Austria
Phone: +43 2243 9000 3026
E-mail: saza...@ist.ac.at