Re: [ccp4bb] a challenge

James Holton Sat, 12 Jan 2013 08:25:48 -0800


Fair enough!

I have just now added DANO and I(+)/I(-) to the files. I'll be veryinterested to see what you can come up with! For the record, the phasestherein came from running mlphare with default parameters but exactlythe correct heavy-atom constellation (all the sulfur atoms in 3dko), andthen running dm with default parameters.

Yes, there are other ways to run mlphare and dm that give better phases,but I was only able to determine those parameters by "cheating"(comparing the resulting map to the right answer), so I don't think itis "fair" to use those maps.

I have had a few questions about what is "cheating" and what is notcheating. I don't have a problem with the use of sequence informationbecause that actually is something that you realistically would knowabout your protein when you sat down to collect data. The sequence ofthis molecule is that of 3dko:

http://bl831.als.lbl.gov/~jamesh/challenge/seq.pir

I also don't have a problem with anyone actually using an automationprogram to _help_ them solve the "impossible" dataset as long as theycan explain what they did. Simply putting the above sequence intoBALBES would, of course, be cheating! I suppose one could tryeliminating 3dko and its "homologs" from the BALBES search, but that, inand of itself, is perhaps relevant to the challenge: "what is the mostdistance homolog that still allows you to solve the structure?". That,I think, is also a stringent test of model-building skill.

I have already tried ARP/wARP, phenix.autobuild andbuccaneer/refmac. With default parameters, all of these programs failon both the "possible" and "impossible" datasets. It was only with somesubstantial tweaking that I found a way to get phenix.autobuild to crackthe "possible" dataset (using 20 models in parallel). I have not yetfound a way to get any automation program to build its way out of the"impossible" dataset. Personally, I think that the breakthrough might besomething like what Tom Terwilliger mentioned. If you build a goodenough starting set of atoms, then I think an automation program shouldbe able to take you the rest of the way. If that is the case, then itmeans people like Tom who develop such programs for us might be able touse that insight to improve the software, and that is something thatwill benefit all of us.

Or, it is entirely possible that I'm just not running the currentsoftware properly! If so, I'd love it if someone who knows better (suchas their developers) could enlighten me.


-James Holton
MAD Scientist

On 1/12/2013 3:07 AM, Pavol Skubak wrote:


Dear James,

your challenge in its current form ignores an important source
of information for model building that is available for your
simulated data - namely, it does not allow to use anomalous
phase information in the model building. In difficult cases on
the edge of success such as this one, this typically makes
the difference between building and not building.

If you can make the F+/F- and Se substructure available, we
can test whether this is the case indeed. However, while I
expect this would push the challenge further significantly,
most likely you would be able to decrease the Se incorporation
of your simulated data further to such levels that the anomalous
signal is again no longer sufficient to build the structure. And
most likely, there would again exist an edge where a small
decrease in the Se incorporation would lead from a model built
to no model built.

Best regards,

--
Pavol Skubak
Biophysical Structural Chemistry
Gorleaus Laboratories
Einsteinweg 55
Leiden University
LEIDEN  2333CC
the Netherlands
tel: 0031715274414 <tel:0031715274414>
web: http://bsc.lic.leidenuniv.nl/people/skubak-0

Re: [ccp4bb] a challenge

Reply via email to