I have had several people now ask me where the heck my "Crick-Magdoff" equation came from. So it looks like brevity has once again conquered clarity. So, those of you you enjoy my long emails read on. Those of you who do not, I direct you to the "Delete" button that should have come with your email client.

My little formula is a rough approximation, and it begins with the Crick-Magdoff equation, which you can find in Blundell and Johnson section 6.5 (page 161), paraphrased here:

rms(deltaI)/I_P = sqrt(2*N_H/N_P)*F_H/F_P

where:
deltaI   change in spot intensity due to heavy atom contribution
I_P      spot intensity with no heavy atom contribution
N_H   number of heavy atoms in cell
N_P   number of protein atoms in cell
F_H   structure factor of the heavy atom
F_P   average structure factor of the protein atom

I have always found it more useful to rearrange equations like this to include the values you might have on hand, such as the molecular weight. So, if we assume that the average protein atom weighs 14 amu and has 7 electrons in it, and that the average B-factor of protein and heavy atoms are similar, we can replace F_P with 7 and replace N_P with MW/14 where MW is the molecular weight of the protein in Daltons and call N_H the number of heavy atoms per protein. It is also not completely wrong to replace F_H with fpp.

If we then require that the rms change in intensity be greater than the average noise, then we can write down the requirement:
sigma(I_P) < rms(deltaI)

So, after doing these substitutions and rearranging, we get:

sigma(I_P)/I_P < sqrt(2*N_H/(MW/14))*fpp/7
sigma(I_P)/I_P < 0.756*sqrt(N_H/MW)*fpp

I_P/sigma(I_P) > 1.3*sqrt(MW/N_H)/fpp

There are some obvious approximations here. Probably the biggest is assuming that fpp = F_H. In actual fact, anomalous differences "count double" since fpp contributes both to F+ and F-. I think Peter Zwart pointed this out earlier. There is also another sqrt(2) in the opposite direction because sigma(delta-I) is the quadrature sum of two sigma(I_P). It also matters if you are interested in the rms anomalous difference or the mean absolute anomalous difference, as these are not the same thing. Nonetheless, I think this last formula should be accurate to at worst a factor of two. In general, it is a good idea to have your signal be more than equal to noise, so I consider this formula a limit to be avoided rather than a goal to be met. The skill and expertise required to solve the structure increases quite sharply as your I/sigma(I) approaches this limit, but you can always double I/sigma(I) by merging data from four crystals. The latter is a better strategy.

-James Holton
MAD Scientist

Reply via email to