Re: [ccp4bb] mosflm gain

James Holton Tue, 08 Mar 2011 12:08:43 -0800

Andrew! You don't believe me? Well, I suppose it serves me right fornot explaining where the idea came from (see below).

I do, however, agree with Andrew's assessment that the default-chosengain in MOSFLM is adequate for all practical purposes. Any error inGAIN will be almost exactly compensated for by a corresponding change inSdfac in SCALA, and the final value of sigma(I) will be essentially thesame. The only possible difference will be in the sigma-based outlierrejection within MOSFLM, but since the typical errors in the sigma areonly ~30%, I predict it will be hard to find a situation where thismakes or breaks a structure determination.

So, by way of explanation: there are three things that led me to thisconclusion:

1) the control: fake data with all pixels independent.

adjusting the GAIN as MOSFLM recommends from the BGRATIO analysisdoes, in fact, reproduce the "correct" value of the gain used togenerate the fake data. In SCALA, Sdfac refines to ~1.0, SdB refines to0, and Sdadd refines to the actual magnitude of fractional error(introduced by beam flicker, shutter jitter, etc.). No surprises here.2) "blur" the fake data with the point-spread function (PSF) empiricallyderived for my detectorIn this case, the "MOSFLM-refined gain" is too low. In SCALA,Sdfac refines to ~1.3, SdB refines to 3-5, and Sdadd is a bit low.These parameters are about what I see processing good real data.3) use real data, but force MOSFLM to use the GAIN calibratedindependently for the detectorMOSFLM grumbles a lot about the BGRATIO. In SCALA, Sdfac refines to~1, and SdB refines to ~0. Sdadd is consistent with myindependently-measured fractional error sources.

Now, I have not evaluated this approach on a huge number of data sets,but in this case the PSF was both necessary and sufficient to explainthe "mystery of SdB". That is: the need for SdB arises because using an"incorrect" gain creates a correlation between Sdfac and Sdadd. Iimagine there are other ways to get a non-zero SdB as well, but for"good data" I suspect this is the dominant mechanism. I never wrotethis up because I am fairly certain the article would do nothing toimprove the impact factor of the journal in which it was published, butthis anecdote might perhaps be useful to Andrew, Phil, and a few otherreaders of this list.


-James Holton
MAD Scientist


On 3/7/2011 2:00 AM, A Leslie wrote:

I have to say that I don't fully agree with James' recommendation toadjust the GAIN in MOSFLM until the calculated SDFAC parameter inSCALA is 1.0.
(Background information, the sigmas from Mosflm sd(I) are corrected inSCALA according tosd(I) corrected = SdFac * sqrt{sd(I)**2 + SdB*Ihl +(SdAdd*Ihl)**2}in order to get the best agreement between corrected sigmas and theobserved differences between symmtery/Friedel related intensities)
While I fully agree with his argument that systematic errors such asabsorption, etc give an error proportional to the intensity, andtherefore should be corrected by the SDADD term rather than SDFAC, inany "real world" data set that I have come across the situation is notso simple. Indeed, according to the usual treatment of errors thereshould be no need for the SDB term in SCALA, but in practice it isessential to have this term to be able to match corrected sigmas withthe observed differences between symmetry related reflections. It alsoturns out that the three variable parameters SDFAC, SDB and SDADD arehighly correlated, so one can get rather different values for anyindividual parameter from very similar datasets. Radiation damage iscertainly one source of error which would not be expected to follow asimple error model, or non-isomorphism if multiple crystals have beenused.
Phil Evans is not entirely happy with the behaviour of the refinementof these parameters and is in fact currently looking at this, butthere is a basic problem here that one is trying to use a simpleerror model for a situation where (for whatever reason) it does notreally apply.
The sigma estimates from MOSFLM are only intended to give an estimateof the random error in the intensities. In my opinion, trying toaccount for systematic errors is best done at the point of merging thedata where much more information is available (ie symmetry relatedmeasurements).
I would be most interested to hear of any examples where the defaultvalue of the GAIN in MOSFLM is clearly wrong, but to the best of mycurrent knowledge the default GAIN is perfectly adequate.
Best wishes

Andrew
On 4 Mar 2011, at 19:47, James Holton wrote:
I have found that the best way to get the GAIN "right" in MOSFLM isto have a look at the optimum "Sdfac" parameter at the end of SCALA(the first of the three SDCORRection values). Specifically, if SDFacis > 1, then you need to increase the GAIN. This is because SDFac>1means that the spots were noisier than MOSFLM thought they should be,and if a given number of ADU is noisier than expected, then theremust have been fewer photons involved in generating the signal. Thismeans that the "true gain" was higher. Yes, there are other sourcesof error, like shutter jitter, beam flicker, calibration errors,absorption effects, scale factor errors, etc. But these are alldirectly proportional to the intensity, and therefore accounted forby adjusting SDadd (the last of the three SDCORR values). SDfacaccounts for noise proportional to the square root of intensity, andonly shot noise (like photon counting) behaves like that.
David Waterman makes an excellent point that the point-spreadfunction (PSF) acts like a smoothing filter and makes the backgroundlook less noisy than photon-counting error permits. This makes theBGRATIO-estimated GAIN lower than the "true" GAIN. However, one canargue that this is not always a bad thing, since the error inmeasuring the intensity of a given area of flat background really is"better than photon counting". This is because you have thesmoothing effect of the PSF working "for you": bringing in signalfrom areas outside the region you are measuring (prior knowledge of"flatness" if you will). However, this smoothing effect of the PSFdoes not apply to spots because spot photons all arrive inessentially the same place, and no "smoothing" will change theintrinsic noise of the total number of photons that actuallyarrived. The upshot of this is that we really need two differentvalues for GAIN, one for the background and one for thebackground-subtracted spot intensity. The influence on sigma(I)would depend on the relative contributions from the spot vs thebackground under it. I am pretty sure this is not implemented.
It is perhaps interesting that there is also a third type of noisewhich is independent of the spot intensity: "read-out noise". Thisused to be called "fog" on film detectors. Despite all the money wespend on detectors that minimize it, there is no specific accountingfor read-out noise in MOSFLM or any other integration package I amaware of. However, a "trick" to account for it is to simply lowerthe ADCOFFSET. For example, using 1 A X-rays on an ADSC Q315rdetector in hwbin mode, the true GAIN is 1.8 ADU/photon, theADCOFFSET is 40 ADU, and the read-out noise is equivalent to thenoise deposited by ~2 photon/pixel of x-ray background. This meansthat a blank image has an average value of 40 ADU and rms variationof ~2.5 ADU, but this is equivalent to an image from a detector withthe same gain, no read-out noise, and ADCOFFSET of 36 that was"fogged" by 2 photons/pixel (regardless of exposure time). Yes, thisis a small change in ADCOFFSET, and I doubt you will notice thedifference. I think this speaks to the fact that, on moderndetectors at least, read-out noise is essentially negligible.
Another way to get the GAIN, of course, is to measure it directly. Idid this on an ADSC Q315 detector in swbin mode by comparison to aNaI:Tl scintillator (after accounting for the window and sensorthickness of the latter device):
http://bl831.als.lbl.gov/~jamesh/pickup/Q315_gain.png
You can see how the GAIN changes appreciably with photon energy, andthis is largely because lower-energy photons generate less signal.GAIN also changes with the detector read-out mode. For example, thisnumber is 3 times higher for a Q315r in hwbin mode. I have listed mybest information on the typical GAIN and read-out noise of commondetectors on my "minimum crystal size" page here:
http://bl831.als.lbl.gov/xtalsize.html
You can extract the parameters by selecting the "detector type = "you want, and then switching it again to "Custom..."
-James Holton
MAD Scientist

On 3/3/2011 12:34 PM, Bryan Lepore wrote:
wondering if mosflm can automatically estimate the gain.

i.e. i gather it is still estimated the usual way.

-Bryan

Re: [ccp4bb] mosflm gain

Reply via email to