Re: [ccp4bb] AW: [ccp4bb] AW: [ccp4bb] AW: [EXTERNAL] Re: [ccp4bb] number of frames to get a full dataset?

2020-07-03 Thread Gerard Bricogne
Dear Herman and David,

 This thread seems inexhaustible :-) .

 On the matter of "measurement" vs. "observation", we seem again to be
in a situation described by the British idiom "half of one and half-a-dozen"
of the other, i.e. distinct but synonymous terms between which a choice is
quite indifferent.

 In the work on STARANISO and the documentation of that work, a
distinction had to be made between the two terms, for which readers are
referred to Ian's carefully crafted material at 

http://staraniso.globalphasing.org/anisotropy_about.html

and 

http://staraniso.globalphasing.org/staraniso_glossary.html

Here, a measurement is a number plucked out of examining the raw data,
namely an integrated intensity obtained by considering the pixel values
around the position in 3D reciprocal space predicted from an indexing
solution. The next step is to determine whether this qualifies as an
observation, in the sense of containing information that a structural model
would be expected to comply with. This determination is carried out by
computing a local average of I/sig(I) through reciprocal space and applying
a cut-off criterion based on a threshold value for that local average. Other
criteria can be considered, and are indeed offered by the program as
alternatives. Measurements complying with this selection criterion are then
called "observations". In this picture, an observation is defined as a
significant measurement. This basic distinction of vocabulary is then
extended to talking about "unmeasured" reflections (for which there weren't
any detector pixels to catch any photons at their predicted position - e.g.
in gaps between detector modules) and "unobserved" reflections (that are
unmeasured but for which the analysis of the I/sig(I) distribution predicts
that they would have been significant, had they been measured - e.g. in
cusps or missing angular ranges, as well as in module gaps etc.). The
display of the latter as blue dots in the STARANISO Reciprocal Lattice
Viewer then gives a vivid picture of the inadequacies of the experimental
protocol used, in failing to catch all the significant diffraction from the
sample.

 This being said, things could very well had been done the other way,
saying that the blindly integrated intensity was an observation, and that
the subsequent analysis was intended to determine whether you had really
measured something significant (i.e. a useful integrated intensity) by
making that observation. We were aware of this ambivalence, but felt that we
had to comply with the boundary condition that what we ended up with, after
conversion to an amplitude, had to be denoted "Fobs" ;-) . If the early
crystallographers had used the notation "Fmeas" for what they considered as
their experimental data, the choice of terminology would definitely have
gone the other way.

 As Graeme said, use the terminology you want, but document exactly what
you mean by it. The two URLs quoted above (especially the second) show that
this suggestion was conscientiously followed by the STARANISO developers.


 With best wishes,

  Gerard,

--
On Fri, Jul 03, 2020 at 10:22:43AM +, Schreuder, Herman /DE wrote:
> Dear David,
> 
> Thank you for your reaction. It has become clear to me that although most 
> people understand what I intended with “measurement”, in practice it is very 
> much in the eye of the beholder. It was suggested in the BB to use 
> observation instead, but I am fairly sure that some people will also have 
> issues with that.
> 
> The advantage of multiplicity/redundancy is that it does not mention what is 
> multiple or redundant and that one can refer to the program documentation for 
> an exact definition. Since most people are happy with the 
> multiplicity/redundancy they grew up with, that is the way it will stay.
> 
> Best regards,
> Herman
> 
> 
> 
> 
> Von: David Waterman 
> Gesendet: Freitag, 3. Juli 2020 10:49
> An: Schreuder, Herman /DE 
> Cc: CCP4BB@jiscmail.ac.uk
> Betreff: Re: [ccp4bb] AW: [ccp4bb] AW: [EXTERNAL] Re: [ccp4bb] number of 
> frames to get a full dataset?
> 
> 
> EXTERNAL : Real sender is dgwater...@gmail.com<mailto:dgwater...@gmail.com>
> 
> Hi Herman,
> 
> I like the idea of MPR, but I continue to worry about the term "measurement". 
> The intensity associated with a particular reflection is a fit based on a 
> scaling model, and ultimately, depending on your integration software, may be 
> linked to a weighted sum of two raw measurements: the summation and 
> profile-fitted intensities. I think these are the measurements, not the 
> intensity derived during the scaling procedure. Sure, anyone who wants to be 
> even more pedantic than me will point out that these "raw measurements" are 
>

Re: [ccp4bb] AW: [ccp4bb] AW: [ccp4bb] AW: [EXTERNAL] Re: [ccp4bb] number of frames to get a full dataset?

2020-07-03 Thread John R Helliwell
Dear Colleagues,
Now that Herman has announced a quietude I thought you might enjoy this quite 
short report on a synchrotron radiation issue that came up some years back via 
the JSR Main Editors into the IUCr Nomenclature Committee, chaired by Andre 
Authier, Past President of the IUCr:-
https://journals.iucr.org/s/issues/2005/03/00/es0344/es0344.pdf
Have a great weekend,
John 
Emeritus Professor John R Helliwell DSc




> On 3 Jul 2020, at 11:22, Schreuder, Herman /DE  
> wrote:
> 
> 
> Dear David,
>  
> Thank you for your reaction. It has become clear to me that although most 
> people understand what I intended with “measurement”, in practice it is very 
> much in the eye of the beholder. It was suggested in the BB to use 
> observation instead, but I am fairly sure that some people will also have 
> issues with that.
>  
> The advantage of multiplicity/redundancy is that it does not mention what is 
> multiple or redundant and that one can refer to the program documentation for 
> an exact definition. Since most people are happy with the 
> multiplicity/redundancy they grew up with, that is the way it will stay.
>  
> Best regards,
> Herman
>  
>  
>  
>  
> Von: David Waterman  
> Gesendet: Freitag, 3. Juli 2020 10:49
> An: Schreuder, Herman /DE 
> Cc: CCP4BB@jiscmail.ac.uk
> Betreff: Re: [ccp4bb] AW: [ccp4bb] AW: [EXTERNAL] Re: [ccp4bb] number of 
> frames to get a full dataset?
>  
> EXTERNAL : Real sender is dgwater...@gmail.com
> 
>  
> 
> Hi Herman,
>  
> I like the idea of MPR, but I continue to worry about the term "measurement". 
> The intensity associated with a particular reflection is a fit based on a 
> scaling model, and ultimately, depending on your integration software, may be 
> linked to a weighted sum of two raw measurements: the summation and 
> profile-fitted intensities. I think these are the measurements, not the 
> intensity derived during the scaling procedure. Sure, anyone who wants to be 
> even more pedantic than me will point out that these "raw measurements" are 
> also the result of fitting procedures. However, to my eyes, the difference is 
> that we don't consider the profile and summation integrated intensities to 
> change as a result of the procedure that ultimately determines the statistic 
> (MPR) of interest. During that procedure they are independent, not dependent 
> variables.
>  
> Maybe I am worrying about nothing. It agree it is fairly clear what you mean 
> by MPR. I just wanted to explore if there was any opportunity for further 
> reducing ambiguity.
>  
> Cheers
> -- David
>  
>  
> On Fri, 3 Jul 2020 at 08:12, Schreuder, Herman /DE 
>  wrote:
> Dear Ian,
>  
> Since some very advanced countries still use miles, Fahrenheit and inches, I 
> did not expect anything to change. It was an escalating discussion in this 
> thread on data completeness(!) on the use of multiplicity vs redundancy that 
> made me suggest a different term. Except for an occasional discussion in the 
> BB, there is nothing against people using the term they are most comfortable 
> with.
>  
> However, I insist that trying to impose a different definition of 
> “measurement” for MPR vs the definition used for the calculation of 
> redundancy/multiplicity is not a valid argument against MPR.
>  
> Cheers,
> Herman
>  
>  
>  
>  
> Von: CCP4 bulletin board  Im Auftrag von Ian Tickle
> Gesendet: Donnerstag, 2. Juli 2020 22:06
> An: CCP4BB@JISCMAIL.AC.UK
> Betreff: Re: [ccp4bb] AW: [EXTERNAL] Re: [ccp4bb] number of frames to get a 
> full dataset?
>  
> EXTERNAL : Real sender is owner-ccp...@jiscmail.ac.uk
> 
>  
> 
>  
> Well I very much doubt that many software developers are going to trawl 
> through all their code, comments, output statements & documentation to change 
> 'redundancy' or 'multiplicity' to 'MPR' or whatever terminology is agreed on 
> (assuming of course we do manage to come to an agreement, which I doubt).  
> And good luck with persuading wwPDB to change 'redundancy' in their mmCIF 
> dictionary!  That would be not only pointless but also a lot of work, partly 
> because terms get abbreviated in code and in outputs (e.g. to 'redund' in 
> mine, or 'mult').  And don't say I can keep the code & comments the same and 
> only change the outputs and documentation: that will really tax my brain!  
> Also don't say this need only apply to new code: no code is ever completely 
> new, and mixing up old & new terminology would be a disaster waiting to 
> happen!  Also it won't end there: someone will always find terminology that 
> they disagree with: I can think of plenty cans of worms that we could open, 
> but I think one is already on