Hi Leonard

Yes it wasn't at all clear from Stefan's announcement where the exact
problem/inconsistency between STARANISO and MorDa lies.  We would need to
get hold of some data that exhibits this behaviour so that we can reproduce
the problem ourselves and perform a more detailed analysis of what's going
on.  Without this more detailed information one probably shouldn't
speculate as to the exact cause; however my initial thought is that this
may be the exact same issue that Andrew Leslie raised a while back where
the anisotropically-cut data are refilled with DFc by the structure factor
/ refinement program for 2mFo-DFc map calculation.  Whenever a user
successfully runs a job on the STARANISO server and downloads the output
files (s)he receives this warning message at the very top of the page:

*"WARNING:* We are grateful to Andrew Leslie (MRC-LMB) for identifying a
potentially undesirable 'feature' of CCP4/REFMAC (and possibly other
refinement software), namely the 'fill-in' of unobserved data with *DF*calc
which makes it effectively *incompatible* with the MTZ file output by
STARANISO.  Our advice is therefore to use *BUSTER
<http://www.globalphasing.com/buster>* for refinement using the downloaded
MTZ file, unless the user can find a way to disable that feature in the
alternative software.  In general, any treatment of STARANISO's output data
that adds missing reflections out to the resolution limit may result in a
very high proportion of unobserved data compared with data that have had a
cut-off applied according to an isotropic criterion.  Where missing
reflections have been added in this way the user should take care to verify
that any applications used to process the data subsequently can handle it
correctly."

Following the addition of that caveat to the server, I'm aware that Refmac
now has an option to skip this fill-in of missing data by DFc; however not
all CCP4 tasks that incorporate Refmac in their pipeline may have
implemented this option.  This problem in fact stems from the use of the
'uniqueify' script on the _output_ MTZ from STARANISO, thus filling in
reflections out to the nominal isotropic resolution limit.  Of course the
whole point of this exercise is that the resolution limit is anisotropic,
not isotropic, so one should not be surprised if things go wrong if
incorrect assumptions are made!  The simple solution is to run 'uniqueify'
_before_ running STARANISO; this will apply the anisotropic cut-off and
transfer Rfree flags of the non-cut data to the output file thus preventing
fill-in of missing map coefficients by DFc.

For obvious reasons we would be very keen to resolve this issue as quickly
as possible!

Finally I agree with Pavel that there's no such thing as bad publicity!

Best wishes

-- Ian


On 24 November 2017 at 00:07, CHAVAS Leonard <
leonard.cha...@synchrotron-soleil.fr> wrote:

> Dear Gerard
>
> I should certainly be the one to blame, as I was the one spotting this
> behaviour to Stefan. Telling this, it could indeed be a problem with the
> data set, yet submitting the original mtz to ContaMiner provided with the
> strangely hoped and expected ContaMiner negative result. I made few tests
> afterwards, quick ones, with other mtz files treated or not with Staraniso,
> and although the direct results were not exactly the same as the extreme
> 'all positive' results noted earlier, it behaved strange (all 'possible
> positive', while standard non-contaminant data give you all 'negative'
> results).
>
> I then ran MorDa standalone and found the same behaviour: for the first
> trial, it did found a solution... while there were none. Interestingly,
> Rfac/Rfree in the resulting MorDa solved structure was something like
> 50/20... I strongly believe that MorDa, and hence ContaMiner, scores its
> results on the Rfree value. Errors could have been spotted by looking at
> the ratio, which clearly was showing that something wrong is happening,
> operation either on MorDa or on ContaMiner side (or even both).
>
> Additional investigations are obviously welcome, however and for now, I
> guess the message from Stefan was more in the idea to warn people that
> having too many positive hits for one single data probably means something
> is wrong with the ContaMiner/MorDa processing, more than something is wrong
> with the dataset itself. That goes without saying that something could go
> wrong if the dataset is wrong, obviously...
>
> To Gerard and good willing developers: would you need the badly behaving
> data, please do let me know and I shall send it to you offline.
>
> Cheers, leo
>
> -
> Leonard Chavas
> -
> Synchrotron SOLEIL
> Proxima-I
> L'Orme des Merisiers
> Saint-Aubin - BP 48
> 91192 Gif-sur-Yvette Cedex
> France
> -
> Phone:  +33 169 359 746
> Mobile: +33 644 321 614
> E-mail: leonard.cha...@synchrotron-soleil.fr
> -
>
> > On 23 Nov 2017, at 23:53, Gerard Bricogne <g...@globalphasing.com>
> wrote:
> >
> > Dear Pierre,
> >
> >     Thank you for throwing some light on the circumstances under
> > which the "suspicious" results cropped up :-) . Was the "Government
> > Heatlh Warning" based on this one and only mtz file, then?
> >
> >     We would certainly be interested in examining this dataset for
> > any anomalies in the anisotropy analysis and the mtz file it produced
> > (perhaps you can simply give us the job ID on the server). However the
> > "results" consist of the spurious recognition of molecules that are
> > known not to be in the crystal, so they are the outcome of numerous
> > unspecified steps downstream of the anisotropy analysis itself, that
> > in the end produce a "score" that is misleading. It would be useful to
> > have at least some idea of what those steps are in order to identify
> > the possible causes of the erroneous detections.
> >
> >
> >     With best wishes,
> >
> >          Gerard.
> >
> > --
> > On Thu, Nov 23, 2017 at 10:05:09PM +0000, LEGRAND Pierre wrote:
> >> Dear Gérard,
> >>
> >> As being part of the group how has initially raised the issue to
> Stefan, I stand out to try clarifying a misinterpretation.
> >> In brief, because we are happy users of StarAniso, it happened that we
> have submitted an mtz that it had produced to the ContaMiner server. We
> were very surprised to find that almost all contaminants evaluated gave
> high scores according to MoRDa. On the contrary, using an "isotropicaly"
> treated mtz, no hit could be detected in by ContaMiner.
> >>
> >> Being collected on PROXIMA-1 beamline, it coun't be a "badly collected
> datasets" ;-)
> >>
> >> So, most probably, as you suggested, the issue is linked to some bad
> assumptions made by MoRDa or inadequate criteria choices _in_the_cases_ of
> "corrected" anisotropic data.
> >> We can probably provide some examples of datasets to the developers
> willing to pursue investigations on this.
> >>
> >> I completely agree with Tristan! I have to admit having lost several
> weeks in my career (if not months) with "contaminated" crystals. And
> working on an MX beamline, I can testify that this is unfortunately
> happening regularly.
> >>
> >> I will finish with a big thanks to all the (StarAniso/ContaMiner/MoRDA)
> developers how are helping us to untwist the unavoidable experimental
> mess/reality.
> >>
> >> Kind regards,
> >> Pierre
> >> ________________________________________
> >> De : CCP4 bulletin board [CCP4BB@JISCMAIL.AC.UK] de la part de Gerard
> Bricogne [g...@globalphasing.com]
> >> Envoyé : jeudi 23 novembre 2017 19:34
> >> À : CCP4BB@JISCMAIL.AC.UK
> >> Objet : Re: [ccp4bb] new ContaMiner features
> >>
> >> Dear Stefan,
> >>
> >>     Regarding your final paragraph: your server carries a warning
> >> with the exact wording:
> >>
> >>     "Submitting StarAniso files can give you suspicious results. Use
> >> with care!"
> >>
> >>     It seems rather regrettable that you are posting such a public
> >> warning without ever having contacted the STARANISO developers about
> >> your observations, nor giving any information about what you call
> >> "suspicious" or what the "care" you recommend would consist of.
> >>
> >>     We have taken a great deal of care ourselves in developing the
> >> program and offering it to the community through a server, and the
> >> least we would have expected is that any pattern of "suspicious"
> >> results would be referred to us so that we could investigate them.
> >> There may be some assumptions made in MoRDa that we are not aware of,
> >> that might be incompatible with assumptions made in STARANISO - who
> >> knows? Or it could be that some particularly badly collected datasets
> >> are made to look worse after their anisotropy analysis.
> >>
> >>     Could we discuss your observations, and what it is exactly that
> >> you call "suspicious", before they end up being referred to in such an
> >> uninformative manner as some sort of "Government Health Warning"?
> >>
> >>     I think that would be nice :-) and we would be only too keen to
> >> take whatever extra "care" is needed ourselves. We would all learn
> >> something.
> >>
> >>
> >>     With best wishes,
> >>
> >>          Gerard.
> >>
> >> (on behalf of the STARANISO developers)
> >>
> >> --
> >> On Thu, Nov 23, 2017 at 05:02:39PM +0100, Stefan Arold wrote:
> >>> Dear Community,
> >>>
> >>> A quick message to announce the following two new features on our
> >>> ContaMiner web server for the automated detection of unwantedly
> >>> crystallised contaminants (
> >>> https://strube.cbrc.kaust.edu.sa/contaminer/submit)
> >>>
> >>> 1) online visualisation of 2FoFc and FoFc maps. In cases of positive
> >>> results, the ‘UglyMol’ tab allows to inspect 2FoFc and FoFc maps
> directly
> >>> in the web browser. Thi
> >>>
> >>> 2) life-update. Previously, results were sent to you once all ~2000 MR
> jobs
> >>> were finished. Now, the individual results for each potential
> contaminant
> >>> will appear as soon as they are finished. This feature should
> substantially
> >>> shorten the time for identifying positive results (i.e. contaminant
> >>> detected), which are terminated faster than negative ones.
> >>>
> >>> 3) custom contaminants. In the ‘Advanced’ tab, users can upload own PDB
> >>> files (more than one is possible) to be included as search models. This
> >>> feature can be used to include PDB files from your lab bench
> neighbour’s
> >>> project to test for potential lab internal contaminations (through
> >>> bacterial contamination or through mix-up of plasmids or glycerol
> stocks).
> >>> This feature could also be ‘abused’ as a means to use the MoRDa
> pipeline to
> >>> run molecular replacements with template structures that are not yet
> >>> deposited in the PDB; for example to run molecular replacement and
> initial
> >>> refinement for liganded or complexed versions of an unpublished
> structure.
> >>> This might be particularly interesting for crystallographers away from
> >>> their usual home software environment (e.g. at the beamline).
> >>>
> >>> Finally, a word of warning – Staraniso files might give false
> positives if
> >>> they have large anisotropic cuts.
> >>>
> >>> Keep your crystals clean!
> >>>
> >>> With best wishes
> >>>
> >>> The ContaMiner Team
>
>

Reply via email to