Re: [ccp4bb] Invisible atoms in ligands

2014-06-16 Thread James Holton


Thank you Pavel for the clarification!

What I was really trying to point out is that a missing atom, occ=0.00 
and occ=0.01 are not as similar as one might naiively think.  Also, if 
you put a ligand into a pocket and the occupancy refines to  0, that 
does not necessarily mean the ligand is partially occupied.  If the 
pocket is actually filled with flat bulk solvent, then you expect the 
ligand occupancy to be non-zero, simply because something is better than 
nothing.  However, if the bulk solvent mask were somehow smarter and 
filled the pocket of a, say, 60% occupied ligand with flat bulk density 
at 40% the level of bulk density used far away from any atoms, then one 
might actually see the occupancy of a bogus ligand refine to zero.  That 
is, a ligand built into a pocket that is truly empty (filled with flat 
bulk solvent) and then occupancy refined would actually be a 
competition between two alternative hypotheses: 1) ligand in the 
pocket, 2) nothing but solvent in the pocket.  If the occupancy of the 
ligand refines to zero in this context, then you can be quite confident 
that it didn't bind, at least not in the given orientation.


I fully realize that the implementation of this is easier said than 
done, but perhaps it would be worth the effort?


-James Holton
MAD Scientist

On 6/16/2014 3:04 PM, Pavel Afonine wrote:

Hi James,

a remark: different programs may treat occ=0 differently. In 
phenix.refine (phenix.maps, etc) atoms with zero occupancy will be 
ignored for bulk-solvent mask calculation, unless you ask to do 
otherwise. For example, this means that if you want to calculate a 
ligand OMIT map both options

- removing the ligand from PDB file;
- setting its occupancy to zero and making sure mask does not ignore 
occ=0 atoms)

are a) not equivalent and b) both not good.
In first case (removing atoms from file) bulk-solvent will flatten 
residual map (as you pointed out). In second case bulk-solvent will be 
excluded in a very specific area, so that residual (green) density 
you see there may be either just bulk-solvent or ligand in question or 
a mixture; obviously not a very useful information! This highlights 
the fundamental problem of flat bulk-solvent model the way it's 
currently used.


Pavel


On Sun, Jun 15, 2014 at 3:01 PM, James Holton jmhol...@lbl.gov 
mailto:jmhol...@lbl.gov wrote:



The principle difference between occ=0 and omitting the atom
entirely is that occ=0 atoms exclude bulk solvent. Or at least
they do for typical operation of contemporary refinement
programs.  So, by defining occ=0 you are forcing all map voxels
within ~0.6A or so of your invisible atom to be vacuum.  If you
omit it, then the bulk solvent may flood in, perhaps enough to
pull the fo-fc peak down below 3x rms.  How much the bulk solvent
floods in depends on how nearby atoms exclude the bulk solvent,
and this, in turn, depends on which refinement program you are
using.  Different bulk solvent implementations use different
radii, shrink parameters, etc.  In addition, bulk solvent always
bleeds a bit into surrounding areas because the solvent B factor
is never zero.

The real problem, I think, is that for any voxel of the map there
is ALWAYS something there.  The only question is: what is it? 
Is there a 100% occupied ligand?  100% occupied solvent?  Two

conformers of the ligand?  Or is it some mixture of all these?  If
you are asking these questions I think it is most likely a
mixture, and mixtures are hard to model.  What is worse, mixtures
of a partially-occupied ligand with bulk solvent taking up the
slack is currently impossible to model.  We will have to wait for
partial-occupancy-bulk-solvent to be implemented before we can
build representations of these alternative hypotheses and and test
them with competitive occupancy refinement.

The bulk solvent is actually a very good example of something for
which we see no evidence in our electron density maps, yet we
model it in because 1) we know it must be there, and 2) it makes
our R factors lower. What more could you want?

-James Holton
MAD Scientist


On 6/13/2014 7:45 PM, Frank von Delft wrote:

Hi all - talking about ligands, a quick question on that old
conundrum, of what to do about invisible atoms -- build them with
occ=0, or omit them?

For bits of protein, I know all the arguments; personally I
prefer omitting atoms because:

  * for amino acid sidechains, their presence is implied in the
residue name.
  * for whole residues, their presence is implied in the sequence
numbering

However:  what about ligands?  Nowhere else in the PDB file is
their presence implied - or have I missed something?

(Certainly disorder in a ligand is important information that
needs to be captured!)

Cheers
phx







Re: [ccp4bb] Invisible atoms in ligands

2014-06-16 Thread Hargreaves, David
Hi All,

I had an interesting case recently where a Cl atom of a chlorophenyl moiety 
went missing in a structure (primary X-ray damage was the suspected culprit). 
Small molecule Mass Spec suggested the atom was there to start with but it was 
quite obviously missing in the maps (1.9Ang) and the des-chloro refined much 
better. I was asked to replace the missing atom as it was seen as misleading 
because all the assay and biophysical data was generated with the chlorophenyl 
compound; this after all, is what the X-ray model was supporting. I did wonder 
where the Cl atom ended up and in what state.

David Hargreaves
Associate Principal Scientist
_
AstraZeneca
Discovery Sciences, Structure  Biophysics
Mereside, 50F49, Alderley Park, Cheshire, SK10 4TF
Tel +44 (0)01625 518521  Fax +44 (0) 1625 232693
David.Hargreaves @astrazeneca.commailto:name.surn...@astrazeneca.com

Please consider the environment before printing this e-mail

From: CCP4 bulletin board [mailto:CCP4BB@JISCMAIL.AC.UK] On Behalf Of Frank von 
Delft
Sent: 13 June 2014 10:45
To: CCP4BB@JISCMAIL.AC.UK
Subject: [ccp4bb] Invisible atoms in ligands

Hi all - talking about ligands, a quick question on that old conundrum, of what 
to do about invisible atoms -- build them with occ=0, or omit them?

For bits of protein, I know all the arguments;  personally I prefer omitting 
atoms because:

  *   for amino acid sidechains, their presence is implied in the residue name.
  *   for whole residues, their presence is implied in the sequence numbering
However:  what about ligands?  Nowhere else in the PDB file is their presence 
implied - or have I missed something?

(Certainly disorder in a ligand is important information that needs to be 
captured!)

Cheers
phx

--
AstraZeneca UK Limited is a company incorporated in England and Wales with 
registered number: 03674842 and a registered office at 2 Kingdom Street, 
London, W2 6BD.
Confidentiality Notice: This message is private and may contain confidential, 
proprietary and legally privileged information. If you have received this 
message in error, please notify us and remove it from your system and note that 
you must not copy, distribute or take any action in reliance on it. Any 
unauthorised use or disclosure of the contents of this message is not permitted 
and may be unlawful.
Disclaimer: Email messages may be subject to delays, interception, non-delivery 
and unauthorised alterations. Therefore, information expressed in this message 
is not given or endorsed by AstraZeneca UK Limited unless otherwise notified by 
an authorised representative independent of this message. No contractual 
relationship is created by this message by any person unless specifically 
indicated by agreement in writing other than email.
Monitoring: AstraZeneca UK Limited may monitor email traffic data and content 
for the purposes of the prevention and detection of crime, ensuring the 
security of our computer systems and checking Compliance with our Code of 
Conduct and Policies.


Re: [ccp4bb] Invisible atoms in ligands

2014-06-16 Thread Ian Tickle
Dear James

You seem to be discounting the possibility of a true vacuum inside a
structure, which is obviously not the same thing as 'something' (bulk
solvent or whatever).  I accept that this is unlikely in the case of ligand
binding sites exposed to solvent, or indeed any site on the outer surface
of the molecule, since any vacuum in that situation would be unstable
against the ingress of water molecules, but it is possible in the case of
fully-enclosed cavities (i.e. 'inner surface') that are normally
inaccessible to water.  I don't know if anyone has done a systematic survey
of this, i.e. looking for cavities where the density appears to be actually
zero (taking into account F000 of course), or at least significantly lower
compared with the bulk solvent density (where the assumed value of F000
wouldn't affect the result).

Cheers

-- Ian


On 16 June 2014 07:37, James Holton jmhol...@lbl.gov wrote:


 Thank you Pavel for the clarification!

 What I was really trying to point out is that a missing atom, occ=0.00
 and occ=0.01 are not as similar as one might naiively think.  Also, if you
 put a ligand into a pocket and the occupancy refines to  0, that does not
 necessarily mean the ligand is partially occupied.  If the pocket is
 actually filled with flat bulk solvent, then you expect the ligand
 occupancy to be non-zero, simply because something is better than nothing.
 However, if the bulk solvent mask were somehow smarter and filled the
 pocket of a, say, 60% occupied ligand with flat bulk density at 40% the
 level of bulk density used far away from any atoms, then one might actually
 see the occupancy of a bogus ligand refine to zero.  That is, a ligand
 built into a pocket that is truly empty (filled with flat bulk solvent)
 and then occupancy refined would actually be a competition between two
 alternative hypotheses: 1) ligand in the pocket, 2) nothing but solvent in
 the pocket.  If the occupancy of the ligand refines to zero in this
 context, then you can be quite confident that it didn't bind, at least not
 in the given orientation.

 I fully realize that the implementation of this is easier said than done,
 but perhaps it would be worth the effort?

 -James Holton
 MAD Scientist


 On 6/16/2014 3:04 PM, Pavel Afonine wrote:

 Hi James,

  a remark: different programs may treat occ=0 differently. In
 phenix.refine (phenix.maps, etc) atoms with zero occupancy will be ignored
 for bulk-solvent mask calculation, unless you ask to do otherwise. For
 example, this means that if you want to calculate a ligand OMIT map both
 options
 - removing the ligand from PDB file;
 - setting its occupancy to zero and making sure mask does not ignore occ=0
 atoms)
 are a) not equivalent and b) both not good.
 In first case (removing atoms from file) bulk-solvent will flatten
 residual map (as you pointed out). In second case bulk-solvent will be
 excluded in a very specific area, so that residual (green) density you
 see there may be either just bulk-solvent or ligand in question or a
 mixture; obviously not a very useful information! This highlights the
 fundamental problem of flat bulk-solvent model the way it's currently used.

  Pavel


 On Sun, Jun 15, 2014 at 3:01 PM, James Holton jmhol...@lbl.gov wrote:


 The principle difference between occ=0 and omitting the atom entirely is
 that occ=0 atoms exclude bulk solvent.  Or at least they do for typical
 operation of contemporary refinement programs.  So, by defining occ=0 you
 are forcing all map voxels within ~0.6A or so of your invisible atom to
 be vacuum.  If you omit it, then the bulk solvent may flood in, perhaps
 enough to pull the fo-fc peak down below 3x rms.  How much the bulk solvent
 floods in depends on how nearby atoms exclude the bulk solvent, and this,
 in turn, depends on which refinement program you are using.  Different bulk
 solvent implementations use different radii, shrink parameters, etc.  In
 addition, bulk solvent always bleeds a bit into surrounding areas because
 the solvent B factor is never zero.

 The real problem, I think, is that for any voxel of the map there is
 ALWAYS something there.  The only question is: what is it?  Is there a
 100% occupied ligand?  100% occupied solvent?  Two conformers of the
 ligand?  Or is it some mixture of all these?  If you are asking these
 questions I think it is most likely a mixture, and mixtures are hard to
 model.  What is worse, mixtures of a partially-occupied ligand with bulk
 solvent taking up the slack is currently impossible to model.  We will have
 to wait for partial-occupancy-bulk-solvent to be implemented before we can
 build representations of these alternative hypotheses and and test them
 with competitive occupancy refinement.

 The bulk solvent is actually a very good example of something for which
 we see no evidence in our electron density maps, yet we model it in
 because 1) we know it must be there, and 2) it makes our R factors lower.
 What more could you want?

 

Re: [ccp4bb] Invisible atoms in ligands

2014-06-16 Thread Daniel Picot

Dear Ian,
  This has been discussed in a review and related articles by Brian 
Matthews and Liljun Liu:


Matthews BW, Liu L. A review about nothing: are apolar cavities in proteins
really empty? Protein Sci. 2009 Mar;18(3):494-502. doi: 10.1002/pro.61. Review.
PubMed PMID: 19241368; PubMed Central PMCID: PMC2760356.

Daniel



Le 16/06/2014 11:32, Ian Tickle a écrit :

Dear James

You seem to be discounting the possibility of a true vacuum inside a 
structure, which is obviously not the same thing as 'something' (bulk 
solvent or whatever).  I accept that this is unlikely in the case of 
ligand binding sites exposed to solvent, or indeed any site on the 
outer surface of the molecule, since any vacuum in that situation 
would be unstable against the ingress of water molecules, but it is 
possible in the case of fully-enclosed cavities (i.e. 'inner surface') 
that are normally inaccessible to water.  I don't know if anyone has 
done a systematic survey of this, i.e. looking for cavities where the 
density appears to be actually zero (taking into account F000 of 
course), or at least significantly lower compared with the bulk 
solvent density (where the assumed value of F000 wouldn't affect the 
result).


Cheers

-- Ian


On 16 June 2014 07:37, James Holton jmhol...@lbl.gov 
mailto:jmhol...@lbl.gov wrote:



Thank you Pavel for the clarification!

What I was really trying to point out is that a missing atom,
occ=0.00 and occ=0.01 are not as similar as one might naiively
think.  Also, if you put a ligand into a pocket and the occupancy
refines to  0, that does not necessarily mean the ligand is
partially occupied.  If the pocket is actually filled with flat
bulk solvent, then you expect the ligand occupancy to be non-zero,
simply because something is better than nothing.  However, if the
bulk solvent mask were somehow smarter and filled the pocket of
a, say, 60% occupied ligand with flat bulk density at 40% the
level of bulk density used far away from any atoms, then one might
actually see the occupancy of a bogus ligand refine to zero.  That
is, a ligand built into a pocket that is truly empty (filled
with flat bulk solvent) and then occupancy refined would actually
be a competition between two alternative hypotheses: 1) ligand
in the pocket, 2) nothing but solvent in the pocket.  If the
occupancy of the ligand refines to zero in this context, then you
can be quite confident that it didn't bind, at least not in the
given orientation.

I fully realize that the implementation of this is easier said
than done, but perhaps it would be worth the effort?

-James Holton
MAD Scientist


On 6/16/2014 3:04 PM, Pavel Afonine wrote:

Hi James,

a remark: different programs may treat occ=0 differently. In
phenix.refine (phenix.maps, etc) atoms with zero occupancy will
be ignored for bulk-solvent mask calculation, unless you ask to
do otherwise. For example, this means that if you want to
calculate a ligand OMIT map both options
- removing the ligand from PDB file;
- setting its occupancy to zero and making sure mask does not
ignore occ=0 atoms)
are a) not equivalent and b) both not good.
In first case (removing atoms from file) bulk-solvent will
flatten residual map (as you pointed out). In second case
bulk-solvent will be excluded in a very specific area, so that
residual (green) density you see there may be either just
bulk-solvent or ligand in question or a mixture; obviously not a
very useful information! This highlights the fundamental problem
of flat bulk-solvent model the way it's currently used.

Pavel


On Sun, Jun 15, 2014 at 3:01 PM, James Holton jmhol...@lbl.gov
mailto:jmhol...@lbl.gov wrote:


The principle difference between occ=0 and omitting the atom
entirely is that occ=0 atoms exclude bulk solvent.  Or at
least they do for typical operation of contemporary
refinement programs.  So, by defining occ=0 you are forcing
all map voxels within ~0.6A or so of your invisible atom to
be vacuum.  If you omit it, then the bulk solvent may flood
in, perhaps enough to pull the fo-fc peak down below 3x
rms.  How much the bulk solvent floods in depends on how
nearby atoms exclude the bulk solvent, and this, in turn,
depends on which refinement program you are using. Different
bulk solvent implementations use different radii, shrink
parameters, etc.  In addition, bulk solvent always bleeds a
bit into surrounding areas because the solvent B factor is
never zero.

The real problem, I think, is that for any voxel of the map
there is ALWAYS something there.  The only question is:
what is it?  Is there a 100% occupied ligand?  100% occupied
solvent?  Two conformers of the ligand? Or is it some mixture
  

Re: [ccp4bb] Invisible atoms in ligands

2014-06-16 Thread Ian Tickle
Dear Daniel

Thanks for the info.  I knew that Brian Matthews' group had done some work
in this area.

Cheers

-- Ian


On 16 June 2014 11:16, Daniel Picot daniel.pi...@ibpc.fr wrote:

  Dear Ian,
   This has been discussed in a review and related articles by Brian
 Matthews and Liljun Liu:

 Matthews BW, Liu L. A review about nothing: are apolar cavities in proteins
 really empty? Protein Sci. 2009 Mar;18(3):494-502. doi: 10.1002/pro.61. 
 Review.
 PubMed PMID: 19241368; PubMed Central PMCID: PMC2760356.

 Daniel



 Le 16/06/2014 11:32, Ian Tickle a écrit :

  Dear James

  You seem to be discounting the possibility of a true vacuum inside a
 structure, which is obviously not the same thing as 'something' (bulk
 solvent or whatever).  I accept that this is unlikely in the case of ligand
 binding sites exposed to solvent, or indeed any site on the outer surface
 of the molecule, since any vacuum in that situation would be unstable
 against the ingress of water molecules, but it is possible in the case of
 fully-enclosed cavities (i.e. 'inner surface') that are normally
 inaccessible to water.  I don't know if anyone has done a systematic survey
 of this, i.e. looking for cavities where the density appears to be actually
 zero (taking into account F000 of course), or at least significantly lower
 compared with the bulk solvent density (where the assumed value of F000
 wouldn't affect the result).

  Cheers

  -- Ian


 On 16 June 2014 07:37, James Holton jmhol...@lbl.gov wrote:


 Thank you Pavel for the clarification!

 What I was really trying to point out is that a missing atom, occ=0.00
 and occ=0.01 are not as similar as one might naiively think.  Also, if you
 put a ligand into a pocket and the occupancy refines to  0, that does not
 necessarily mean the ligand is partially occupied.  If the pocket is
 actually filled with flat bulk solvent, then you expect the ligand
 occupancy to be non-zero, simply because something is better than nothing.
 However, if the bulk solvent mask were somehow smarter and filled the
 pocket of a, say, 60% occupied ligand with flat bulk density at 40% the
 level of bulk density used far away from any atoms, then one might actually
 see the occupancy of a bogus ligand refine to zero.  That is, a ligand
 built into a pocket that is truly empty (filled with flat bulk solvent)
 and then occupancy refined would actually be a competition between two
 alternative hypotheses: 1) ligand in the pocket, 2) nothing but solvent in
 the pocket.  If the occupancy of the ligand refines to zero in this
 context, then you can be quite confident that it didn't bind, at least not
 in the given orientation.

 I fully realize that the implementation of this is easier said than done,
 but perhaps it would be worth the effort?

 -James Holton
 MAD Scientist


 On 6/16/2014 3:04 PM, Pavel Afonine wrote:

 Hi James,

  a remark: different programs may treat occ=0 differently. In
 phenix.refine (phenix.maps, etc) atoms with zero occupancy will be ignored
 for bulk-solvent mask calculation, unless you ask to do otherwise. For
 example, this means that if you want to calculate a ligand OMIT map both
 options
 - removing the ligand from PDB file;
 - setting its occupancy to zero and making sure mask does not ignore
 occ=0 atoms)
 are a) not equivalent and b) both not good.
 In first case (removing atoms from file) bulk-solvent will flatten
 residual map (as you pointed out). In second case bulk-solvent will be
 excluded in a very specific area, so that residual (green) density you
 see there may be either just bulk-solvent or ligand in question or a
 mixture; obviously not a very useful information! This highlights the
 fundamental problem of flat bulk-solvent model the way it's currently used.

  Pavel


 On Sun, Jun 15, 2014 at 3:01 PM, James Holton jmhol...@lbl.gov wrote:


 The principle difference between occ=0 and omitting the atom entirely is
 that occ=0 atoms exclude bulk solvent.  Or at least they do for typical
 operation of contemporary refinement programs.  So, by defining occ=0 you
 are forcing all map voxels within ~0.6A or so of your invisible atom to
 be vacuum.  If you omit it, then the bulk solvent may flood in, perhaps
 enough to pull the fo-fc peak down below 3x rms.  How much the bulk solvent
 floods in depends on how nearby atoms exclude the bulk solvent, and this,
 in turn, depends on which refinement program you are using.  Different bulk
 solvent implementations use different radii, shrink parameters, etc.  In
 addition, bulk solvent always bleeds a bit into surrounding areas because
 the solvent B factor is never zero.

 The real problem, I think, is that for any voxel of the map there is
 ALWAYS something there.  The only question is: what is it?  Is there a
 100% occupied ligand?  100% occupied solvent?  Two conformers of the
 ligand?  Or is it some mixture of all these?  If you are asking these
 questions I think it is most likely a mixture, and mixtures are hard to
 

Re: [ccp4bb] Invisible atoms in ligands

2014-06-15 Thread James Holton


The principle difference between occ=0 and omitting the atom entirely is 
that occ=0 atoms exclude bulk solvent.  Or at least they do for typical 
operation of contemporary refinement programs.  So, by defining occ=0 
you are forcing all map voxels within ~0.6A or so of your invisible 
atom to be vacuum.  If you omit it, then the bulk solvent may flood 
in, perhaps enough to pull the fo-fc peak down below 3x rms.  How much 
the bulk solvent floods in depends on how nearby atoms exclude the bulk 
solvent, and this, in turn, depends on which refinement program you are 
using.  Different bulk solvent implementations use different radii, 
shrink parameters, etc.  In addition, bulk solvent always bleeds a 
bit into surrounding areas because the solvent B factor is never zero.


The real problem, I think, is that for any voxel of the map there is 
ALWAYS something there.  The only question is: what is it? Is there a 
100% occupied ligand?  100% occupied solvent?  Two conformers of the 
ligand?  Or is it some mixture of all these?  If you are asking these 
questions I think it is most likely a mixture, and mixtures are hard to 
model.  What is worse, mixtures of a partially-occupied ligand with bulk 
solvent taking up the slack is currently impossible to model.  We will 
have to wait for partial-occupancy-bulk-solvent to be implemented before 
we can build representations of these alternative hypotheses and and 
test them with competitive occupancy refinement.


The bulk solvent is actually a very good example of something for which 
we see no evidence in our electron density maps, yet we model it in 
because 1) we know it must be there, and 2) it makes our R factors 
lower.  What more could you want?


-James Holton
MAD Scientist

On 6/13/2014 7:45 PM, Frank von Delft wrote:
Hi all - talking about ligands, a quick question on that old 
conundrum, of what to do about invisible atoms -- build them with 
occ=0, or omit them?


For bits of protein, I know all the arguments;  personally I prefer 
omitting atoms because:


  * for amino acid sidechains, their presence is implied in the
residue name.
  * for whole residues, their presence is implied in the sequence
numbering

However:  what about ligands?  Nowhere else in the PDB file is their 
presence implied - or have I missed something?


(Certainly disorder in a ligand is important information that needs to 
be captured!)


Cheers
phx




Re: [ccp4bb] Invisible atoms in ligands

2014-06-15 Thread Pavel Afonine
Hi James,

a remark: different programs may treat occ=0 differently. In phenix.refine
(phenix.maps, etc) atoms with zero occupancy will be ignored for
bulk-solvent mask calculation, unless you ask to do otherwise. For example,
this means that if you want to calculate a ligand OMIT map both options
- removing the ligand from PDB file;
- setting its occupancy to zero and making sure mask does not ignore occ=0
atoms)
are a) not equivalent and b) both not good.
In first case (removing atoms from file) bulk-solvent will flatten residual
map (as you pointed out). In second case bulk-solvent will be excluded in a
very specific area, so that residual (green) density you see there may be
either just bulk-solvent or ligand in question or a mixture; obviously not
a very useful information! This highlights the fundamental problem of flat
bulk-solvent model the way it's currently used.

Pavel


On Sun, Jun 15, 2014 at 3:01 PM, James Holton jmhol...@lbl.gov wrote:


 The principle difference between occ=0 and omitting the atom entirely is
 that occ=0 atoms exclude bulk solvent.  Or at least they do for typical
 operation of contemporary refinement programs.  So, by defining occ=0 you
 are forcing all map voxels within ~0.6A or so of your invisible atom to
 be vacuum.  If you omit it, then the bulk solvent may flood in, perhaps
 enough to pull the fo-fc peak down below 3x rms.  How much the bulk solvent
 floods in depends on how nearby atoms exclude the bulk solvent, and this,
 in turn, depends on which refinement program you are using.  Different bulk
 solvent implementations use different radii, shrink parameters, etc.  In
 addition, bulk solvent always bleeds a bit into surrounding areas because
 the solvent B factor is never zero.

 The real problem, I think, is that for any voxel of the map there is
 ALWAYS something there.  The only question is: what is it?  Is there a
 100% occupied ligand?  100% occupied solvent?  Two conformers of the
 ligand?  Or is it some mixture of all these?  If you are asking these
 questions I think it is most likely a mixture, and mixtures are hard to
 model.  What is worse, mixtures of a partially-occupied ligand with bulk
 solvent taking up the slack is currently impossible to model.  We will have
 to wait for partial-occupancy-bulk-solvent to be implemented before we can
 build representations of these alternative hypotheses and and test them
 with competitive occupancy refinement.

 The bulk solvent is actually a very good example of something for which we
 see no evidence in our electron density maps, yet we model it in because
 1) we know it must be there, and 2) it makes our R factors lower.  What
 more could you want?

 -James Holton
 MAD Scientist


 On 6/13/2014 7:45 PM, Frank von Delft wrote:

 Hi all - talking about ligands, a quick question on that old conundrum, of
 what to do about invisible atoms -- build them with occ=0, or omit them?

 For bits of protein, I know all the arguments;  personally I prefer
 omitting atoms because:

- for amino acid sidechains, their presence is implied in the residue
name.
- for whole residues, their presence is implied in the sequence
numbering

 However:  what about ligands?  Nowhere else in the PDB file is their
 presence implied - or have I missed something?

 (Certainly disorder in a ligand is important information that needs to be
 captured!)

 Cheers
 phx





[ccp4bb] Invisible atoms in ligands

2014-06-13 Thread Frank von Delft
Hi all - talking about ligands, a quick question on that old conundrum, 
of what to do about invisible atoms -- build them with occ=0, or omit them?


For bits of protein, I know all the arguments;  personally I prefer 
omitting atoms because:


 * for amino acid sidechains, their presence is implied in the residue
   name.
 * for whole residues, their presence is implied in the sequence numbering

However:  what about ligands?  Nowhere else in the PDB file is their 
presence implied - or have I missed something?


(Certainly disorder in a ligand is important information that needs to 
be captured!)


Cheers
phx


Re: [ccp4bb] Invisible atoms in ligands

2014-06-13 Thread Tim Gruene
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1

Hi Frank,

if you can calculate their position, i.e. it is part of a rigid group,
I would leave it. If it is flexible, I would remove it because the
reader of my deposited structure may not be a crystallographer and
misinterpret the result.

If it is obvious that some atoms are missing, it is even better
because it is more likely to make the reader think about the reasons
why part of the ligand is not displayed in the model.

Cheers,
Tim

On 06/13/2014 11:45 AM, Frank von Delft wrote:
 Hi all - talking about ligands, a quick question on that old
 conundrum, of what to do about invisible atoms -- build them with
 occ=0, or omit them?
 
 For bits of protein, I know all the arguments;  personally I
 prefer omitting atoms because:
 
 * for amino acid sidechains, their presence is implied in the
 residue name. * for whole residues, their presence is implied in
 the sequence numbering
 
 However:  what about ligands?  Nowhere else in the PDB file is
 their presence implied - or have I missed something?
 
 (Certainly disorder in a ligand is important information that needs
 to be captured!)
 
 Cheers phx
 

- -- 
- --
Dr Tim Gruene
Institut fuer anorganische Chemie
Tammannstr. 4
D-37077 Goettingen

GPG Key ID = A46BEE1A

-BEGIN PGP SIGNATURE-
Version: GnuPG v1.4.12 (GNU/Linux)
Comment: Using GnuPG with Icedove - http://www.enigmail.net/

iD8DBQFTmsx7UxlJ7aRr7hoRAhxOAJ9YXwdr4kZULidLAdzTgEjdZwfXNQCeKYTA
9vSeeDh7eq2v4CwBII64wn8=
=uviV
-END PGP SIGNATURE-


Re: [ccp4bb] Invisible atoms in ligands

2014-06-13 Thread Marco Mazzorana
Hi Frank,
I agree with what Tim has just posted.

Personally I would not think occupancy=0, since this would mean the atom is
not where you placed it (i.e. nowhere near). This could be useful if all
your ligand has the same partial occupancy or, if you have multiple poses,
the sum of occupancies of the single atoms equals that of the whole ligand.
None of the above seems to fit your case.

Another possibility is to define the whole ligand and letting the B-factors
for those atoms go very high, but this provides an information on
coordinates which some would just take for granted... if that structure
ends up in the PDB, the less experienced readers would be tempted to say
that x,y,z are the coordinates of that atom, which we would know from the B
factors they are not.

Since a PDB file contains a model (i.e. our understanding and
interpretation of the electron density maps), omitting those atoms seems to
me the more correct way.

Having said so, I agree some important information is missing, but this can
be explained in the paper supporting the entry. If the structure is only
for internal use, then people dealing with it should be warned and the
insanely-high B-factors option might be preferable.

Hope this helps,

Ciao


Marco





2014-06-13 10:45 GMT+01:00 Frank von Delft frank.vonde...@sgc.ox.ac.uk:

  Hi all - talking about ligands, a quick question on that old conundrum,
 of what to do about invisible atoms -- build them with occ=0, or omit
 them?

 For bits of protein, I know all the arguments;  personally I prefer
 omitting atoms because:

- for amino acid sidechains, their presence is implied in the residue
name.
- for whole residues, their presence is implied in the sequence
numbering

 However:  what about ligands?  Nowhere else in the PDB file is their
 presence implied - or have I missed something?

 (Certainly disorder in a ligand is important information that needs to be
 captured!)

 Cheers
 phx



Re: [ccp4bb] Invisible atoms in ligands

2014-06-13 Thread Robbie Joosten
Hi Tim,

The problem with missing atoms in ligands is that you cannot use the 
coordinates for any follow-up calculation that requires ligand topology (e.g. 
restraint generation). That forces you to rely on the annotation of the 
compound, for instance at the PDB. That can be quite messy and leaves extra 
room for errors and misunderstandings.

Cheers,
Robbie

 -Original Message-
 From: CCP4 bulletin board [mailto:CCP4BB@JISCMAIL.AC.UK] On Behalf Of
 Tim Gruene
 Sent: Friday, June 13, 2014 12:04
 To: CCP4BB@JISCMAIL.AC.UK
 Subject: Re: [ccp4bb] Invisible atoms in ligands
 
 -BEGIN PGP SIGNED MESSAGE-
 Hash: SHA1
 
 Hi Frank,
 
 if you can calculate their position, i.e. it is part of a rigid group, I 
 would leave
 it. If it is flexible, I would remove it because the reader of my deposited
 structure may not be a crystallographer and misinterpret the result.
 
 If it is obvious that some atoms are missing, it is even better because it is
 more likely to make the reader think about the reasons why part of the
 ligand is not displayed in the model.
 
 Cheers,
 Tim
 
 On 06/13/2014 11:45 AM, Frank von Delft wrote:
  Hi all - talking about ligands, a quick question on that old
  conundrum, of what to do about invisible atoms -- build them with
  occ=0, or omit them?
 
  For bits of protein, I know all the arguments;  personally I prefer
  omitting atoms because:
 
  * for amino acid sidechains, their presence is implied in the residue
  name. * for whole residues, their presence is implied in the sequence
  numbering
 
  However:  what about ligands?  Nowhere else in the PDB file is their
  presence implied - or have I missed something?
 
  (Certainly disorder in a ligand is important information that needs to
  be captured!)
 
  Cheers phx
 
 
 - --
 - --
 Dr Tim Gruene
 Institut fuer anorganische Chemie
 Tammannstr. 4
 D-37077 Goettingen
 
 GPG Key ID = A46BEE1A
 
 -BEGIN PGP SIGNATURE-
 Version: GnuPG v1.4.12 (GNU/Linux)
 Comment: Using GnuPG with Icedove - http://www.enigmail.net/
 
 iD8DBQFTmsx7UxlJ7aRr7hoRAhxOAJ9YXwdr4kZULidLAdzTgEjdZwfXNQCeKYT
 A
 9vSeeDh7eq2v4CwBII64wn8=
 =uviV
 -END PGP SIGNATURE-


Re: [ccp4bb] Invisible atoms in ligands

2014-06-13 Thread Tim Gruene
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1

Hi Robbie,

maybe it is good not to use coordinates for calculations if you cannot
measure the coordinates?

Cheers,
Tim

On 06/13/2014 12:35 PM, Robbie Joosten wrote:
 Hi Tim,
 
 The problem with missing atoms in ligands is that you cannot use
 the coordinates for any follow-up calculation that requires ligand
 topology (e.g. restraint generation). That forces you to rely on
 the annotation of the compound, for instance at the PDB. That can
 be quite messy and leaves extra room for errors and
 misunderstandings.
 
 Cheers, Robbie
 
 -Original Message- From: CCP4 bulletin board
 [mailto:CCP4BB@JISCMAIL.AC.UK] On Behalf Of Tim Gruene Sent:
 Friday, June 13, 2014 12:04 To: CCP4BB@JISCMAIL.AC.UK Subject:
 Re: [ccp4bb] Invisible atoms in ligands
 
 Hi Frank,
 
 if you can calculate their position, i.e. it is part of a rigid
 group, I would leave it. If it is flexible, I would remove it
 because the reader of my deposited structure may not be a
 crystallographer and misinterpret the result.
 
 If it is obvious that some atoms are missing, it is even better
 because it is more likely to make the reader think about the
 reasons why part of the ligand is not displayed in the model.
 
 Cheers, Tim
 
 On 06/13/2014 11:45 AM, Frank von Delft wrote:
 Hi all - talking about ligands, a quick question on that old 
 conundrum, of what to do about invisible atoms -- build them
 with occ=0, or omit them?
 
 For bits of protein, I know all the arguments;  personally I
 prefer omitting atoms because:
 
 * for amino acid sidechains, their presence is implied in the
 residue name. * for whole residues, their presence is implied
 in the sequence numbering
 
 However:  what about ligands?  Nowhere else in the PDB file
 is their presence implied - or have I missed something?
 
 (Certainly disorder in a ligand is important information that
 needs to be captured!)
 
 Cheers phx
 
 
 

- -- 
- --
Dr Tim Gruene
Institut fuer anorganische Chemie
Tammannstr. 4
D-37077 Goettingen

GPG Key ID = A46BEE1A

-BEGIN PGP SIGNATURE-
Version: GnuPG v1.4.12 (GNU/Linux)
Comment: Using GnuPG with Icedove - http://www.enigmail.net/

iD8DBQFTmuJyUxlJ7aRr7hoRAjdqAJ95tqbDlWhqW01lAuVvLwsMIAkv1wCaA0Bk
O0z7t7Bl7O7amDF9CxM0CI0=
=QSdy
-END PGP SIGNATURE-


Re: [ccp4bb] Invisible atoms in ligands

2014-06-13 Thread Isaac Westwood
Hi Tim,

The problem I see with including atoms with no density is that the one
place you can guarantee those atoms are not (at 100% occupancy), is at the
coordinates assigned to them.
I would argue that the scientific inaccuracy of such a model should
outweigh the desire to use the coordinates for downstream calculations.

A better solution may be for the PDB to require us to submit the PDB and
CIF file we generate for the full ligand in addition to the protein PDB
file which may only contain a truncated portion of the ligand as part of
the model.

I prefer omitting the atoms in the same way I do for the protein model and
for the same reasons. I accept this makes it potentially more troublesome
for downstream users as things stand, although would argue that there is
greater potential for harm if the unsuspecting user believes the atomic
positions are correct, where a ligand (or protein) atom has been modeled
into a region with no density.

Best,

Isaac Westwood




 On 13 June 2014 11:35, Robbie Joosten robbie_joos...@hotmail.com wrote:

 Hi Tim,

 The problem with missing atoms in ligands is that you cannot use the
 coordinates for any follow-up calculation that requires ligand topology
 (e.g. restraint generation). That forces you to rely on the annotation of
 the compound, for instance at the PDB. That can be quite messy and leaves
 extra room for errors and misunderstandings.

 Cheers,
 Robbie

  -Original Message-
  From: CCP4 bulletin board [mailto:CCP4BB@JISCMAIL.AC.UK] On Behalf Of
  Tim Gruene
  Sent: Friday, June 13, 2014 12:04
  To: CCP4BB@JISCMAIL.AC.UK
  Subject: Re: [ccp4bb] Invisible atoms in ligands
 
  -BEGIN PGP SIGNED MESSAGE-
  Hash: SHA1
 
  Hi Frank,
 
  if you can calculate their position, i.e. it is part of a rigid group,
 I would leave
  it. If it is flexible, I would remove it because the reader of my
 deposited
  structure may not be a crystallographer and misinterpret the result.
 
  If it is obvious that some atoms are missing, it is even better
 because it is
  more likely to make the reader think about the reasons why part of the
  ligand is not displayed in the model.
 
  Cheers,
  Tim
 
  On 06/13/2014 11:45 AM, Frank von Delft wrote:
   Hi all - talking about ligands, a quick question on that old
   conundrum, of what to do about invisible atoms -- build them with
   occ=0, or omit them?
  
   For bits of protein, I know all the arguments;  personally I prefer
   omitting atoms because:
  
   * for amino acid sidechains, their presence is implied in the residue
   name. * for whole residues, their presence is implied in the sequence
   numbering
  
   However:  what about ligands?  Nowhere else in the PDB file is their
   presence implied - or have I missed something?
  
   (Certainly disorder in a ligand is important information that needs
 to
   be captured!)
  
   Cheers phx
  
 
  - --
  - --
  Dr Tim Gruene
  Institut fuer anorganische Chemie
  Tammannstr. 4
  D-37077 Goettingen
 
  GPG Key ID = A46BEE1A
 
  -BEGIN PGP SIGNATURE-
  Version: GnuPG v1.4.12 (GNU/Linux)
  Comment: Using GnuPG with Icedove - http://www.enigmail.net/
 
  iD8DBQFTmsx7UxlJ7aRr7hoRAhxOAJ9YXwdr4kZULidLAdzTgEjdZwfXNQCeKYT
  A
  9vSeeDh7eq2v4CwBII64wn8=
  =uviV
  -END PGP SIGNATURE-






Re: [ccp4bb] Invisible atoms in ligands

2014-06-13 Thread Robbie Joosten
Hi Tim,

The decision of which atoms you can and cannot see in your map is rather 
subjective. Also the way you generate your map can make a big (enough) 
difference. A new map after additional refinement, an NCS averaged map, or a 
feature-enhanced map might show you the position of (some of) the missing atoms.
I prefer the 'high B-factor' model unless you have a very good reason to 
believe the compound is in any way chemically modified.

Cheers,
Robbie   



 -Original Message-
 From: Tim Gruene [mailto:t...@shelx.uni-ac.gwdg.de]
 Sent: Friday, June 13, 2014 13:37
 To: Robbie Joosten; CCP4BB@JISCMAIL.AC.UK
 Subject: Re: [ccp4bb] Invisible atoms in ligands
 
 -BEGIN PGP SIGNED MESSAGE-
 Hash: SHA1
 
 Hi Robbie,
 
 maybe it is good not to use coordinates for calculations if you cannot
 measure the coordinates?
 
 Cheers,
 Tim
 
 On 06/13/2014 12:35 PM, Robbie Joosten wrote:
  Hi Tim,
 
  The problem with missing atoms in ligands is that you cannot use the
  coordinates for any follow-up calculation that requires ligand
  topology (e.g. restraint generation). That forces you to rely on the
  annotation of the compound, for instance at the PDB. That can be quite
  messy and leaves extra room for errors and misunderstandings.
 
  Cheers, Robbie
 
  -Original Message- From: CCP4 bulletin board
  [mailto:CCP4BB@JISCMAIL.AC.UK] On Behalf Of Tim Gruene Sent:
  Friday, June 13, 2014 12:04 To: CCP4BB@JISCMAIL.AC.UK Subject:
  Re: [ccp4bb] Invisible atoms in ligands
 
  Hi Frank,
 
  if you can calculate their position, i.e. it is part of a rigid group,
  I would leave it. If it is flexible, I would remove it because the
  reader of my deposited structure may not be a crystallographer and
  misinterpret the result.
 
  If it is obvious that some atoms are missing, it is even better
  because it is more likely to make the reader think about the reasons
  why part of the ligand is not displayed in the model.
 
  Cheers, Tim
 
  On 06/13/2014 11:45 AM, Frank von Delft wrote:
  Hi all - talking about ligands, a quick question on that old
  conundrum, of what to do about invisible atoms -- build them with
  occ=0, or omit them?
 
  For bits of protein, I know all the arguments;  personally I prefer
  omitting atoms because:
 
  * for amino acid sidechains, their presence is implied in the
  residue name. * for whole residues, their presence is implied in
  the sequence numbering
 
  However:  what about ligands?  Nowhere else in the PDB file is
  their presence implied - or have I missed something?
 
  (Certainly disorder in a ligand is important information that needs
  to be captured!)
 
  Cheers phx
 
 
 
 
 - --
 - --
 Dr Tim Gruene
 Institut fuer anorganische Chemie
 Tammannstr. 4
 D-37077 Goettingen
 
 GPG Key ID = A46BEE1A
 
 -BEGIN PGP SIGNATURE-
 Version: GnuPG v1.4.12 (GNU/Linux)
 Comment: Using GnuPG with Icedove - http://www.enigmail.net/
 
 iD8DBQFTmuJyUxlJ7aRr7hoRAjdqAJ95tqbDlWhqW01lAuVvLwsMIAkv1wCaA0
 Bk
 O0z7t7Bl7O7amDF9CxM0CI0=
 =QSdy
 -END PGP SIGNATURE-


Re: [ccp4bb] Invisible atoms in ligands

2014-06-13 Thread Tim Gruene
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1

Hi Robbie,

Frank probably meant the status of the model when the author is ready
for deposition, i.e., already after additional refinement.

The B-factor assumes a harmonic oscillation of its atom on a linear
trajectory (maybe the superposition of one trajectory in each of the
three dimensions) - that's almost always a very poor assumption for an
atom for which you don't see the density, and by using the B-value to
mop up, you only adjust a parameter and may reduce the R-value, i.e.
improve your model w.r.t. crystallographic data.
As you remove the atoms you don't see, you improve your model with
respect to its usability after deposition.

Cheers,
Tim

On 06/13/2014 01:59 PM, Robbie Joosten wrote:
 Hi Tim,
 
 The decision of which atoms you can and cannot see in your map is
 rather subjective. Also the way you generate your map can make a
 big (enough) difference. A new map after additional refinement, an
 NCS averaged map, or a feature-enhanced map might show you the
 position of (some of) the missing atoms. I prefer the 'high
 B-factor' model unless you have a very good reason to believe the
 compound is in any way chemically modified.
 
 Cheers, Robbie
 
 
 
 -Original Message- From: Tim Gruene
 [mailto:t...@shelx.uni-ac.gwdg.de] Sent: Friday, June 13, 2014
 13:37 To: Robbie Joosten; CCP4BB@JISCMAIL.AC.UK Subject: Re:
 [ccp4bb] Invisible atoms in ligands
 
 Hi Robbie,
 
 maybe it is good not to use coordinates for calculations if you
 cannot measure the coordinates?
 
 Cheers, Tim
 
 On 06/13/2014 12:35 PM, Robbie Joosten wrote:
 Hi Tim,
 
 The problem with missing atoms in ligands is that you cannot
 use the coordinates for any follow-up calculation that
 requires ligand topology (e.g. restraint generation). That
 forces you to rely on the annotation of the compound, for
 instance at the PDB. That can be quite messy and leaves extra
 room for errors and misunderstandings.
 
 Cheers, Robbie
 
 -Original Message- From: CCP4 bulletin board 
 [mailto:CCP4BB@JISCMAIL.AC.UK] On Behalf Of Tim Gruene
 Sent: Friday, June 13, 2014 12:04 To: CCP4BB@JISCMAIL.AC.UK
 Subject: Re: [ccp4bb] Invisible atoms in ligands
 
 Hi Frank,
 
 if you can calculate their position, i.e. it is part of a
 rigid group, I would leave it. If it is flexible, I would
 remove it because the reader of my deposited structure may
 not be a crystallographer and misinterpret the result.
 
 If it is obvious that some atoms are missing, it is even
 better because it is more likely to make the reader think
 about the reasons why part of the ligand is not displayed in
 the model.
 
 Cheers, Tim
 
 On 06/13/2014 11:45 AM, Frank von Delft wrote:
 Hi all - talking about ligands, a quick question on
 that old conundrum, of what to do about invisible atoms
 -- build them with occ=0, or omit them?
 
 For bits of protein, I know all the arguments;
 personally I prefer omitting atoms because:
 
 * for amino acid sidechains, their presence is implied
 in the residue name. * for whole residues, their
 presence is implied in the sequence numbering
 
 However:  what about ligands?  Nowhere else in the PDB
 file is their presence implied - or have I missed
 something?
 
 (Certainly disorder in a ligand is important
 information that needs to be captured!)
 
 Cheers phx
 
 
 
 
 

- -- 
- --
Dr Tim Gruene
Institut fuer anorganische Chemie
Tammannstr. 4
D-37077 Goettingen

GPG Key ID = A46BEE1A

-BEGIN PGP SIGNATURE-
Version: GnuPG v1.4.12 (GNU/Linux)
Comment: Using GnuPG with Icedove - http://www.enigmail.net/

iD8DBQFTmuzWUxlJ7aRr7hoRAsazAJ0bkmlFBRPNxDDOnvuzRp9y5FMYBwCg3yj0
XosHt98UyfOcUrW4OWizoow=
=YDU4
-END PGP SIGNATURE-