Re: [ccp4bb] Comparison of Water Positions across PDBs

2013-11-06 Thread Bernhard Rupp
Hmmm..does that mean that the journals are now the ultimate authority of
what stays in the PDB?

I find this slightly irritating and worthy of change.

 

Best, BR

 

From: CCP4 bulletin board [mailto:CCP4BB@JISCMAIL.AC.UK] On Behalf Of Nat
Echols
Sent: Dienstag, 5. November 2013 15:09
To: CCP4BB@JISCMAIL.AC.UK
Subject: Re: [ccp4bb] Comparison of Water Positions across PDBs

 

On Tue, Nov 5, 2013 at 12:22 AM, Bernhard Rupp hofkristall...@gmail.com
wrote:

Given their otherwise almost paranoid sensitivity to ultimate author
authority 

(resulting in things like still having 2hr0 etc in the bank because certain
authors go AWOL or ignore major issues)

 

In defense of the PDB, it's not just the authors who went AWOL in that case
- it is ultimately the responsibility of the journals to retract clearly
fraudulent publications.

 

-Nat



Re: [ccp4bb] Comparison of Water Positions across PDBs

2013-11-06 Thread Nat Echols
On Wed, Nov 6, 2013 at 12:39 AM, Bernhard Rupp hofkristall...@gmail.comwrote:

 Hmmm….does that mean that the journals are now the ultimate authority of
 what stays in the PDB?

 I find this slightly irritating and worthy of change.


http://www.wwpdb.org/UAB.html

It is the current wwPDB (Worldwide PDB) policy that entries can be made
obsolete following a request from the people responsible for publishing it
(be it the principal author or journal editors).

I'm not sure I understand why things should be any different; the PDB is
not advertising itself as anything other than an archival service, unlike
the journals which are supposed to be our primary mechanism of quality
control.

-Nat


Re: [ccp4bb] Comparison of Water Positions across PDBs

2013-11-06 Thread Ronald E Stenkamp

I've remained silent as this thread evolved into a discussion of how the PDB deals with 
water names and numbers.  But Nat's comment about the PDB not advertising itself as 
anything other than an archival service finally prodded me into saying something.

Something I've slowly come to realize is that the PDB, while it started as an archive, 
has developed into a working database.  That's why they (the PDB 
workers/organizers/managers) have gotten into this mode where they change things from the 
original deposited files.  I learned several years ago that the PDB is willing to change 
the atom names in a ligand from those previously used in the published  literature.  This 
was done in the name of consistency and essentially made the PDB files into 
database entries, rather than archival files since the atom names no longer matched the 
atom names used in the papers.

Given the difficulties I had in discussing this with the annotaters, I've come 
to realize that as soon as I hit the submit button on a PDB submission, I've 
lost control over what will appear in the distributed file.  In a way, it's 
been liberating to reach that point.  It reduces my sense of responsibility for 
the contents of the file.

Ron

On Wed, 6 Nov 2013, Nat Echols wrote:


On Wed, Nov 6, 2013 at 12:39 AM, Bernhard Rupp hofkristall...@gmail.com wrote:

  Hmmm….does that mean that the journals are now the ultimate authority of 
what stays in
  the PDB?

  I find this slightly irritating and worthy of change.


http://www.wwpdb.org/UAB.html

It is the current wwPDB (Worldwide PDB) policy that entries can be made 
obsolete following a request
from the people responsible for publishing it (be it the principal author or journal 
editors).

I'm not sure I understand why things should be any different; the PDB is not 
advertising itself as
anything other than an archival service, unlike the journals which are supposed 
to be our primary
mechanism of quality control.

-Nat




Re: [ccp4bb] Comparison of Water Positions across PDBs

2013-11-06 Thread Prof. K. Sekar
Dear all,

there is a tool to identify invariant
water molecules in homologous protein structures
and the tool can be accessed via

http://cluster.physics.iisc.ernet.in/3dss/

best,

Sekar



 Thanks Bernhard 
   you have helpfully distinguished between the two processes - there is
 certainly a movement of waters to symmetry replacements closer to a chain
 - and that gets documented in Remark 525 of the PDB file returned to
 authors - although then it is stripped out, I think, before the entry is
 released.

  But generally a renumbering is applied to all the waters - these are not
 actually moved but are re-ordered. And of course the number count of them
 may change - in order to accommodate any waters that are swapped in or out
 of the chain during the symmetry operation.

  Currently I don't think authors are given access to an audit of what is
 happening - they can of course check their favourite waters by a
 superimposition. Still have to say best way to avoid errors would be to
 check symmetry and re-order at the end of refinement, pre-deposition. 

 All the best 
   Martyn 


 
  From: Bernhard Rupp hofkristall...@gmail.com
 To: CCP4BB@JISCMAIL.AC.UK
 Sent: Monday, 4 November 2013, 14:50
 Subject: Re: [ccp4bb] Comparison of Water Positions across PDBs



 As far as confusion as a result of PDB renumbering is concerned: It was
 useful to run the old REM525 standalone program (I think I got it from
 PDBe/Kim Hendrick) at the end of solvent building. It does what the PDB
 did with water renumbering when creating the REMARK 525 (probably based on
 ccp4 contact with additions). Is there an updated standalone PDB tool
 available one can use these days to avoid at least that issue?
  
 Thx, BR
  
 From:CCP4 bulletin board [mailto:CCP4BB@JISCMAIL.AC.UK] On Behalf Of
 MARTYN SYMMONS
 Sent: Montag, 4. November 2013 15:17
 To: CCP4BB@JISCMAIL.AC.UK
 Subject: Re: [ccp4bb] Comparison of Water Positions across PDBs
  
 Thank you for that, Rachel
  
 Even though the tone of your comment does not suggest that you want to
 carry on a dialogue about this, I thought I would reply in any case -
 since dialogue is what this forum is supposed to be about.
  
 Thing is,  I was sort of looking for an explanation of why the rule was
 adopted that waters were to be renumbered from N to C terminus. If this is
 not functioning to put the waters in register across a set of related
 structures then it seems somewhat arbitrary. And other schemes might be
 suggested to be better for the usability and interpretation of the
 structural data. 
  
 In some cases there are only a few waters but in many structures the wwPDB
 partners renumber hundreds. And this process makes it difficult for
 authors to check the final deposited structure against the output of their
 refinement. 
  
 I have to say that I agree with other contributors to this thread. It
 would be much better to let the refinement program authors agree on a
 default water numbering scheme. And then maintain that through
 deposition. 
  
 I thought of six possible schemes before breakfast... one of my favourites
 was to order by B-factor - which might appeal to crystallographers.
 Another was to give priority to those in the coordination sphere of any
 metal ions - these actually get priority in the PDB as they are included
 in the LINKS records above the coordinates. These coordinated waters are
 often refined together with the metals and so it would make sense to move
 them closer to their friendly ion.    
  
 And of course one other clearly suitable option would be to leave the
 waters in the authors' preferred order - chosen with help from their
 refinement suite. This is what happens during deposition with the residues
 of the polymers - (provided the authors chainids are suitably chosen).
 Following your link the rule for polymers is that: 'If the coordinate
 residue numbers, as provided by the author, are unique and sequential
 within a particular chain ID, the residues will not be renumbered.' 
  
 I'm presuming that if the authors have a preferred suitable set of water
 numbers then that would be maintained similarly?
  
 Perhaps that is what is happening in the cases I notice that do not follow
 wwPDB rules?
  
 On Friday I was looking at TIRAP structures and in 3ub2 the protein
 construct starts at residue 78 and its final residue is 221 - but the
 associated DTT is labelled back at residue 1 in the same chain. Then the
 first ten out of eleven waters are residues 2 to 11 but then oddly the
 eleventh water is residue 222. Is there a difference in this C-terminal
 water compared with the N-terminal ones? I imagined it was perhaps
 maintained to fit in with the associated publication - or maybe started
 out life modelled as a metal ion - unfortunately I can find no mention of
 it in the paper. 
  
 But, regardless of this distracting feature,  surely this entry does not
 conform to the expected numbering scheme you mentioned

Re: [ccp4bb] Comparison of Water Positions across PDBs

2013-11-05 Thread Nat Echols
On Tue, Nov 5, 2013 at 12:22 AM, Bernhard Rupp hofkristall...@gmail.comwrote:

 Given their otherwise almost paranoid sensitivity to ultimate author
 authority

 (resulting in things like still having 2hr0 etc in the bank because
 certain authors go AWOL or ignore major issues)


In defense of the PDB, it's not just the authors who went AWOL in that case
- it is ultimately the responsibility of the journals to retract clearly
fraudulent publications.

-Nat


Re: [ccp4bb] Comparison of Water Positions across PDBs

2013-11-04 Thread MARTYN SYMMONS
Thank you for that, Rachel

Even though the tone of your comment does not suggest that you want to carry on 
a dialogue about this, I thought I would reply in any case - since dialogue is 
what this forum is supposed to be about.

Thing is,  I was sort of looking for an explanation of why the rule was adopted 
that waters were to be renumbered from N to C terminus. If this is not 
functioning to put the waters in register across a set of related structures 
then it seems somewhat arbitrary. And other schemes might be suggested to be 
better for the usability and interpretation of the structural data. 

In some cases there are only a few waters but in many structures the wwPDB 
partners renumber hundreds. And this process makes it difficult for authors to 
check the final deposited structure against the output of their refinement. 

I have to say that I agree with other contributors to this thread. It would be 
much better to let the refinement program authors agree on a default water 
numbering scheme. And then maintain that through deposition. 

I thought of six possible schemes before breakfast... one of my favourites was 
to order by B-factor - which might appeal to crystallographers. Another was to 
give priority to those in the coordination sphere of any metal ions - these 
actually get priority in the PDB as they are included in the LINKS records 
above the coordinates. These coordinated waters are often refined together with 
the metals and so it would make sense to move them closer to their friendly 
ion.    

And of course one other clearly suitable option would be to leave the waters in 
the authors' preferred order - chosen with help from their refinement suite. 
This is what happens during deposition with the residues of the polymers - 
(provided the authors chainids are suitably chosen). Following your link the 
rule for polymers is that: 'If the coordinate residue numbers, as provided by 
the author, are unique and sequential within a particular chain ID, the 
residues will not be renumbered.' 

I'm presuming that if the authors have a preferred suitable set of water 
numbers then that would be maintained similarly?

Perhaps that is what is happening in the cases I notice that do not follow 
wwPDB rules?

On Friday I was looking at TIRAP structures and in 3ub2 the protein construct 
starts at residue 78 and its final residue is 221 - but the associated DTT is 
labelled back at residue 1 in the same chain. Then the first ten out of eleven 
waters are residues 2 to 11 but then oddly the eleventh water is residue 222. 
Is there a difference in this C-terminal water compared with the N-terminal 
ones? I imagined it was perhaps maintained to fit in with the associated 
publication - or maybe started out life modelled as a metal ion - unfortunately 
I can find no mention of it in the paper. 

But, regardless of this distracting feature,  surely this entry does not 
conform to the expected numbering scheme you mentioned as the wwPDB standard: 
polymer - heterogen - water? 

Yours perplexedly
 Martyn 



From: Rachel Kramer Green kra...@rcsb.rutgers.edu
To: CCP4BB@JISCMAIL.AC.UK 
Sent: Friday, 1 November 2013, 20:18
Subject: Re: [ccp4bb] Comparison of Water Positions across PDBs
 


In PDB format files, each polymer is assigned a unique chain ID. Chain IDs for 
all bound moieties and waters are assigned based on their proximity (number of 
contacts) to the nearest polymer. Once the polymers and non-polymer residues 
associated with them are assigned chain IDs, they are also assigned unique 
residue numbering with the order polymer residues, ligands and then waters. 

Please see: http://www.wwpdb.org/procedure.html#toc_4

The wwPDB has established this rule to improve the usability and
  interpretation of the structural data. Assigning the same chain ID
  for all moieties associated with a polymer enables rapid and
  uniform identification of feature analysis.

Sincerely,
Rachel Green

 


 
Rachel Kramer Green, Ph.D.
RCSB PDB
kra...@rcsb.rutgers.edu
 
 
Twitter: https://twitter.com/#!/buildmodels
Facebook: http://www.facebook.com/RCSBPDB
 
On 10/30/2013 8:09 AM, Eugene Krissinel wrote:

This is to be answered by PDB people, who definitely read BB :) Would be nice 
to have a tool common between CCP4/Phenix and the PDB which sorts this out 
Eugene On 30 Oct 2013, at 12:09, Andreas Förster wrote: 
Dear all, this water discussion is flowing increasingly towards a place where 
I feel a bit out of my depth. What is the convention for numbering water 
molecules?  Is there preference for: - putting waters into a separate chain (W 
for water or S for solvent)?
- splitting waters according to the peptide chains in the structure?
- appending all waters to chain A? Thanks. Andreas On 30/10/2013 11:57, MARTYN 
SYMMONS wrote: 
At deposition the PDB runs a script that renumbers authors'  waters
according to a scheme based on the residue

Re: [ccp4bb] Comparison of Water Positions across PDBs

2013-11-04 Thread Bernhard Rupp
As far as confusion as a result of PDB renumbering is concerned: It was
useful to run the old REM525 standalone program (I think I got it from
PDBe/Kim Hendrick) at the end of solvent building. It does what the PDB did
with water renumbering when creating the REMARK 525 (probably based on ccp4
contact with additions). Is there an updated standalone PDB tool available
one can use these days to avoid at least that issue? 

 

Thx, BR

 

From: CCP4 bulletin board [mailto:CCP4BB@JISCMAIL.AC.UK] On Behalf Of MARTYN
SYMMONS
Sent: Montag, 4. November 2013 15:17
To: CCP4BB@JISCMAIL.AC.UK
Subject: Re: [ccp4bb] Comparison of Water Positions across PDBs

 

Thank you for that, Rachel

 

Even though the tone of your comment does not suggest that you want to carry
on a dialogue about this, I thought I would reply in any case - since
dialogue is what this forum is supposed to be about.

 

Thing is,  I was sort of looking for an explanation of why the rule was
adopted that waters were to be renumbered from N to C terminus. If this is
not functioning to put the waters in register across a set of related
structures then it seems somewhat arbitrary. And other schemes might be
suggested to be better for the usability and interpretation of the
structural data. 

 

In some cases there are only a few waters but in many structures the wwPDB
partners renumber hundreds. And this process makes it difficult for authors
to check the final deposited structure against the output of their
refinement. 

 

I have to say that I agree with other contributors to this thread. It would
be much better to let the refinement program authors agree on a default
water numbering scheme. And then maintain that through deposition. 

 

I thought of six possible schemes before breakfast... one of my favourites
was to order by B-factor - which might appeal to crystallographers. Another
was to give priority to those in the coordination sphere of any metal ions -
these actually get priority in the PDB as they are included in the LINKS
records above the coordinates. These coordinated waters are often refined
together with the metals and so it would make sense to move them closer to
their friendly ion.

 

And of course one other clearly suitable option would be to leave the waters
in the authors' preferred order - chosen with help from their refinement
suite. This is what happens during deposition with the residues of the
polymers - (provided the authors chainids are suitably chosen). Following
your link the rule for polymers is that: 'If the coordinate residue numbers,
as provided by the author, are unique and sequential within a particular
chain ID, the residues will not be renumbered.' 

 

I'm presuming that if the authors have a preferred suitable set of water
numbers then that would be maintained similarly?

 

Perhaps that is what is happening in the cases I notice that do not follow
wwPDB rules?

 

On Friday I was looking at TIRAP structures and in 3ub2 the protein
construct starts at residue 78 and its final residue is 221 - but the
associated DTT is labelled back at residue 1 in the same chain. Then the
first ten out of eleven waters are residues 2 to 11 but then oddly the
eleventh water is residue 222. Is there a difference in this C-terminal
water compared with the N-terminal ones? I imagined it was perhaps
maintained to fit in with the associated publication - or maybe started out
life modelled as a metal ion - unfortunately I can find no mention of it in
the paper. 

 

But, regardless of this distracting feature,  surely this entry does not
conform to the expected numbering scheme you mentioned as the wwPDB
standard: polymer - heterogen - water? 

 

Yours perplexedly

 Martyn 

  _  


From: Rachel Kramer Green kra...@rcsb.rutgers.edu
To: CCP4BB@JISCMAIL.AC.UK 
Sent: Friday, 1 November 2013, 20:18
Subject: Re: [ccp4bb] Comparison of Water Positions across PDBs

 

In PDB format files, each polymer is assigned a unique chain ID. Chain IDs
for all bound moieties and waters are assigned based on their proximity
(number of contacts) to the nearest polymer. Once the polymers and
non-polymer residues associated with them are assigned chain IDs, they are
also assigned unique residue numbering with the order polymer residues,
ligands and then waters. 

Please see: http://www.wwpdb.org/procedure.html#toc_4

The wwPDB has established this rule to improve the usability and
interpretation of the structural data. Assigning the same chain ID for all
moieties associated with a polymer enables rapid and uniform identification
of feature analysis.

Sincerely,
Rachel Green

  _  

Rachel Kramer Green, Ph.D.

RCSB PDB

kra...@rcsb.rutgers.edu

 

 

Twitter: https://twitter.com/#!/buildmodels

Facebook: http://www.facebook.com/RCSBPDB

 

On 10/30/2013 8:09 AM, Eugene Krissinel wrote:

This is to be answered by PDB people, who definitely read BB :)
 
Would be nice to have a tool common between CCP4/Phenix and the PDB which
sorts this out

Re: [ccp4bb] Comparison of Water Positions across PDBs

2013-11-04 Thread MARTYN SYMMONS
Thanks Bernhard 
  you have helpfully distinguished between the two processes - there is 
certainly a movement of waters to symmetry replacements closer to a chain - and 
that gets documented in Remark 525 of the PDB file returned to authors - 
although then it is stripped out, I think, before the entry is released.

 But generally a renumbering is applied to all the waters - these are not 
actually moved but are re-ordered. And of course the number count of them may 
change - in order to accommodate any waters that are swapped in or out of the 
chain during the symmetry operation.

 Currently I don't think authors are given access to an audit of what is 
happening - they can of course check their favourite waters by a 
superimposition. Still have to say best way to avoid errors would be to check 
symmetry and re-order at the end of refinement, pre-deposition. 

All the best 
  Martyn 



 From: Bernhard Rupp hofkristall...@gmail.com
To: CCP4BB@JISCMAIL.AC.UK 
Sent: Monday, 4 November 2013, 14:50
Subject: Re: [ccp4bb] Comparison of Water Positions across PDBs
 


As far as confusion as a result of PDB renumbering is concerned: It was useful 
to run the old REM525 standalone program (I think I got it from PDBe/Kim 
Hendrick) at the end of solvent building. It does what the PDB did with water 
renumbering when creating the REMARK 525 (probably based on ccp4 contact with 
additions). Is there an updated standalone PDB tool available one can use these 
days to avoid at least that issue? 
 
Thx, BR
 
From:CCP4 bulletin board [mailto:CCP4BB@JISCMAIL.AC.UK] On Behalf Of MARTYN 
SYMMONS
Sent: Montag, 4. November 2013 15:17
To: CCP4BB@JISCMAIL.AC.UK
Subject: Re: [ccp4bb] Comparison of Water Positions across PDBs
 
Thank you for that, Rachel
 
Even though the tone of your comment does not suggest that you want to carry on 
a dialogue about this, I thought I would reply in any case - since dialogue is 
what this forum is supposed to be about.
 
Thing is,  I was sort of looking for an explanation of why the rule was adopted 
that waters were to be renumbered from N to C terminus. If this is not 
functioning to put the waters in register across a set of related structures 
then it seems somewhat arbitrary. And other schemes might be suggested to be 
better for the usability and interpretation of the structural data. 
 
In some cases there are only a few waters but in many structures the wwPDB 
partners renumber hundreds. And this process makes it difficult for authors to 
check the final deposited structure against the output of their refinement. 
 
I have to say that I agree with other contributors to this thread. It would be 
much better to let the refinement program authors agree on a default water 
numbering scheme. And then maintain that through deposition. 
 
I thought of six possible schemes before breakfast... one of my favourites was 
to order by B-factor - which might appeal to crystallographers. Another was to 
give priority to those in the coordination sphere of any metal ions - these 
actually get priority in the PDB as they are included in the LINKS records 
above the coordinates. These coordinated waters are often refined together with 
the metals and so it would make sense to move them closer to their friendly 
ion.    
 
And of course one other clearly suitable option would be to leave the waters in 
the authors' preferred order - chosen with help from their refinement suite. 
This is what happens during deposition with the residues of the polymers - 
(provided the authors chainids are suitably chosen). Following your link the 
rule for polymers is that: 'If the coordinate residue numbers, as provided by 
the author, are unique and sequential within a particular chain ID, the 
residues will not be renumbered.' 
 
I'm presuming that if the authors have a preferred suitable set of water 
numbers then that would be maintained similarly?
 
Perhaps that is what is happening in the cases I notice that do not follow 
wwPDB rules?
 
On Friday I was looking at TIRAP structures and in 3ub2 the protein construct 
starts at residue 78 and its final residue is 221 - but the associated DTT is 
labelled back at residue 1 in the same chain. Then the first ten out of eleven 
waters are residues 2 to 11 but then oddly the eleventh water is residue 222. 
Is there a difference in this C-terminal water compared with the N-terminal 
ones? I imagined it was perhaps maintained to fit in with the associated 
publication - or maybe started out life modelled as a metal ion - unfortunately 
I can find no mention of it in the paper. 
 
But, regardless of this distracting feature,  surely this entry does not 
conform to the expected numbering scheme you mentioned as the wwPDB standard: 
polymer - heterogen - water? 
 
Yours perplexedly
 Martyn 




From:Rachel Kramer Green kra...@rcsb.rutgers.edu
To: CCP4BB@JISCMAIL.AC.UK 
Sent: Friday, 1 November 2013, 20:18
Subject

Re: [ccp4bb] Comparison of Water Positions across PDBs

2013-11-01 Thread Rachel Kramer Green
In PDB format files, each polymer is assigned a unique chain ID. Chain 
IDs for all bound moieties and waters are assigned based on their 
proximity (number of contacts) to the nearest polymer. Once the polymers 
and non-polymer residues associated with them are assigned chain IDs, 
they are also assigned unique residue numbering with the order polymer 
residues, ligands and then waters.


Please see: http://www.wwpdb.org/procedure.html#toc_4

The wwPDB has established this rule to improve the usability and 
interpretation of the structural data. Assigning the same chain ID for 
all moieties associated with a polymer enables rapid and uniform 
identification of feature analysis.


Sincerely,
Rachel Green


Rachel Kramer Green, Ph.D.

RCSB PDB

kra...@rcsb.rutgers.edu

Twitter: https://twitter.com/#!/buildmodels

Facebook: http://www.facebook.com/RCSBPDB

On 10/30/2013 8:09 AM, Eugene Krissinel wrote:

This is to be answered by PDB people, who definitely read BB :)

Would be nice to have a tool common between CCP4/Phenix and the PDB which sorts 
this out

Eugene

On 30 Oct 2013, at 12:09, Andreas Förster wrote:


Dear all,

this water discussion is flowing increasingly towards a place where I feel a 
bit out of my depth.

What is the convention for numbering water molecules?  Is there preference for:

- putting waters into a separate chain (W for water or S for solvent)?
- splitting waters according to the peptide chains in the structure?
- appending all waters to chain A?


Thanks.


Andreas




On 30/10/2013 11:57, MARTYN SYMMONS wrote:

At deposition the PDB runs a script that renumbers authors'  waters
according to a scheme based on the residue they are nearest from N to C
terminus along each chain. This renumbering started  when waters were
assigned to macromolecular chains rather than getting a chain id of
their own.  I have failed to find the rationale explained in any PDB
documents - but it could be motivated by this sort of consideration when
waters from different chains or entries are to be compared. Having said
that I do not know if there are any cases where this approach has
successfully matched waters. ..

However an associated step which is certainly a help is that, in the
case of multiple chains, the crystal symmetry is applied to replace
waters with their symmetry equivalent position if it is closer to a
different chain.

I believe a freely available program implementing a similar approach is
WATERTIDY in CCP4 which might be a good place to start.  It gives a
pretty complete output, detailing residues actually H-bonded to the
waters, and you could parse that for further analysis and comparisons.

Best wishes.
   Martyn

--
  Andreas Förster
 Crystallization and Xray Facility Manager
   Centre for Structural Biology
  Imperial College London






Re: [ccp4bb] Comparison of Water Positions across PDBs

2013-10-31 Thread Paul Emsley

On 30/10/13 12:32, Tim Gruene wrote:

-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1

Dear Andreas,

I am not sure there is a convention, but there has been a decision.
When you deposit a structure with the PDB, they place all ligands
including water molecules to the respective nearest macromolecular
chain. I found this rather annoying the other day as I looked at the
polymerase structure and wanted to take a look at the Zn-atoms.
Instead of navigating through chain Z, where probably many
crystallographers would have put them, I had to check the PDB (text)
file for the correct chain and residue number and enter this into the
'goto'-field of coot.

If I understand some people correctly, this is not a shortcoming of
the PDB-format, but of the software developer developing the software
I used. So I am waiting for the next version of coot/mmdb with voice
recognition where I can just say Coot, please centre on the next Zn
in this structure. Or at least opening a pop-up window saying This
is you coot speaking. This structure contains 6 Zn atoms, and I feel
that you are interested in looking at their environment. Please click
'yes' to centre on the next Zn-atom I think is most suitable for you
to look at ;-)



How about Ctrl-L (het-group navigation)?

Paul.


Re: [ccp4bb] Comparison of Water Positions across PDBs

2013-10-30 Thread Joel Sussman
For detailed examination of this topic, see:

Koellner, G., Kryger, G., Millard, C. B., Silman, I., Sussman, J. L.  Steiner, 
T. (2000).
Active-site gorge and buried water molecules in crystal structures of 
acetylcholinesterase from Torpedo californica. Journal of Molecular Biology 
296, 713-735.

http://www.ncbi.nlm.nih.gov/pubmed/10669619

best regards,
Joel

On 30 Oct 2013, at 01:35, Ed Pozharski 
epozh...@umaryland.edumailto:epozh...@umaryland.edu wrote:

http://www.ccp4.ac.uk/html/watertidy.html


On 10/29/2013 04:43 PM, Elise B wrote:
Hello,

I am working on a project with several (separate) structures of the same 
protein. I would like to be able to compare the solvent molecules between the 
structures, and it would be best if the waters that exist in roughly the same 
position in each PDB share the same residue number. Basically, I want to 
compare solvent molecule coordinates and assign similar locations the same name 
in each structure.

What would be the best strategy for re-numbering the water molecules such that 
those with similar coordinates in all the structures receive the same residue 
number? I'd appreciate any suggestions.

Elise Blankenship



--
Oh, suddenly throwing a giraffe into a volcano to make water is crazy?
   Julian, King of Lemurs



Re: [ccp4bb] Comparison of Water Positions across PDBs

2013-10-30 Thread Eleanor Dodson
Well - years ago I wrote a program called watertidy to do just this -
but it asigned waters as OH0 OH1 OH2 with a according to what atom it
was H bonded too, and those names are not now permitted..

My way now is to read in a completed homologous structure - use SSM to
fit it over the new one - extract the HOHs from structure 1 - add them
to structure 2 - do some refinement and use COOT to decide which ones
are valid - the density fit picture shows wonderful red bars for
wrong'uns - then add more again using coot..
Eleanor

On 30 October 2013 06:49, Joel Sussman joel.suss...@weizmann.ac.il wrote:
 For detailed examination of this topic, see:

 Koellner, G., Kryger, G., Millard, C. B., Silman, I., Sussman, J. L. 
 Steiner, T. (2000).
 Active-site gorge and buried water molecules in crystal structures of
 acetylcholinesterase from Torpedo californica. Journal of Molecular Biology
 296, 713-735.

 http://www.ncbi.nlm.nih.gov/pubmed/10669619

 best regards,
 Joel

 On 30 Oct 2013, at 01:35, Ed Pozharski epozh...@umaryland.edu wrote:

 http://www.ccp4.ac.uk/html/watertidy.html


 On 10/29/2013 04:43 PM, Elise B wrote:

 Hello,

 I am working on a project with several (separate) structures of the same
 protein. I would like to be able to compare the solvent molecules between
 the structures, and it would be best if the waters that exist in roughly the
 same position in each PDB share the same residue number. Basically, I want
 to compare solvent molecule coordinates and assign similar locations the
 same name in each structure.

 What would be the best strategy for re-numbering the water molecules such
 that those with similar coordinates in all the structures receive the same
 residue number? I'd appreciate any suggestions.

 Elise Blankenship



 --
 Oh, suddenly throwing a giraffe into a volcano to make water is crazy?
Julian, King of Lemurs




Re: [ccp4bb] Comparison of Water Positions across PDBs

2013-10-30 Thread MARTYN SYMMONS
At deposition the PDB runs a script that renumbers authors'  waters according 
to a scheme based on the residue they are nearest from N to C terminus along 
each chain. This renumbering started  when waters were assigned to 
macromolecular chains rather than getting a chain id of their own.  I have 
failed to find the rationale explained in any PDB documents - but it could be 
motivated by this sort of consideration when waters from different chains or 
entries are to be compared. Having said that I do not know if there are any 
cases where this approach has successfully matched waters. ..


However an associated step which is certainly a help is that, in the case of 
multiple chains, the crystal symmetry is applied to replace waters with their 
symmetry equivalent position if it is closer to a different chain.

I believe a freely available program implementing a similar approach is 
WATERTIDY in CCP4 which might be a good place to start.  It gives a pretty 
complete output, detailing residues actually H-bonded to the waters, and you 
could parse that for further analysis and comparisons.

Best wishes.
  Martyn   


 From: Joel Sussman joel.suss...@weizmann.ac.il
To: CCP4BB@JISCMAIL.AC.UK 
Sent: Wednesday, 30 October 2013, 6:49
Subject: Re: [ccp4bb] Comparison of Water Positions across PDBs
 


For detailed examination of this topic, see: 


Koellner, G., Kryger, G., Millard, C. B., Silman, I., Sussman, J. L.  Steiner, 
T. (2000). 
Active-site gorge and buried water molecules in crystal structures of 
acetylcholinesterase from Torpedo californica. Journal of Molecular Biology 
296, 713-735.

http://www.ncbi.nlm.nih.gov/pubmed/10669619

best regards,
Joel

On 30 Oct 2013, at 01:35, Ed Pozharski epozh...@umaryland.edu wrote:

http://www.ccp4.ac.uk/html/watertidy.html


On 10/29/2013 04:43 PM, Elise B wrote:

Hello,

I am working on a project with several (separate) structures of the same 
protein. I would like to be able to compare the solvent molecules between the 
structures, and it would be best if the waters that exist in roughly the same 
position in each PDB share the
 same residue number. Basically, I want to compare solvent molecule coordinates 
and assign similar locations the same name in each structure.

What would be the best strategy for re-numbering the water molecules such 
that those with similar coordinates in all the structures receive the same 
residue number? I'd appreciate any suggestions.

Elise Blankenship



-- 
Oh, suddenly throwing a giraffe into a volcano to make water is crazy?
   Julian, King of Lemurs


Re: [ccp4bb] Comparison of Water Positions across PDBs

2013-10-30 Thread Andreas Förster

Dear all,

this water discussion is flowing increasingly towards a place where I 
feel a bit out of my depth.


What is the convention for numbering water molecules?  Is there 
preference for:


- putting waters into a separate chain (W for water or S for solvent)?
- splitting waters according to the peptide chains in the structure?
- appending all waters to chain A?


Thanks.


Andreas




On 30/10/2013 11:57, MARTYN SYMMONS wrote:

At deposition the PDB runs a script that renumbers authors'  waters
according to a scheme based on the residue they are nearest from N to C
terminus along each chain. This renumbering started  when waters were
assigned to macromolecular chains rather than getting a chain id of
their own.  I have failed to find the rationale explained in any PDB
documents - but it could be motivated by this sort of consideration when
waters from different chains or entries are to be compared. Having said
that I do not know if there are any cases where this approach has
successfully matched waters. ..

However an associated step which is certainly a help is that, in the
case of multiple chains, the crystal symmetry is applied to replace
waters with their symmetry equivalent position if it is closer to a
different chain.

I believe a freely available program implementing a similar approach is
WATERTIDY in CCP4 which might be a good place to start.  It gives a
pretty complete output, detailing residues actually H-bonded to the
waters, and you could parse that for further analysis and comparisons.

Best wishes.
   Martyn


--
  Andreas Förster
 Crystallization and Xray Facility Manager
   Centre for Structural Biology
  Imperial College London


Re: [ccp4bb] Comparison of Water Positions across PDBs

2013-10-30 Thread Tim Gruene
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1

Dear Andreas,

I am not sure there is a convention, but there has been a decision.
When you deposit a structure with the PDB, they place all ligands
including water molecules to the respective nearest macromolecular
chain. I found this rather annoying the other day as I looked at the
polymerase structure and wanted to take a look at the Zn-atoms.
Instead of navigating through chain Z, where probably many
crystallographers would have put them, I had to check the PDB (text)
file for the correct chain and residue number and enter this into the
'goto'-field of coot.

If I understand some people correctly, this is not a shortcoming of
the PDB-format, but of the software developer developing the software
I used. So I am waiting for the next version of coot/mmdb with voice
recognition where I can just say Coot, please centre on the next Zn
in this structure. Or at least opening a pop-up window saying This
is you coot speaking. This structure contains 6 Zn atoms, and I feel
that you are interested in looking at their environment. Please click
'yes' to centre on the next Zn-atom I think is most suitable for you
to look at ;-)

Best,
Tim

On 10/30/2013 01:09 PM, Andreas Förster wrote:
 Dear all,
 
 this water discussion is flowing increasingly towards a place where
 I feel a bit out of my depth.
 
 What is the convention for numbering water molecules?  Is there 
 preference for:
 
 - putting waters into a separate chain (W for water or S for
 solvent)? - splitting waters according to the peptide chains in the
 structure? - appending all waters to chain A?
 
 
 Thanks.
 
 
 Andreas
 
 
 
 
 On 30/10/2013 11:57, MARTYN SYMMONS wrote:
 At deposition the PDB runs a script that renumbers authors'
 waters according to a scheme based on the residue they are
 nearest from N to C terminus along each chain. This renumbering
 started  when waters were assigned to macromolecular chains
 rather than getting a chain id of their own.  I have failed to
 find the rationale explained in any PDB documents - but it could
 be motivated by this sort of consideration when waters from
 different chains or entries are to be compared. Having said that
 I do not know if there are any cases where this approach has 
 successfully matched waters. ..
 
 However an associated step which is certainly a help is that, in
 the case of multiple chains, the crystal symmetry is applied to
 replace waters with their symmetry equivalent position if it is
 closer to a different chain.
 
 I believe a freely available program implementing a similar
 approach is WATERTIDY in CCP4 which might be a good place to
 start.  It gives a pretty complete output, detailing residues
 actually H-bonded to the waters, and you could parse that for
 further analysis and comparisons.
 
 Best wishes. Martyn
 

- -- 
- --
Dr Tim Gruene
Institut fuer anorganische Chemie
Tammannstr. 4
D-37077 Goettingen

GPG Key ID = A46BEE1A

-BEGIN PGP SIGNATURE-
Version: GnuPG v1.4.15 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/

iD8DBQFScPxSUxlJ7aRr7hoRAi1TAKDrjtWhH5sCw/3kwPimT/0WVE4oWwCgovPM
4bWow+/SfAp46XnoxD5UJXE=
=YXln
-END PGP SIGNATURE-


Re: [ccp4bb] Comparison of Water Positions across PDBs

2013-10-30 Thread Ulrich Gohlke
Dear Elise,

 Try the 3Dss server (http://cluster.physics.iisc.ernet.in/3dss/). You can 
superimpose up to 20 structures and look for invariant waters.

Cheers,

 Uli

---
dr ulrich gohlke
staff scientist - macromolecular structure and interaction
max-delbrück-center for molecular medicine (mdc)

+49 30 9406 - 2725 (w)
+49 30 9406 - 2548 (fax)
ulrich.goh...@mdc-berlin.de
 
http://www.mdc-berlin.de/en/research/research_teams/macromolecular_structure_and_interaction/


Re: [ccp4bb] Comparison of Water Positions across PDBs

2013-10-30 Thread Eugene Krissinel
This is to be answered by PDB people, who definitely read BB :)

Would be nice to have a tool common between CCP4/Phenix and the PDB which sorts 
this out

Eugene

On 30 Oct 2013, at 12:09, Andreas Förster wrote:

 Dear all,
 
 this water discussion is flowing increasingly towards a place where I feel a 
 bit out of my depth.
 
 What is the convention for numbering water molecules?  Is there preference 
 for:
 
 - putting waters into a separate chain (W for water or S for solvent)?
 - splitting waters according to the peptide chains in the structure?
 - appending all waters to chain A?
 
 
 Thanks.
 
 
 Andreas
 
 
 
 
 On 30/10/2013 11:57, MARTYN SYMMONS wrote:
 At deposition the PDB runs a script that renumbers authors'  waters
 according to a scheme based on the residue they are nearest from N to C
 terminus along each chain. This renumbering started  when waters were
 assigned to macromolecular chains rather than getting a chain id of
 their own.  I have failed to find the rationale explained in any PDB
 documents - but it could be motivated by this sort of consideration when
 waters from different chains or entries are to be compared. Having said
 that I do not know if there are any cases where this approach has
 successfully matched waters. ..
 
 However an associated step which is certainly a help is that, in the
 case of multiple chains, the crystal symmetry is applied to replace
 waters with their symmetry equivalent position if it is closer to a
 different chain.
 
 I believe a freely available program implementing a similar approach is
 WATERTIDY in CCP4 which might be a good place to start.  It gives a
 pretty complete output, detailing residues actually H-bonded to the
 waters, and you could parse that for further analysis and comparisons.
 
 Best wishes.
   Martyn
 
 -- 
  Andreas Förster
 Crystallization and Xray Facility Manager
   Centre for Structural Biology
  Imperial College London


-- 
Scanned by iCritical.



Re: [ccp4bb] Comparison of Water Positions across PDBs

2013-10-30 Thread Alastair Fyfe
Elise - after looking into a related problem for some time,  I've come 
to the conclusion it's a bit harder than  it seems. Two obstacles:
(1) there is no good coordinate system for partitioning the overall 
solvent volume (fraction of unit cell not occupied by protein density)  
into suitable neighborhoods
(2) experimental support for water placement in deposited structures 
varies from sound to wishful thinking - with no single useful 
credibility metric ( a helpful review :

http://www.ncbi.nlm.nih.gov/pubmed/8081736
)

The path I've gone down for (1) is to : (a) compute the distance field 
from the molecular surface and skeletonize it. This will give you the 
outer boundary of surrounding solvent (eg if you blew up the molecular 
surfaces like balloons, the surface defined by where the balloons 
touch); (b) carve up the resulting  volume as appropriate, eg radial 
shells from the molecular surface for outer regions modeled as bulk 
solvent, atom-specific expansion of the molecular surface for regions 
occupied by modeled waters/ions.


This is quite a bit of work (or so it seemed) but it enables you to 
segment the overall solvent volume into regions of interest and then 
calculate density of ordered waters/ions across structures (bearing in 
mind that much of modeled solvent may not be real). For sorting out  
non-bonding contacts there is a very nice and fast class, MAtomNonBond, 
in Kevin Cowtan's clipper library. For the distance field and 
skeletonization/medial-axis computation you might want to look at 
fast-marching implementations.

Good luck!
Alastair
---
On 10/29/2013 04:43 PM, Elise B wrote:
Hello, I am working on a project with several (separate) structures of 
the same protein. I would like to be able to compare the solvent 
molecules between the structures, and it would be best if the waters 
that exist in roughly the same position in each PDB share the same 
residue number. Basically, I want to compare solvent molecule 
coordinates and assign similar locations the same name in each 
structure. What would be the best strategy for re-numbering the water 
molecules such that those with similar coordinates in all the structures 
receive the same residue number? I'd appreciate any suggestions. Elise 
Blankenship




[ccp4bb] Comparison of Water Positions across PDBs

2013-10-29 Thread Elise B
Hello,

I am working on a project with several (separate) structures of the same
protein. I would like to be able to compare the solvent molecules between
the structures, and it would be best if the waters that exist in roughly
the same position in each PDB share the same residue number. Basically, I
want to compare solvent molecule coordinates and assign similar locations
the same name in each structure.

 What would be the best strategy for re-numbering the water molecules such
that those with similar coordinates in all the structures receive the same
residue number? I'd appreciate any suggestions.

Elise Blankenship


Re: [ccp4bb] Comparison of Water Positions across PDBs

2013-10-29 Thread Swastik Phulera
Downloading structures of the same homologous family with blast. Then
superimpose them on a reference structure. You may then try to look at over
lapping water molecules
On 30 Oct 2013 02:23, Elise B ek...@case.edu wrote:

 Hello,

 I am working on a project with several (separate) structures of the same
 protein. I would like to be able to compare the solvent molecules between
 the structures, and it would be best if the waters that exist in roughly
 the same position in each PDB share the same residue number. Basically, I
 want to compare solvent molecule coordinates and assign similar locations
 the same name in each structure.

  What would be the best strategy for re-numbering the water molecules such
 that those with similar coordinates in all the structures receive the same
 residue number? I'd appreciate any suggestions.

 Elise Blankenship




Re: [ccp4bb] Comparison of Water Positions across PDBs

2013-10-29 Thread Dom Bellini
Hi Elise,



How about taking the homologue structure with highest number of waters and use 
it to run molecular replacement on all other datasets? Then you could keep only 
the good waters (manually unfortunately) which will ensure they will all have 
the same numbers.



Probably not the fastest way but it should give what I think you were asking 
for?



HTWorks



D






From: CCP4 bulletin board [CCP4BB@JISCMAIL.AC.UK] on behalf of Swastik Phulera 
[swastik.phul...@gmail.com]
Sent: 29 October 2013 21:16
To: ccp4bb
Subject: Re: [ccp4bb] Comparison of Water Positions across PDBs


Downloading structures of the same homologous family with blast. Then 
superimpose them on a reference structure. You may then try to look at over 
lapping water molecules

On 30 Oct 2013 02:23, Elise B ek...@case.edumailto:ek...@case.edu wrote:
Hello,

I am working on a project with several (separate) structures of the same 
protein. I would like to be able to compare the solvent molecules between the 
structures, and it would be best if the waters that exist in roughly the same 
position in each PDB share the same residue number. Basically, I want to 
compare solvent molecule coordinates and assign similar locations the same name 
in each structure.

 What would be the best strategy for re-numbering the water molecules such that 
those with similar coordinates in all the structures receive the same residue 
number? I'd appreciate any suggestions.

Elise Blankenship




-- 

This e-mail and any attachments may contain confidential, copyright and or 
privileged material, and are for the use of the intended addressee only. If you 
are not the intended addressee or an authorised recipient of the addressee 
please notify us of receipt by returning the e-mail and do not use, copy, 
retain, distribute or disclose the information in or attached to the e-mail.

Any opinions expressed within this e-mail are those of the individual and not 
necessarily of Diamond Light Source Ltd. 

Diamond Light Source Ltd. cannot guarantee that this e-mail or any attachments 
are free from viruses and we cannot accept liability for any damage which you 
may sustain as a result of software viruses which may be transmitted in or with 
the message.

Diamond Light Source Limited (company no. 4375679). Registered in England and 
Wales with its registered office at Diamond House, Harwell Science and 
Innovation Campus, Didcot, Oxfordshire, OX11 0DE, United Kingdom

 









Re: [ccp4bb] Comparison of Water Positions across PDBs

2013-10-29 Thread Miller, Mitchell D.
Hi Elise,   
Not exactly what you are asking for, but 
lsqman will compare waters between structures with a
cut-off. (However, it will not renumber the
waters as you wanted)
http://xray.bmc.uu.se/usf/lsqman_man.html#S71 

If you want to look at waters related by NCS in a 
single PDB file with multiple macromolecule chains, 
then you can use sortwater which will renumber
NCS related waters with the same residue number but
different chain identifiers. (I think watncs
also does a similar thing, but I have not used it).

Regards,
Mitch

-Original Message-
From: CCP4 bulletin board [mailto:CCP4BB@JISCMAIL.AC.UK] On Behalf Of Dom 
Bellini
Sent: Tuesday, October 29, 2013 3:34 PM
To: CCP4BB@JISCMAIL.AC.UK
Subject: Re: [ccp4bb] Comparison of Water Positions across PDBs

Hi Elise,



How about taking the homologue structure with highest number of waters and use 
it to run molecular replacement on all other datasets? Then you could keep only 
the good waters (manually unfortunately) which will ensure they will all have 
the same numbers.



Probably not the fastest way but it should give what I think you were asking 
for?



HTWorks



D






From: CCP4 bulletin board [CCP4BB@JISCMAIL.AC.UK] on behalf of Swastik Phulera 
[swastik.phul...@gmail.com]
Sent: 29 October 2013 21:16
To: ccp4bb
Subject: Re: [ccp4bb] Comparison of Water Positions across PDBs


Downloading structures of the same homologous family with blast. Then 
superimpose them on a reference structure. You may then try to look at over 
lapping water molecules

On 30 Oct 2013 02:23, Elise B ek...@case.edumailto:ek...@case.edu wrote:
Hello,  

I am working on a project with several (separate) structures of the same 
protein. I would like to be able to compare the solvent molecules between the 
structures, and it would be best if the waters that exist in roughly the same 
position in each PDB share the same residue number. Basically, I want to 
compare solvent molecule coordinates and assign similar locations the same name 
in each structure.

 What would be the best strategy for re-numbering the water molecules such that 
those with similar coordinates in all the structures receive the same residue 
number? I'd appreciate any suggestions.

Elise Blankenship




-- 

This e-mail and any attachments may contain confidential, copyright and or 
privileged material, and are for the use of the intended addressee only. If you 
are not the intended addressee or an authorised recipient of the addressee 
please notify us of receipt by returning the e-mail and do not use, copy, 
retain, distribute or disclose the information in or attached to the e-mail.

Any opinions expressed within this e-mail are those of the individual and not 
necessarily of Diamond Light Source Ltd. 

Diamond Light Source Ltd. cannot guarantee that this e-mail or any attachments 
are free from viruses and we cannot accept liability for any damage which you 
may sustain as a result of software viruses which may be transmitted in or with 
the message.

Diamond Light Source Limited (company no. 4375679). Registered in England and 
Wales with its registered office at Diamond House, Harwell Science and 
Innovation Campus, Didcot, Oxfordshire, OX11 0DE, United Kingdom

 


Re: [ccp4bb] Comparison of Water Positions across PDBs

2013-10-29 Thread Ed Pozharski

http://www.ccp4.ac.uk/html/watertidy.html


On 10/29/2013 04:43 PM, Elise B wrote:

Hello,

I am working on a project with several (separate) structures of the 
same protein. I would like to be able to compare the solvent molecules 
between the structures, and it would be best if the waters that exist 
in roughly the same position in each PDB share the same residue 
number. Basically, I want to compare solvent molecule coordinates and 
assign similar locations the same name in each structure.


 What would be the best strategy for re-numbering the water molecules 
such that those with similar coordinates in all the structures receive 
the same residue number? I'd appreciate any suggestions.


Elise Blankenship




--
Oh, suddenly throwing a giraffe into a volcano to make water is crazy?
Julian, King of Lemurs