from:"Martyn Symmons"

Re: [ccp4bb] Fw:[ccp4bb] on Cell & Symmetry in coot

2016-12-09 Thread Martyn Symmons

that's neat - I have used chimera to build filaments from fibre
diffraction where you specify the rise and angle rather than symmops.
'Sym' command specifying 'group h' then rise and angle.
Martyn

On Fri, Dec 9, 2016 at 1:34 PM, Paul Emsley <pems...@mrc-lmb.cam.ac.uk> wrote:
>
> I would use the function new_molecule_by_symop
>
> e.g.
>
> # python
> imol = new_molecule_by_symop(0, "Y,-X+Y,Z+1/6", 0,0,0)
>
> then do this 5 more times selecting the other symmetry operators that
> generate the helix around the z axis..
>
> or the attached script (adjust as necessary)
>
> Paul.
>
>
> On 09/12/16 13:06, Martyn Symmons wrote:
>>
>> Yes I would do it that way by clicking on the ones I want to see -
>> then saving them and then reading them back in. You can change the
>> chainids so Coot makes them different colours which might help.
>> Things might be easier in PyMol if you want interactive control over
>> the colours for a picture say.
>> cheers
>>   Martyn
>>
>> On Fri, Dec 9, 2016 at 12:59 PM, Smith Liu <smith_liu...@163.com> wrote:
>>>
>>> Dear All,
>>>
>>> I mean if the radius set in the Coot "Cell Symmetry" was too small, not
>>> enough monomers (less than 6) can be displayed to show the "continuous
>>> helix
>>> with a six-fold screw axis". If the radius was too large, as for the
>>> "continuous helix with a six-fold screw axis" can be regarded as a "rod",
>>> too large radius will lead to show several rods in one window. But with
>>> the
>>> Coot window, it cannot distinguish which monomer was from which rod. Thus
>>> I
>>> cannot identify 6 monomers forming the single rod, i.e., a  "continuous
>>> helix with a six-fold screw axis".
>>>
>>> Can anyone explain in this situation how can I identify the 6 monomers in
>>> the Coot "Cell and SYmmetry" windows forming a single "continuous helix
>>> with
>>> a six-fold screw axis"?
>>>
>>> Smith
>>>
>>>
>>>
>>>
>>>
>>>  Forwarding messages 
>>> From: "Smith Lee" <0459ef8548d5-dmarc-requ...@jiscmail.ac.uk>
>>> Date: 2016-12-09 18:12:20
>>> To: CCP4BB@JISCMAIL.AC.UK
>>> Subject: [ccp4bb] on Cell & Symmetry in coot
>>>Dear All,
>>>
>>> There is a pdb, once opended in coot, it was a monomer (space group   P
>>> 65 2
>>> 2 ). But in the correspondence paper, it writes, "the subunits form a
>>> continuous helix with a six-fold screw axis".
>>>
>>> I have tried to view with Coot the "six-fold screw axis" formed by 6
>>> monomers. But in the "Cell & Symmetry" in Coot, if the radius is small, 6
>>> monomers cannot be shown. If I increase the radius, more than 6 monomers
>>> would occur in the window, and it can hardly distinguish the 6 monomers
>>> forming the "six-fold screw axis".
>>>
>>> In this situation, will you please let me know how to use Coot to
>>> identify
>>> the 6 monomers forming the "six-fold screw axis"? In addition, suppose  6
>>> monomers forming the "six-fold screw axis" have been identified in Coot,
>>> in
>>> order to save the pdb of each monomer, I need to click each monomer in
>>> mouse, then by "Save symmetry coordinates" to save the pdb of each
>>> monomer,
>>> right?
>>>
>>> I am looking forward to getting your reply.
>>>
>>> Smith
>>>
>>>
>>>
>

Re: [ccp4bb] Fw:[ccp4bb] on Cell & Symmetry in coot

2016-12-09 Thread Martyn Symmons

Yes I would do it that way by clicking on the ones I want to see -
then saving them and then reading them back in. You can change the
chainids so Coot makes them different colours which might help.
Things might be easier in PyMol if you want interactive control over
the colours for a picture say.
cheers
 Martyn

On Fri, Dec 9, 2016 at 12:59 PM, Smith Liu  wrote:
> Dear All,
>
> I mean if the radius set in the Coot "Cell Symmetry" was too small, not
> enough monomers (less than 6) can be displayed to show the "continuous helix
> with a six-fold screw axis". If the radius was too large, as for the
> "continuous helix with a six-fold screw axis" can be regarded as a "rod",
> too large radius will lead to show several rods in one window. But with the
> Coot window, it cannot distinguish which monomer was from which rod. Thus I
> cannot identify 6 monomers forming the single rod, i.e., a  "continuous
> helix with a six-fold screw axis".
>
> Can anyone explain in this situation how can I identify the 6 monomers in
> the Coot "Cell and SYmmetry" windows forming a single "continuous helix with
> a six-fold screw axis"?
>
> Smith
>
>
>
>
>
>  Forwarding messages 
> From: "Smith Lee" <0459ef8548d5-dmarc-requ...@jiscmail.ac.uk>
> Date: 2016-12-09 18:12:20
> To: CCP4BB@JISCMAIL.AC.UK
> Subject: [ccp4bb] on Cell & Symmetry in coot
>   Dear All,
>
> There is a pdb, once opended in coot, it was a monomer (space group   P 65 2
> 2 ). But in the correspondence paper, it writes, "the subunits form a
> continuous helix with a six-fold screw axis".
>
> I have tried to view with Coot the "six-fold screw axis" formed by 6
> monomers. But in the "Cell & Symmetry" in Coot, if the radius is small, 6
> monomers cannot be shown. If I increase the radius, more than 6 monomers
> would occur in the window, and it can hardly distinguish the 6 monomers
> forming the "six-fold screw axis".
>
> In this situation, will you please let me know how to use Coot to identify
> the 6 monomers forming the "six-fold screw axis"? In addition, suppose  6
> monomers forming the "six-fold screw axis" have been identified in Coot, in
> order to save the pdb of each monomer, I need to click each monomer in
> mouse, then by "Save symmetry coordinates" to save the pdb of each monomer,
> right?
>
> I am looking forward to getting your reply.
>
> Smith
>
>
>

Re: [ccp4bb] on Cell & Symmetry in coot

2016-12-09 Thread Martyn Symmons

Under the coot Show symmetry option - switch it to 'Symmetry by
Molecule> Display as CA' - (click on the Symmetry by Molecule menu
item to bring up the Symmetry Controller options window).

Unlike the  'Display sphere' option, with 'Display as CA' you can
increase the Symmetry Atom Display Radius by a large number and get a
nice simple view of the packing even over several cells.

(there is also a colour by symmetry option in the Symmetry Controller window)
The clicking on a chain to save still works really well for the CA
display version of your packing (kudos to Paul Emsley for this tool).

all the best
 Martyn

Martyn Symmons
Cambridge

On Fri, Dec 9, 2016 at 10:12 AM, Smith Lee
<0459ef8548d5-dmarc-requ...@jiscmail.ac.uk> wrote:
>   Dear All,
>
> There is a pdb, once opended in coot, it was a monomer (space group   P 65 2
> 2 ). But in the correspondence paper, it writes, "the subunits form a
> continuous helix with a six-fold screw axis".
>
> I have tried to view with Coot the "six-fold screw axis" formed by 6
> monomers. But in the "Cell & Symmetry" in Coot, if the radius is small, 6
> monomers cannot be shown. If I increase the radius, more than 6 monomers
> would occur in the window, and it can hardly distinguish the 6 monomers
> forming the "six-fold screw axis".
>
> In this situation, will you please let me know how to use Coot to identify
> the 6 monomers forming the "six-fold screw axis"? In addition, suppose  6
> monomers forming the "six-fold screw axis" have been identified in Coot, in
> order to save the pdb of each monomer, I need to click each monomer in
> mouse, then by "Save symmetry coordinates" to save the pdb of each monomer,
> right?
>
> I am looking forward to getting your reply.
>
> Smith

Re: [ccp4bb] Superpose program in CCP4

2016-10-30 Thread Martyn Symmons

I think superpose does not output CAs outwith a certain cut-off based
on the quality of the superimposition. It does try to match as many as
give reasonable RMSDs - but it is mainly focussed on the residues in
matching secondary elements as this is where it starts the
superimposition.

A simpler approach is to use LSQKAB which will report all CA RMSDs in
a file if you specify that you want DELTAS. But notice that as Gert
said this will probably give you a different kind of superimposition.
A workaround would be to restrict LSQKAB to the residue ranges
SUPERPOSE used - then it will produce the same superimposition but can
report DELTAS for _all_ residues (! phew).

A non-CCP4 approach is Chimera Multi-align Viewer which calculates
RMSD without doing any superimposition. So you could run it on your
SUPERPOSE output coordinate files. I recall RMSD is a 'header' in that
tool and can be output to a file.
https://www.cgl.ucsf.edu/chimera/docs/ContributedSoftware/multalignviewer/framemav.html
(You have to read your structures into Chimera before launching
Multi-align viewer).

hope that helps.
 Martyn

Martyn Symmons
Cambridge

On Sun, Oct 30, 2016 at 10:54 AM, Anastassis Perrakis <a.perra...@nki.nl> wrote:
> Dear Wenhe,
>
> Besides this advice, have a look at the
> http://webapps.embl-hamburg.de/rapido/ server.
>
> Sometimes its goodo to re-think of what you want to do, and wonder why its
> not easily doable in software (perhaps because its not the right thing to do
> …)
>
> A.
>
>
> On 30 Oct 2016, at 11:45, Gert Vriend <gerrit.vri...@radboudumc.nl> wrote:
>
> Dear Wenhe,
>
> No 3D superpose tool will always align/map all Calphas. If in the one
> protein the loop turns left, and in the other it turns right, then mapping
> those loops is meaningless and thus not done by good software. The other
> problem is that often two proteins that get compared do not even have
> equally many residues so that there will always be some unaligned/unmapped
> Calphas left at the end. Look for some articles by Arthur M Lesk on this
> topic, he has explained protein superposition (problems) very clearly.
>
> Gert
>
> Ps, if you want proteins superposed and get different output from what the
> standard software gives you, just mail me those PDB files and I can see what
> I can do.
>
>
> On 29-10-2016 17:47, WENHE ZHONG wrote:
>
> Dear all,
>
> I always use the SUPERPOSE tool in CCP4 to superpose molecules. This time I
> want to use the RMSD values of superposed C-alpha atoms to plot a RMSD graph
> (instead of using the graph automatically made by the program). However,
> there are many atoms missing in the RMSD list.
>
> In the settings I chose “Superpose specific atoms/residues”, checked “Output
> all distances to a file”, fit “C-alpha atoms”. The superposed structures
> have exactly the same sequence.
>
> My question is: is there any way to get the completed list of RMSD value for
> each C-alpha atom? Or is there any other program for this purpose?
>
> Thank you!
>
> Kind regards,
> Wenhe
>
>

Re: [ccp4bb] [Fwd: Re: [ccp4bb] FW: New ligand 3-letter code (help-7071)]

2015-06-22 Thread Martyn Symmons

Well the problem is there is a lot more to a ligand than PDB
coordinates - little things like bond orders... In addition people can
publish ligands with atoms for which they have no density - so
zero-occupancy is allowed too. So who should get priority - the group
who publishes a ligand first, or the ones who actually have density
for all the atoms?

These sorts of complications mean we all benefit from peer-review of
the structure - that is why we put things on hold. And authors should
have a chance to change their ligand definition based on reviewers'
comments - just as they are allowed to improve the PDB coordinates. So
it is a worry for them that the PDB might 'publish' the ligand aspect
of  their work before they have completed the peer-review process.

Maybe you don't believe is peer-review - in reply to which I'd
paraphrase what people say about democracy - it's pretty bad but
better than the alternatives.

But to return to the point I made: what really is the problem with
maintaining and modifying _separate_ definitions with authors'
_separate_ deposited coordinates (and bond orders) while structures
are on hold and being reviewed? Journals manage to keep all those
submitted papers separate in their databases.

cheers
 M.

On Mon, Jun 22, 2015 at 3:12 AM, Edward A. Berry ber...@upstate.edu wrote:
   I can't imagine a journal doing that can you?  When I work on my
 supplementary material in a paper I don't expect that the journal will
 take a bit out and publish it separately to support the work of my
 competitors. Not out of spite that I was beaten - but because I don't
 want to take the responsibility for checking their science for them!


 I don't see the problem here. What about the dozens of authors who
 will benefit from using your ligand in their structure _after_ your
 structure comes out? You don't take responsibility for checking their
 science. Every author gets a copy of his final structure to check
 before it is released and each is responsible for his own.
 The only difference here is whether the competitor got to use it first,
 (which might sting a bit) or only after you had already made it your
 own with the first structure.

 I guess the ligand database is the responsibility of the pdb, but
 they depend on first depositors to help set up each ligand, so
 it is not surprising if the type model has coordinates from the
 first depositor's structure (although it would be convenient if
 they were all moved to c.o.m. at 0,0,0). When another group publishes
 a structure with the ligand, they will not be publishing the first
 depositor's coordinates because the ligand will be moved to its position
 in their structure and refined against their data, probably with
 somewhat different restraints.

 If the ligand is a top secret novel drug lead that your company is
 developing I guess it would come as a shock to find someone else has
 already deposited it, and it might be good to hasten not the
 publication but protecting of the compound with a patent!

 Although Miriam says a new 3-letter code is generated when no match is
 found,
 I believe the depositor's code will be used if it is available,
 at least one of mine was last year, so there is some use for Nigel's
 utility if you want to stamp your new compound with a rememberable name.

 eab


 On 06/21/2015 06:33 PM, Martyn Symmons wrote:

 Miri raises important points about issues in the PDB Chemical
 Component Dictionary - I think part of the problem is that this is
 published completely separately from the actual PDB - so for example I
 don't think we have an archive of the CCD for comparison alongside the
 PDB snapshots? This makes it difficult to follow the convoluted track
 of particular ligands through the PDB's many,many changes to small
 molecule definitions.

 But following discussion with other contributors offline I want to
 make it clear what is my understanding of the ZA3 (2Y2I /2Y59) case:

 I am clear there was no unethical behaviour by either group in the
 course of their work on these structures and the publication of them.

 The problem I am highlighting is that the PDB don't understand
 publishing ethics - what happened in ZA3 was that they published a
 little bit of one group's work to support the work of someone who was
 scooping them!

   I can't imagine a journal doing that can you?  When I work on my
 supplementary material in a paper I don't expect that the journal will
 take a bit out and publish it separately to support the work of my
 competitors. Not out of spite that I was beaten - but because I don't
 want to take the responsibility for checking their science for them!

 All the best
Martyn

 Cambridge

 On Sun, Jun 21, 2015 at 7:01 PM, Miri Hirshberg
 02897e8e9f0f-dmarc-requ...@jiscmail.ac.uk wrote:

 Sun., June 21st 2015

 Good evening,

 adding several general points to the thread.

 (1) Fundamentally PDB unlike other chemical databases
 insists that all equal structures should have the same 3

Re: [ccp4bb] [Fwd: Re: [ccp4bb] FW: New ligand 3-letter code (help-7071)]

2015-06-21 Thread Martyn Symmons

 the same compound, 
  i.e. same 2D structure or connectivity.  Each deposition should have its 
  own 3D coordinates.  If a different publication gets your ligand 3D 
  coordinates (2Y59 actually embodies the atomic coordinates from the 
  2Y2I), that looks to me an oversight by PDB.  It is hard to believe that 
  PDB intended to use the 3D coordinates from one entry for the other, 
  ligand or not.  In fact, the restraints as described by the ligand 
  dictionary should also be kept separate as that reflects how the authors 
  refine their ligand.
 
  Yong
 
  -Original Message-
  From: CCP4 bulletin board [mailto:CCP4BB@JISCMAIL.AC.UK] On Behalf Of 
  Martyn Symmons
  Sent: Friday, June 19, 2015 8:39 PM
  To: CCP4BB@JISCMAIL.AC.UK
  Subject: Re: [ccp4bb] FW: New ligand 3-letter code (help-7071)
 
  By oversimplifying the situation here the PDB does not answer my related 
  point about competing crystallographers:
  My scenario:
 
  Group A deposits structure with new drug - gets their three-letter code 
  for example ZA3  they then get to check the coordinates and chemical 
  definition of this ligand.
 
  But suppose a little after that a competing group B deposits their 
  structure with the same drug which they think is novel - but no...
  they get assigned the now described ZA3 which has been checked by the 
  other group.
 
   Then it is a race to see who gets to publish and release first. And if it 
  is the second group B who wins then they are publishing the work of their 
  A competitors - who have done the depositing and checking of the ligand  
  description.
 
   Sounds unlikely? Well, it actually happened in 2011 for my exact example 
  ZA3 - present in 2Y2I and in 2Y59 from competing groups.
 
   From the dates in the mmcif it was 2Y2I depositors who set up and had a 
  chance to review the description of ZA3 ligand. Only to see it released a 
  week before their crystal structure, when their ZA3 appeared to accompany 
  competing 2Y59! It is amazing that the PDB did not spot this and arrange a 
  suitable workaround.
 
  Just to check:
  mmcif for ZA3 shows it was created for 2Y2I:
  ...
  _chem_comp.pdbx_model_coordinates_db_code2Y2I
  ...
  But it was modified for release:
  ...
  _chem_comp.pdbx_modified_date2011-07-22
  ...
  corresponding to the early 2011-07-27 release date of the competing
  structure: 2Y59 even though this PDB was  _deposited_ second.
 
  The ZA3 ligand definition released with 2Y59 actually embodies the atomic 
  coordinates from the 2Y2I structure:
 
  mmcif
  ZA3 O6   O6   O 0  1 N N N 8.279  7.165  40.963 0.311  -1.061 -0.920
  O6   ZA3 1
  ZA3 C5   C5   C 0  1 N N N 9.132  8.047  40.908 0.147  -0.205 -0.073
  C5   ZA3 2  ...
  PDB 2Y2I
  HETATM 3598  O6  ZA3 A1000   8.279   7.165  40.963  1.00 41.25 
O
  HETATM 3599  C5  ZA3 A1000   9.132   8.047  40.908  1.00 63.20
C ...
 
  Surely a better approach would be to allow both groups a chance to work 
  through and sign off on independent ligand descriptions?
 
  Then whoever releases first would release both a novel structure and the 
  ligand definition _they_ deposited and checked. Their priority can then be 
  asserted and the other group contacted to ask if they agree to accept this 
  definition. This also has the advantage of better confidentiality 
  pre-publication.
 
  Another problem from any cross-linking of definitions is that say group A 
  are motivated by reviewers' reports to change the definition of ZA3 
  pre-release. Well now the change impinges on the chemical meaning of other 
  group B's deposited structure. For example ZA3 mmcif has a statement:
 
  ZA3 Modify aromatic_flag 2011-06-04 RCSB
 
  so this change was pre-release - but we cannot be sure what motivated this 
  - whether it was signed off by the 2Y2I authors or the 2Y59 authors (or 
  both?)
 
  With the accelerating pace of drug discovery for sure this sort of 
  uncertainty is going to happen again.Unless the PDB have changed their 
  practice for ligand deposition?
 
  All the best
   Martyn
 
  Cambridge.
 
  On Fri, Jun 19, 2015 at 1:49 PM, Sheriff, Steven steven.sher...@bms.com 
  wrote:
   All:
  
  
  
   Since the original query was cross-posted on both the COOT mailing
   list and the CCP4BB Rachel Green gave me permission to forward this to
   both. She provides links about the mechanism of assignment of 3-letter
   codes. In the third link below, my original suggestion to the COOT
   mailing list that one could just use UNK is incorrect as that is 
   reserved for unknown amino acids.
   According to this document, I should have suggested UNL for an unknown
   ligand.
  
  
  
   Steven
  
  
  
   From: Rachel Kramer Green [mailto:kra...@rcsb.rutgers.edu]
   Sent: Tuesday, June 16, 2015 10:21 AM
   To: Sheriff, Steven
   Cc: info
   Subject: Re: New ligand 3-letter code (help-7071)
  
  
  
   Dear Steven,
  
   During annotation of ligands, all chemical

Re: [ccp4bb] FW: New ligand 3-letter code (help-7071)

2015-06-19 Thread Martyn Symmons

By oversimplifying the situation here the PDB does not answer my
related point about competing crystallographers:
My scenario:

Group A deposits structure with new drug - gets their three-letter
code for example ZA3
 they then get to check the coordinates and chemical definition of this ligand.

But suppose a little after that a competing group B deposits their
structure with the same drug which they think is novel - but no...
they get assigned the now described ZA3 which has been checked by the
other group.

 Then it is a race to see who gets to publish and release first. And
if it is the second group B who wins then they are publishing the work
of their A competitors - who have done the depositing and checking of
the ligand  description.

 Sounds unlikely? Well, it actually happened in 2011 for my exact
example ZA3 - present in 2Y2I and in 2Y59 from competing groups.

 From the dates in the mmcif it was 2Y2I depositors who set up and had
a chance to review the description of ZA3 ligand. Only to see it
released a week before their crystal structure, when their ZA3
appeared to accompany competing 2Y59! It is amazing that the PDB did
not spot this and arrange a suitable workaround.

Just to check:
mmcif for ZA3 shows it was created for 2Y2I:
...
_chem_comp.pdbx_model_coordinates_db_code2Y2I
...
But it was modified for release:
...
_chem_comp.pdbx_modified_date2011-07-22
...
corresponding to the early 2011-07-27 release date of the competing
structure: 2Y59 even though this PDB was  _deposited_ second.

The ZA3 ligand definition released with 2Y59 actually embodies the
atomic coordinates from the 2Y2I structure:

mmcif
ZA3 O6   O6   O 0  1 N N N 8.279  7.165  40.963 0.311  -1.061 -0.920
O6   ZA3 1
ZA3 C5   C5   C 0  1 N N N 9.132  8.047  40.908 0.147  -0.205 -0.073
C5   ZA3 2  ...
PDB 2Y2I
HETATM 3598  O6  ZA3 A1000   8.279   7.165  40.963  1.00 41.25   O
HETATM 3599  C5  ZA3 A1000   9.132   8.047  40.908  1.00 63.20
  C ...

Surely a better approach would be to allow both groups a chance to
work through and sign off on independent ligand descriptions?

Then whoever releases first would release both a novel structure and
the ligand definition _they_ deposited and checked. Their priority can
then be asserted and the other group contacted to ask if they agree to
accept this definition. This also has the advantage of better
confidentiality pre-publication.

Another problem from any cross-linking of definitions is that say
group A are motivated by reviewers' reports to change the definition
of ZA3 pre-release. Well now the change impinges on the chemical
meaning of other group B's deposited structure. For example ZA3 mmcif
has a statement:

ZA3 Modify aromatic_flag 2011-06-04 RCSB

so this change was pre-release - but we cannot be sure what motivated
this - whether it was signed off by the 2Y2I authors or the 2Y59
authors (or both?)

With the accelerating pace of drug discovery for sure this sort of
uncertainty is going to happen again.Unless the PDB have changed their
practice for ligand deposition?

All the best
 Martyn

Cambridge.

On Fri, Jun 19, 2015 at 1:49 PM, Sheriff, Steven steven.sher...@bms.com wrote:
 All:



 Since the original query was cross-posted on both the COOT mailing list and
 the CCP4BB Rachel Green gave me permission to forward this to both. She
 provides links about the mechanism of assignment of 3-letter codes. In the
 third link below, my original suggestion to the COOT mailing list that one
 could just use UNK is incorrect as that is reserved for unknown amino acids.
 According to this document, I should have suggested UNL for an unknown
 ligand.



 Steven



 From: Rachel Kramer Green [mailto:kra...@rcsb.rutgers.edu]
 Sent: Tuesday, June 16, 2015 10:21 AM
 To: Sheriff, Steven
 Cc: info
 Subject: Re: New ligand 3-letter code (help-7071)



 Dear Steven,

 During annotation of ligands, all chemical components present in the
 structure are compared against the definitions in the Chemical Component
 Dictionary (http://www.wwpdb.org/data/ccd). If the ligand is not in the
 dictionary, a three letter code is assigned. See
 http://www.wwpdb.org/documentation/policy#toc_assignment.  In the future, a
 group of three-letter codes may be set aside to be used during refinement to
 flag new ligands.

 Clarification about the ligand ids assignment and in particular the usage of
 UNX/UNL/UNK residues can be found at
 http://www.wwpdb.org/documentation/procedure#toc_2.

 Best wishes,
 Rachel



 

 Rachel Kramer Green, Ph.D.

 RCSB PDB

 kra...@rcsb.rutgers.edu



 New! Deposit X-ray data with the wwPDB at:

 http://deposit.wwpdb.org/deposition (NMR and 3DEM coming soon).

 ___

 Twitter: https://twitter.com/#!/buildmodels

 Facebook: http://www.facebook.com/RCSBPDB







 On 6/5/2015 7:50 AM, Sheriff, Steven wrote:

 All:



 Why the concern for unassigned

Re: [ccp4bb] New ligand 3-letter code

2015-06-14 Thread Martyn Symmons

One thing to beware of is that of course un-released structures may have
the code that you choose - structures can be on hold for over a year (the
rules say a year but in practice release is often slower) and any new
ligands and codes with them. Ligand codes are assigned at deposition and
the description stored privately in a wwPDB database I believe. So your
ligand code may be already there but not visible until it is copied into
the public Chemical Component Dictionary at the time of release.

Conversely your 'new' ligand _structure_ may be in an un-released entry
deposited by some other crystallographer and already assigned a 3-letter
code. So your choice will be overwritten at deposition in any case.

This produces a problem when multiple competing groups deposit structures
containing the same novel ligand. As the database will not expect
duplicates, only the first depositor will get assigned a new 3-letter code.

However then there should be alarm bells for the second depositor if their
'novel' ligand turns out to have a pre-existing three letter code! I
suspect there must have been cases where this happened but was not spotted
by the depositors. But if it happens to you then maybe it's time to speed
up publication...

I don't know if it has happened that one crystallographer has 'hastened'
the release of the ligand description of another worker by publishing it in
complex with a solved macromolecule. Perhaps the clever people at the PDB
do a switch at the last minute?

Best wishes,
 Martyn

Martyn Symmons
Cambridge






On Mon, Jun 8, 2015 at 3:12 AM, Lau Sze Yi (SIgN) 
lau_sze...@immunol.a-star.edu.sg wrote:

 Thanks Eleanor for relaying my question from COOT and everyone that
 follows up with the thread.

 So this new ligand code search feature will be available in Phenix Gui?
 Look forward to it.

 Thanks,
 Sze Yi




 From: Nigel Moriarty nwmoria...@lbl.gov
 Reply-To: Nigel Moriarty nwmoria...@lbl.gov
 Date: Sunday, 7 June 2015 11:18 pm
 To: CCP4BB@JISCMAIL.AC.UK CCP4BB@JISCMAIL.AC.UK
 Subject: Re: [ccp4bb] New ligand 3-letter code

 Ed

 Thanks for the feature request. I will be available in the next nightly
 build.

 nigel% elbow.get_new_ligand_code A?3

 Unique ligand code : AV3

 Cheers

 Nigel

 ---
 Nigel W. Moriarty
 Building 64R0246B, Physical Biosciences Division
 Lawrence Berkeley National Laboratory
 Berkeley, CA 94720-8235
 Phone : 510-486-5709 Email : nwmoria...@lbl.gov
 Fax   : 510-486-5909   Web  : CCI.LBL.gov

 On Sat, Jun 6, 2015 at 4:20 PM, Edward A. Berry ber...@upstate.edu
 wrote:

 Neat! It's true the PDB will choose a unique code for you,
 but they will use what you supply if it is already unique,
 and it is nice to be able to choose something that can help
 you remember what it stands for.
 Gone are the days when we could choose meaningfull pdb ID's
 (like 1PRC, 2PRC, 3PRC etc are all photosynthetic reaction center),
 but there is still some freedom in choosing the ligand ID's.
 They are filling up fast, though!

 locust 123% elbow.get_new_ligand_code AC
 Unique ligand code : AC3

 locust 129% elbow.get_new_ligand_code A?3
 Unique ligand code : A?3

 eab

 On 06/06/2015 12:13 PM, Nigel Moriarty wrote:

 I wrote something a while ago the finds an unused code.

 % elbow.get_new_ligand_code


 Unique ligand code : 7V8

 % elbow.get_new_ligand_code A


 Unique ligand code : A6E


 Cheers

 Nigel

 ---
 Nigel W. Moriarty
 Building 64R0246B, Physical Biosciences Division
 Lawrence Berkeley National Laboratory
 Berkeley, CA 94720-8235
 Phone : 510-486-5709 Email : nwmoria...@lbl.gov
 Fax   : 510-486-5909   Web  : CCI.LBL.gov http://CCI.LBL.gov

 On Fri, Jun 5, 2015 at 6:59 AM, Eleanor Dodson 
 eleanor.dod...@york.ac.uk mailto:eleanor.dod...@york.ac.uk wrote:


 I use any 3 letter/number code that i want. If you read the
 corresponding cif file into coot it is used in preference to any in the
 library. The PDB deposition team will assign a code if it is a new ligand
 to the database. Could you relay this to original poster?

 Thanks

 Jim Brannigan


 On 5 June 2015 at 14:58, Eleanor Dodson eleanor.dod...@york.ac.uk
 mailto:eleanor.dod...@york.ac.uk wrote:

 OK - thank you.
 How are things?
 E


 -- Forwarded message --
 From: *Jim Brannigan* jim.branni...@york.ac.uk mailto:
 jim.branni...@york.ac.uk
 Date: 5 June 2015 at 14:39
 Subject: Re: New ligand 3-letter code
 To: Eleanor Dodson eleanor.dod...@york.ac.uk mailto:
 eleanor.dod...@york.ac.uk


 Hi Eleanor

 I use any 3 letter/number code that i want. If you read the
 corresponding cif file into coot it is used in preference to any in the
 library. The PDB deposition team will assign a code if it is a new ligand
 to the database. Could you relay this to original poster?

 Thanks

 Jim Brannigan

 On 5 June 2015 at 11:28, Eleanor Dodson 
 eleanor.dod...@york.ac.uk

Re: [ccp4bb] Antw: Re: [ccp4bb] Offtopic: Software to closely visualize interacting partnets in protein complex

2015-03-19 Thread MARTYN SYMMONS

As this is a ccp4 bb we should mention the jsPISA server which is at 
http://www.ccp4.ac.uk/pisa/
This is a very nice implementation of the PISA approach.
Another approach is to use the Chimera clash finding approach 
http://www.cgl.ucsf.edu/chimera/docs/ContributedSoftware/findclash/findclash.html
But this defaults to atom to atom contacts which means you have to prune it 
down with a script for picking residues. Also does not know about H-bond 
angles
Cheers
Martyn
Cambridge
Original message
From : bar...@fmp-berlin.de
Date : 18/03/2015 - 14:18 (GMTST)
To : CCP4BB@JISCMAIL.AC.UK
Subject : [ccp4bb] Antw: Re: [ccp4bb] Offtopic: Software to closely visualize 
interacting partnets in protein complex
Hi,
I just checked the PIC sever tim suggested. very nice indeed. If 
you only want to map different interfaces and the amino acids involved 
in, I suggest to run the pisa server, too. http://www.ebi.ac.uk/pdbe/pisa/ . I 
used it extensively to find out whether a certain set of crystal contacs leads 
to a certain crystal packing. 
best, matthias
 Tim tim.schu...@rub.de 11.03.15 18.25 Uhr 
Hi,
Molprobity is also a nice software to do this kind of analysis - if you 
use it as implemented in phenix you also get good visualization in coot.
I also asked the pymol community to create an implementation for pymol, 
but I did not follow if somebody took that up.
Also this 'protein interactions calculation' server is very neat:
http://pic.mbu.iisc.ernet.in/
/Tim
On 10/03/15 11:25, Debasish Kumar Ghosh wrote:
 Dear All,

 Apologies for this little off-topic inquiry. I want to closely visualize the 
 interacting residues in an multimeric protein complex to understand the 
 nature of interactions. Is there any good software to give this information 
 with good clarity.
 Any suggestion is highly appreciated.

 Thanks,
 Best !!!

 Debasish Kumar Ghosh

 CSIR- Junior Research Fellow (PhD Scholar)
 C/o: Dr. Akash Ranjan
 Computational and Functional Genomics Group
 Centre for DNA Fingerprinting and Diagnostics
 Hyderabad, INDIA

 Email(s): dkgh...@cdfd.org.in, dgho...@gmail.com
 Telephone: 0091-9088334375 (M), 0091-40-24749396 (Lab)
 Lab URL: 
 http://www.cdfd.org.in/labpages/computational_functional_genomics.html

[ccp4bb] [ccp4] NAGging problem?

2014-12-05 Thread MARTYN SYMMONS

For my sins I am making figures of TLR structures. These are dimeric in 
activated forms and glycosylated. So I sort of assumed that the sugars would 
inherit the chain id of the polypeptide they are attached to. But then in 2Z7X 
(TLR1/2 heterodimer) when I coloured the gycolysation to fit with the chains it 
did not look right. Looking at the LINK record again I found the following:
LINK ND2 ASN A 199 C1  NDG A 901 1555   1555  1.45  
LINK ND2 ASN A 414 C1  NAG A 911 1555   1555  1.45  
LINK ND2 ASN A 442 C1  NAG A 921 1555   1555  1.45  
LINK ND2 ASN B  51 C1  NAG A 801 1555   1555  1.46  
LINK ND2 ASN B 163 C1  NAG B 811 1555   1555  1.45  
LINK ND2 ASN B 330 C1  NAG A 821 1555   1555  1.46  
LINK ND2 ASN B 429 C1  NAG A 831 1555   1555  1.45  
So only _one_ of the NAGs bound to chain B ends up getting a B chain id! I'm 
guessing this is because in the dimers the bulk of the sugar ends up closer to 
chain A rather than the chain B which it is attached to. 
And according the the RCSB chain IDs for all bound moieties and waters are 
assigned based on their proximity (number of contacts) to the nearest 
polymer.(http://deposit.rcsb.org/depoinfo/download/dp.pdf section 4.4.1 Chain 
ID assignment ). So is this why these chains get switched? If so, I have to say 
that is crazy - surely the covalent bonds, detected when preparing the LINK 
description, should always take precedence? 
Maybe more reasonable treatment has been applied in some cases - but I would've 
hoped that there would be some consistency - and so maybe the rules need 
re-writing to remove these anomalies. 
I was only making a figure - but for sure someone will be trying to use the 
data for something more useful and quantitative. This sort of stuff will make 
their analysis difficult.
All the best
  Martyn 
Martyn Symmons
Cambridge

Re: [ccp4bb] Command line tool for RMSD calculation

2014-10-16 Thread MARTYN SYMMONS

Mentioned recently but worth mentioning again?
I have used Chimera rmsd from its own command line (find the command line under 
the 'favorites' tab) - it is scriptable.
https://www.cgl.ucsf.edu/chimera/docs/UsersGuide/midas/rmsd.html
Must have equal number of atoms matched - use selection to make that true - for 
example restricting to Calpha atoms. 
Cheers - Martyn 
Cambridge
Original message
From : c.z...@yale.edu
Date : 15/10/2014 - 21:05 (GMTDT)
To : CCP4BB@JISCMAIL.AC.UK
Subject : [ccp4bb] Command line tool for RMSD calculation
Dear all,
I am just wondering whether there is a simple command line tool for RMSD 
calculation between two aligned structures? I just cannot find a good answer 
online, and ccp4bb can always impress me.
Thank you so much in advance,
Chen

Re: [ccp4bb] Coot - way to move the center pointer to a specific x,y,z coordinate?

2014-09-09 Thread MARTYN SYMMONS

I usually add a fiducial waters to the pdb file (chain Z - remember those?) - 
or otherwise heavy atoms which will come up in a distinctive colour usually... 
Then 'go to atom' under draw menu?
Martyn
Cambridge
Original message
From : alejandro.virru...@yale.edu
Date : 09/09/2014 - 21:17 (GMTDT)
To : CCP4BB@JISCMAIL.AC.UK
Subject : [ccp4bb] Coot - way to move the center pointer to a specific x,y,z 
coordinate?
Does anyone knowhow to move the center pointer to a specific x,y,z coordinate? 
Or to place some kind of marker at a specific x,y,z coordinate?
Thanks,
Alex

Re: [ccp4bb] correlated alternate confs - validation?

2014-07-23 Thread MARTYN SYMMONS

I think there has been historically some glitch in the deposition of some 
refmac altloc waters 
So for example  4a9x from Buster is annotated correctly
REMARK   3   PROGRAM : BUSTER 2.11.2   
HETATM 9407  O   HOH A2517 -22.429   1.144 -35.218  1.00 29.73   O  
HETATM 9408  O  AHOH A2518 -18.933  -9.507 -28.753  0.50  3.00   O  
HETATM 9409  O  BHOH A2518 -20.467  -9.093 -29.900  0.50 16.03   O  
so the A and B altloc retained.
But 
for example in 4ac7 
REMARK   3   PROGRAM : REFMAC 5.6.0117 
where these two waters look like they originally were deposited as altloc waters
HETATM 6205  O   HOH A2043  -3.612  86.934 100.843  0.50 25.12   O  
ANISOU 6205  O   HOH A2043 2630   2166   4747   -153 68   1496   O  
HETATM 6206  O   HOH A2044  -3.506  85.122 100.190  0.50 22.20   O  
ANISOU 6206  O   HOH A2044 3710   2476   2248862188 16   O  
and these two
HETATM 6328  O   HOH B2073  -1.182 117.989  73.544  0.50 15.63   O  
ANISOU 6328  O   HOH B2073 2509   1684   1745408173289   O  
HETATM 6329  O   HOH B2074  -0.561 117.641  72.161  0.50 14.37   O  
ANISOU 6329  O   HOH B2074 2023   1187   2247107123   -211   O  

These waters also end up being annotated as close contacts in Remark 500 - 
which would not have happened if they had retained their altlocs.
REMARK 500 THE FOLLOWING ATOMS ARE IN CLOSE CONTACT.
...   
REMARK 500   OHOH A  2043 OHOH A  2044  1.93
REMARK 500   OHOH B  2073 OHOH B  2074  1.56

Probably other examples can be retrieved but this latter file dates from my 
time at the PDBe and so may have been mangled at my hands.

In my defence I think it is owing to the annotation step when the waters are 
renumbered to fit with the associated macromolecule N to C (for proteins).  I 
think the script that was originally developed for this was not able to deal 
with waters with the same number but differing altlocs. Also alternate 
positions in this step can be associated with different residues by reason of 
being closer to one or other bit of chain. In this case they can have very 
different numbers - or chains.

I think this can be handled better in the new mmCif-based annotation system 
with the cooperation of refinement program authors. 
all the best
Martyn 

Martyn Symmons
Cambridge


Original message
From : f...@bernstein-plus-sons.com
Date : 23/07/2014 - 16:20 (GMTDT)
To : CCP4BB@JISCMAIL.AC.UK
Subject : Re: [ccp4bb] correlated alternate confs - validation?

I agree that it would be excellent to be able to associate
alternate conformations (beyond the individual residue) but
when we defined the PDB format we had an 80-column limitation
per atom and so only one column was allowed for alternate
conformations.  In an ASCII world only 36 characters were available
to define alternate conformations.  This is inadequate to allow
for many residues with independent alternate conformations -
one residue with three conformations would use up 3 of the 36
characters.  Thus there was no way to say that alternate
conformation A in one residue is or is not associated with
alternate conformation A in another residue.

  Frances

=
Bernstein + Sons
*   *   Information Systems Consultants
5 Brewster Lane, Bellport, NY 11713-2803
*   * ***
 *Frances C. Bernstein
   *   ***  f...@bernstein-plus-sons.com
  *** *
   *   *** 1-631-286-1339FAX: 1-631-286-1999
=

On Wed, 23 Jul 2014, Tim Gruene wrote:

 -BEGIN PGP SIGNED MESSAGE-
 Hash: SHA1

 Hi Bernhard,

 That's right, the definition is not in the PDB, but REFMAC sensibly
 handles it this way. It is certainly a short-coming if not bug in the
 PDB (definition).

 Best,
 Tim

 On 07/23/2014 04:49 PM, Bernhard Rupp wrote:
 ?? Refmac knows because of the group definition, otherwise I cannot
 force grouped occupancy refinement. There is no definition in PDB
 that eg ALTLOC A (of whatever residue) belongs to ALTLOC A of
 (whatever) residue/water.

 BR

 -Original Message- From: Tim Gruene
 [mailto:t...@shelx.uni-ac.gwdg.de] Sent: Mittwoch, 23. Juli 2014
 15:38 To: b...@hofkristallamt.org; CCP4BB@JISCMAIL.AC.UK Subject: Re:
 [ccp4bb] correlated alternate confs - validation?

 Hi Bernhard,

 do you refer to the PDB who, according to Martyn, remove the altloc
 indicator? That's certainly a serious bug that should be fixed as
 quickly as possible.

 Within refmac the preservation is fine and as you would expect it
 to be: altA only sees atoms in altA and those witout altLoc, etc,
 which makes sure you PDB file is interpreted correctly by refmac5.
 That's how I refmac works.

 Best, Tim

 On 07/23/2014 03:26 PM, Bernhard Rupp wrote:
 I would

Re: [ccp4bb] EDSTATS for an extracted fragment

2014-06-12 Thread MARTYN SYMMONS

Is n't the danger here that you are just fitting to noise generated by the rest 
of the Unit Cell that you are ignoring. So it is easy to do a 'good bit' of 
modelling in a specific region. Just like some tennis players have very good 
serves but do not win matches. 

 From: George Devaniranjan devaniran...@gmail.com
To: CCP4BB@JISCMAIL.AC.UK 
Sent: Wednesday, 11 June 2014, 19:22
Subject: Re: [ccp4bb] EDSTATS for an extracted fragment

Thank you Gerard for your suggestion.

I will look into the paper you suggested and MAPMAN too.

I am idealizing fragments of PDB and while this would not agree when compared 
with the  well  refined structure, my goal really is seeing if these ideal 
fragments can still be identifiable when I look at the electron density and how 
to quantify that.
(Ideal here refers to a small number of Phi and Psi values I picked as 
representative of different regions in the Ramachandran map).

Isn't the danger here that you are can't tell how much improvement is just 
fitting to noise from improvements that should be made in the rest of the 
model? 

Unless you go on and do a final full refinement you can't know if your 
modelling is warranted by a global improvement in fit.

It could be the case that you are improving globally but I don't know how you 
can tell that by fitting to just a bit of the map. 

We have all had that experience of rebuilding a residue so it looks 'right' to 
us locally but then the refinement program knows better after making a global 
assessment and puts it back. 

If some way to give local density 'error bars' that might be helpful so that a 
maximum likelihood approach could be used during real space fitting. Is that 
similar to what those useful LLG maps give in heavy atom model refinement?

regards
Martyn

Cambridge 

I picked on EDSTATS to explore since it gave individual residue information but 
realize now that since I am dealing with a small sub-set of residues an 
alternative is required.

On Wed, Jun 11, 2014 at 2:01 PM, Gerard DVD Kleywegt ger...@xray.bmc.uu.se 
wrote:

What you want is a test for how well each model agrees with its own map. It is 
fair to argue that the model that is more self-consistent (agrees better with 
its own map) is the better model.  But you won't learn that by comparing model 
A to map B.

However, conversely, if your modified model fits the original map better than 
the model that was used to calculate the map itself, you've done a good bit of 
model building. If you want to do this calculation (with all the warnings and 
caveats), you can also use MAPMAN - 
http://xray.bmc.uu.se/usf/mapman_man.html#S41 . The method you propose is 
essentially the same as this one: http://www.ncbi.nlm.nih.gov/pubmed/18598022 
but for a fragment of your macromolecule instead of for a ligand (if you don't 
have access to the journal, you can request a reprint here: 
http://xray.bmc.uu.se/cgi-bin/gerard/reprint_mailer.pl?pref=87 )

--Gerard

**
                           Gerard J. Kleywegt

      http://xray.bmc.uu.se/gerard   mailto:ger...@xray.bmc.uu.se
**
   The opinions in this message are fictional.  Any similarity
   to actual opinions, living or dead, is purely coincidental.
**
   Little known gastromathematical curiosity: let z be the
   radius and a the thickness of a pizza. Then the volume
            of that pizza is equal to pi*z*z*a !
**

Re: [ccp4bb] experimental phasing at low resolution

2014-06-03 Thread MARTYN SYMMONS

Don't forget that your SeMet has real differences from your native - the S to 
Se brings in 18 extra electrons (less obviously if there is lower 
incorporation). But that mean if your native is isomorphous enough (you didn't 
say) then you possibly have 'ordinary' isomorphous differences to work with. 
You could then work in the anomalous as a sort of SIRAS solution.  
SeMet as isomorphous has been used previously in many structure solutions.
All the best
  Martyn

Martyn Symmons
Cambridge 



 From: vijay srivastava vijaytec...@yahoo.co.in
To: CCP4BB@JISCMAIL.AC.UK 
Sent: Tuesday, 3 June 2014, 9:37
Subject: [ccp4bb] experimental phasing at low resolution
 


Dear All,
Does anyone know of a facility
that provides amino acid analysis for determination of SeMet
incorporation in a recombinant protein?
We're looking for some advice
about how to proceed with a structure we're working on.  Our
protein is 350 amino acids (10 methionine are present) and naturally binds 
magnesium.  We
have a SeMet data set at peak that goes down to  5.4 angstroms. 
We tried to find the heavy atom position with Shelxcde program. The
log file is given as below.

Resl.   Inf - 8.0 - 6.0 - 5.6 - 5.4 -
5.2 - 5.0 - 4.8 - 4.6 - 4.4 - 4.2 - 3.98 
N(data) 529   643   24129
0 0 0 0 0 0 0 
I/sig63.5  19.5  10.6 
10.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0 
%Complete  94.1  98.9  99.2  18.0  
0.0   0.0   0.0   0.0   0.0   0.0   0.0 
d/sig   3.06  1.24  1.07 1.21 

I am also attaching the .lst and .res file and
from that we were considering that only one selenomethionine
position.We have one monomer per AU
(-unfortunately, MR is not working for this project). The space group
is P3 .  We also have a native set down to 3.4 angstroms. 
While we understand that we may need more phasing information  we're
wondering if anyone might have some other suggestions or insights
about how we can move forward given the data that we currently have. 
Can we extend the phases with our native data set. Thanks in advance
for any advice.

regards

 
Vijay

Re: [ccp4bb] Sea Urchin like crystals

2014-05-21 Thread MARTYN SYMMONS

Dear V.
  are you sure these are crystalline? - I have had similar growth from 
'spherulites' that I thought was promising. But then when I went in with a 
needle to break them for seeding, it turned out they were fibrous not 
crystalline. This made me think they were a product of denatured protein. So I 
would say poke them first to check they snap like a crystal would. Actually 
mine were birefringent in four big sectors similar to the domains in the 
underlying spherulites (which had a four sectors with opposite quarters the 
same colour - like the Burundi flag (had to look that up;) ) .  My feeling is 
fibrous protein is not a good crystallization lead.
  But maybe you have had better luck 
  Martyn 
  



 From: venkatareddy dadireddy venkatda...@gmail.com
To: CCP4BB@JISCMAIL.AC.UK 
Sent: Wednesday, 21 May 2014, 10:58
Subject: [ccp4bb] Sea Urchin like crystals
 


Dear All,

I'm working on 24.3 KDa protein (pI 4.84) with (Asp + Glu): 36 and (Arg + Lys): 
16. Protein is in buffer Tris (10mM), pH 7.5. I got sea urchin like crystal 
clusters @ protein conc 38.5 mg/ml in sitting drop method; condition 0.2M NH4I, 
20% PEG3350  pH 6.8 (PEG/ION A12). These crystals are very thin. I tried 
different pH values ranging from  6.0 -9.0 with 0.5 units increment, PEG conc. 
from 10 -30% and NH4I conc. from 0.1 -0.5 M independently in hanging drop 
method and ended up with NO Crystals. Can somebody suggest strategy to improve 
crystals. Please find the crystal pic with attachment.

Regards,
Venkat.

Re: [ccp4bb] PDB passes 100,000 structure milestone

2014-05-15 Thread MARTYN SYMMONS

I agree some forum for community annotation and commenting would be a good 
thing for users of structural data. 
There was an attempt to do that with the pdbwiki project which was a community 
resource for the bioinformatics community. Unfortunately pdbwiki has now folded 
(see http://pdbwiki.org/) They are now directing people to Proteopedia. However 
Proteopedia has a more educative focus I think - rather than capturing 
technical questions and input.

Pubmed commons (http://www.ncbi.nlm.nih.gov/pubmedcommons/), which is a forum 
for discussing the literature, is currently under testing. Perhaps this is the 
sort of thing that could work for structural data?
 
cheers
 Martyn 



 From: Ethan A Merritt merr...@u.washington.edu
To: CCP4BB@JISCMAIL.AC.UK 
Sent: Wednesday, 14 May 2014, 19:22
Subject: Re: [ccp4bb] PDB passes 100,000 structure milestone
 

On Wednesday, 14 May, 2014 13:52:02 Phil Jeffrey wrote:
 As long as it's just a Technical Comments section - an obvious concern 
 would be the signal/noise in the comments themselves.  I'm sure PDB 
 would not relish having to moderate that lot.
 
 Alternatively PDB can overtly link to papers that discuss technical 
 issues that reference the particular structure - wrong or fraudulent 
 structures are often associated with refereed publications that point 
 that out, and structures with significant errors often show up in that 
 way too.  I once did a journal club on Muller (2013) Acta Cryst 
 F69:1071-1076 and wish that could be associated with the relevant PDB 
 file(s).

Perhaps some combination of those two ideas?

The PDB could associate with each deposited structure  a crowd-sourced
list of published articles citing it.     They already make an effort to
attach the primary citation, but so far as I know there is currently
no effort to track subsequent citations.  

While spam comments in a free-format forum are probably inevitable,
spam submission of citing papers seems less likely to be a problem.

    - Ethan

  On Wed, May 14, 2014 at 12:32 PM, Zachary Wood z...@bmb.uga.edu
  mailto:z...@bmb.uga.edu wrote:
 
      Hello All,
 
      Instead of placing the additional burden of policing on the good
      people at the PDB, perhaps the entry page for each structure could
      contain a comments section. Then the community could point out
      serious concerns for the less informed users. At least that will
      give users some warning in the case of particularly worrisome
      structures. The authors of course could still reply to defend their
      structure, and it may encourage some people to even correct their
      errors.
 
-- 
Ethan A Merritt
Biomolecular Structure Center,  K-428 Health Sciences Bldg
MS 357742,   University of Washington, Seattle 98195-7742

Re: [ccp4bb] PDB passes 100,000 structure milestone

2014-05-15 Thread MARTYN SYMMONS

Dear Zachary
I once suggested this sort of discussion forum to a PDB PI as a possible 
service that could be a ' validation fight-club '  - which suggestion was not 
well received ;) But as you say it is down to setting the correct professional 
tone. 

One thing that would allow a visual discussion would be the possibility to 
share molecular representations - similar to the way Proteopedia uses Jmol 
scripting to drive the graphics from the webpages. In this way commentators 
could share viewpoint and representations to make their points. This would make 
it most useful to the wider biological community. 

all the best
 Martyn 

  



 From: Zachary Wood z...@bmb.uga.edu
To: CCP4BB@JISCMAIL.AC.UK 
Sent: Thursday, 15 May 2014, 14:47
Subject: Re: [ccp4bb] PDB passes 100,000 structure milestone
 



Adding to Tim’s comment, I would not expect a tremendous amount of spurious 
comments about a single PDB out of 100,000 unless there was a problem. 
Especially if the Pubmed Commons model was applied, and only depositors could 
comment. I would assume this would be very beneficial, given that we are 
conscientious professionals.  Could actually be a great forum for authors to go 
a little deeper into specific approaches or problems that they had with a 
structure. Not all interesting details make it to the pub.  



Best regards,

Z


***
Zachary A. Wood, Ph.D.
Associate Professor                      
Department of Biochemistry  Molecular Biology
University of Georgia
Life Sciences Building, Rm A426B
120 Green Street
Athens, GA  30602-7229
Office: 706-583-0304
Lab:    706-583-0303
FAX: 706-542-1738
***







On May 15, 2014, at 9:39 AM, Tim Gruene t...@shelx.uni-ac.gwdg.de wrote:

-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1

Dear all,

isn't the ccp4bb a very good example that spam may not be such an
issue for a discussion platform on structures in the PDB? There is a
great variety of opinions, some to agree with, some to disagree, but
all of them interesting and contributing, and I hardly remember a
message I would classify as spam. And it all works without restraints.

Best,
Tim


On 05/15/2014 03:21 PM, Zachary Wood wrote:

I agree with Martyn,

Pubmed Commons could be a great model. I believe you have to be a
published author to obtain an account. It might cut down on some of
the spam/noise if the PDB adopted such a model for depositors.

Best regards,

Z


*** Zachary A. Wood,
Ph.D. Associate Professor Department of Biochemistry  Molecular
Biology University of Georgia Life Sciences Building, Rm A426B 120
Green Street Athens, GA  30602-7229 Office: 706-583-0304 Lab:
706-583-0303 FAX: 706-542-1738 
***







On May 15, 2014, at 7:29 AM, MARTYN SYMMONS
martainn_oshioma...@btinternet.com wrote:


I agree some forum for community annotation and commenting would
be a good thing for users of structural data. There was an
attempt to do that with the pdbwiki project which was a community
resource for the bioinformatics community. Unfortunately pdbwiki
has now folded (see http://pdbwiki.org/) They are now directing
people to Proteopedia. However Proteopedia has a more educative
focus I think - rather than capturing technical questions and
input.

Pubmed commons (http://www.ncbi.nlm.nih.gov/pubmedcommons/),
which is a forum for discussing the literature, is currently
under testing. Perhaps this is the sort of thing that could work
for structural data?

cheers Martyn

From: Ethan A Merritt merr...@u.washington.edu To:
CCP4BB@JISCMAIL.AC.UK Sent: Wednesday, 14 May 2014, 19:22 
Subject: Re: [ccp4bb] PDB passes 100,000 structure milestone

On Wednesday, 14 May, 2014 13:52:02 Phil Jeffrey wrote:

As long as it's just a Technical Comments section - an obvious
concern would be the signal/noise in the comments themselves.
I'm sure PDB would not relish having to moderate that lot.

Alternatively PDB can overtly link to papers that discuss
technical issues that reference the particular structure -
wrong or fraudulent structures are often associated with
refereed publications that point that out, and structures with
significant errors often show up in that way too.  I once did a
journal club on Muller (2013) Acta Cryst F69:1071-1076 and wish
that could be associated with the relevant PDB file(s).

Perhaps some combination of those two ideas?

The PDB could associate with each deposited structure  a
crowd-sourced list of published articles citing it.    They
already make an effort to attach the primary citation, but so far
as I know there is currently no effort to track subsequent
citations.

While spam comments in a free-format forum are probably
inevitable, spam submission of citing papers seems less likely to
be a problem.

- Ethan


On Wed, May 14, 2014 at 12:32 PM, Zachary Wood
z...@bmb.uga.edu mailto:z

Re: [ccp4bb] Renumbering of water molecules during deposition

2014-04-10 Thread MARTYN SYMMONS

I had thought that after discussion on this bulletin board we could find no 
rational scientific reason why the PDB renumbers waters from the N to the C 
term of the polypeptides (and 5' to 3' of nucleic chains?). 
https://www.jiscmail.ac.uk/cgi-bin/webadmin?A2=ind1311L=CCP4BBD=0P=12314

I think that provided you follow the agreed rules on chain ids (see below) then 
you have a 'right of veto' on the water renumbering. You also have a 'right of 
veto' for the water movement to symmetry-related positions - which would 
involve chain id change. 

So probably the best way is to assign the chain ids yourself and just say you 
don't want any of your waters renumbered. Then keep track of them yourself as 
Ed suggested using documented community tools.

One related question is how waters with alternate positions are dealt with. I 
cannot find clear documentation on this. Often sidechains with alternate 
positions motivate alternate waters during refinement. But I believe these are 
removed during the deposition water renumbering step. So if you have particular 
waters in alternative positions - say owing to a Michaelis complex at the 
active site - then the renumbering will likely split your water in alternate 
positions to produce two apparently unrelated, differently-numbered, solvent 
atoms. It could be argued how can you know these are alt pos of the same water? 
But if you have refined them like that then it is good for users of your 
coordinates to see that documented.  

It may be that the new PDB Deposition System is more flexible in this regard - 
I have not used it yet.

Below is the current agreed procedure from the wwPDB website.

Hope that helps. 
 regards
  Martyn 
How are chain IDs assigned to chemical groups and waters?
All chemical groups and waters within 5 Ångstroms of any atom of a polymeric 
chain will be considered to be associated with that chain and will be assigned 
the same chain ID as that polymer. If a particular chemical group/water is 
equidistant from more than one chain, then the chain ID is randomly assigned to 
be that of any one of these polymers.
Waters further than 5 Ångstroms away from the polymer that can be moved by 
symmetry to within 5 Ångstroms, will then be automatically moved and the author 
will be notified. If there are objections, the waters will be moved back to 
their original positions.
Waters further than 5 Ångstroms away from any polymer, which cannot be brought 
closer to a polymer chain by application of symmetry, will be brought to the 
attention of the depositor. The waters will be listed in REMARK 525 with the 
distance to the nearest macromolecule listed. Authors will be given the 
opportunity to remove the waters or update the coordinates. Authors may choose 
to retain the distant waters in the entry.



 From: Edward A. Berry ber...@upstate.edu
To: CCP4BB@JISCMAIL.AC.UK 
Sent: Monday, 7 April 2014, 15:25
Subject: Re: [ccp4bb] Renumbering of water molecules during deposition
 

After deposition (my experience is with RCSB) you will get back a processed
file to verify.  See how the waters have been renumbered, and renumber
in such a a way that your original scheme is easily identifiable but in
keeping with whatever the renumbering was trying to achieve- e.g. your
waters X1,X2,X3 could become A1001, A1002, A1003 and the symmetry related
waters B1001, B1002, B1003. Discuss with the annotator to see how your
waters can be renumbered in a recognizable way in keeping with requirements
of the PDB conventions. Then in future depositions keep this same scheme
(you may still need to renumber the original processed file each time).
eab

Eckhard Hofmann wrote:
 Hi all,
 during deposition of structures the water numbering is frequently changed.
 I would love to keep specific numbers for critical water molecules e.g.
 in an active site in related proteins (mutants, ligand soaks), to
 facilitate easier discussion in a manuscript.

 Is there a general agreement how to treat these?

 One way to deal with this is would be to use an arbitrary offset:
 A1-A233 protein
 A301 ligand
 A302 another ligand
 A401-404 4 special waters
 A500- general solvent

 Any input welcome ;-)
 All the best,
 Eckhard
 --
 Prof. Dr. Eckhard Hofmann eckhard.hofm...@bph.ruhr-uni-bochum.de
 Ruhr-Uni Bochum
 AG Proteinkristallographie, LS Biophysik, ND04/318
 44780 Bochum
 Tel: +49-(0)234/32-24463, Sekr. -24461, FAX: -14762

Re: [ccp4bb] Convert DSSP to PDB format

2014-03-02 Thread MARTYN SYMMONS

The PDB don't seem to use DSSP for the PDB format coordinate files - but 
instead some sort of in-house algorithm that gives slightly different answers 
at times. 

This can trip you up - I recently loaded 3mop into Pymol and then deleted the 
header including the PDB secondary structure and loaded it again. Surprisingly 
deleting the PDB header gave  significant differences in the death domain 
helices  - most likely because Pymol reads the PDB secondary structure if 
present; or otherwise follows its own version of DSSP rules.   

To add to confusion the PDB actually shows the DSSP-based secondary structure 
for the chains in 3mop 
(http://www.rcsb.org/pdb/explore/remediatedSequence.do?structureId=3MOPbionumber=1)
 and apparently other structures. 


You might expect that the secondary structure in the files downloaded from the 
PDB would correspond to that displayed on their own webpages - but not so 

I don't know of a publication describing the secondary structure dictionary 
used by the PDB deposition software. But if you want to be sure of what appears 
in the header of your deposited file then best thing is to use dssp2pdb as 
suggested.  Then supply the output to the PDB. They will replace their version 
with the ones you provide. But will include remarks in the PDB files to 
indicate that you have not followed their standard procedure (for example: 
REMARK 650 DETERMINATION METHOD: AUTHOR PROVIDED.).

I don't know if the dictionary used by the new mmCif-based Deposition and 
Annotation system will produce definitions equivalent to those used previously 
to produce the PDB file versions. Or if the new DA now produces definitions 
that are equivalent to DSSP. Or if it produces some third type of definition... 

Perhaps someone at the PDB will clarify - when I ask them specific questions as 
an individual then they don't bother replying - but if the bulletin board shows 
sufficient interest then we usually get at 'Public Service Announcement' of 
some kind ;) 

All the best
  Martyn 

Martyn Symmons
Cambridge



 From: Mark Brooks mark.x.bro...@gmail.com
To: CCP4BB@JISCMAIL.AC.UK 
Sent: Saturday, 1 March 2014, 16:19
Subject: Re: [ccp4bb] Convert DSSP to PDB format
 

Hi Wenhe,
       One answer to Q#2 is James Stroud's dssp2pdb to take the
output of DSSP  write HELIX and SHEET lines etc.
http://www.jamesstroud.com/software/dssp2pdb/

Mark
P.S. I believe the answer to question #1 is DSSP but I suspect the
answer is more complex.

On 1 March 2014 13:50, Wenhe wenhezhong.xmu@gmail.com wrote:
 Dear CCP4 friends,

 I have two questions after I solving one structure:

 1. What tool does PDB databank use to generate secondary structure 
 information which also shows in the pdb. file?
 2. I had a secondary structure file which generated by DSSP, how I can 
 convert it to pdb format?

 Thank you.

 Kind regards,
 Wenhe

Re: [ccp4bb] resubmission of pdb

2014-02-03 Thread MARTYN SYMMONS

Yes, thanks Robbie
That is just my point - a structure submitted and then validated for paper 
reviewers by the PDB can be changed almost completely after the paper is 
accepted.
The data can be changed too and only stipulation seems to be that it cannot be 
new data i.e. it has to have a date of collection _before_ submission. 

Changes are of course not bad in principle - they may be motivated by peer 
review.

But they need to be tracked. ln a way that is understandable for users of the 
data. Perhaps they are available in the new mmCif-based deposition and 
annotation system. I have not used it yet. 

The PDB provides a comparison between obsoleted and superseding entries 
(coordinates at least). So a similar approach could be used.

all the best
Martyn

 


 From: Robbie Joosten robbie_joos...@hotmail.com
To: CCP4BB@JISCMAIL.AC.UK 
Sent: Sunday, 2 February 2014, 20:13
Subject: Re: [ccp4bb] resubmission of pdb
  

Hi Martyn,

 I have recently had the same problem. But generally, the PDB will usually
 allow a further 6 months hold for review or modifications to an already
 submitted paper.
That is good to hear. I guess the 1 year limit is mostly to avoid structures
to stay in limbo too long.

 But what I wanted to say was that the correct term is 'withdrawal' if the
entry
 is removed pre-release - 'retraction' carries a pejorative connotation.
Even
 after release, pulling an entry would be called obsoleting (status OBS)
 without superseding. So some structures have been 'obsoleted' owing to
 retraction of a published paper. (Superseding is when a better structure
 replaces the original - this process is tracked by the PDB.)
Indeed, bad choice of words on my side. Just to complete the list, the
possible statuses are here: http://www.ebi.ac.uk/pdbe-srv/status/search/doc

 Most pre-release 'withdrawn' entries are of course subsequently released
 after re-submission. But the PDB does not seem to track these connections
-
 although they maintain a list of withdrawn entries - which means ids
cannot
 really be recycled.
That's too bad, there are not that many possible PDBids.

 Interestingly, before release entries can be 'replaced' which means a new
 structure can take the place (and 4 letter code) of the old one - this
would
 have to have the same meta-data - so source and expression - but could
 have different resolution, space group, coordinates, and small molecules.
 Changes in these could for example be motivated by referees' comments on
 the submitted paper or maybe the authors got lucky with a better crystal.
But
 this pre-release replacement could also be potentially used to 'sex up' a
 structure - for example by adding a 'novel' small molecule 'overlooked' in
the
 original deposition. Such changes are tracked privately by the PDB but are
not
 publically available... even after release.
I didn't know this was an option. It seems sensible for peer review, but
does present a potential loop-hole. I saw a fairly recent PDB entry that was
deposited as a C-alpha trace (in 2013), but presented as a full model in
Table 2 of the linked publication. The model was deposited a month before
the paper was accepted, so referees could have noticed this (in theory). But
now I wonder I the model was not 'downgraded' before the release. Perhaps
I'm just paranoid.

Cheers,
Robbie

 Even more interestingly, the ligand definitions such as bond orders can be
 modified _after_ release (as in the recent R12 case I noticed*)... I think
this is
 owing to the lack of clear rules on small molecule changes - which means
the
 PDB should be considered of limited value as a definitive record of small
 molecule chemistry.
 
 Cheers - M
 *https://www.mail-archive.com/ccp4bb@jiscmail.ac.uk/msg33403.html
 
 
 
 
 
 
 
 
 From: Robbie Joosten robbie_joos...@hotmail.com
 To: CCP4BB@JISCMAIL.AC.UK
 Sent: Saturday, 1 February 2014, 12:48
 Subject: Re: [ccp4bb] resubmission of pdb
 
 
 Hi Folmer,
 
 Perhaps because of the one year limit of keeping PDB entries in the 'HPUB'
 status.
 
 So when a PDB entry is retracted before release, is the PDBid recycled
after
 a while?
 
 Cheers,
 Robbie
 
  -Original Message-
  From: CCP4 bulletin board [mailto:CCP4BB@JISCMAIL.AC.UK] On Behalf Of
  Folmer Fredslund
  Sent: Saturday, February 01, 2014 10:33
  To: CCP4BB@JISCMAIL.AC.UK
  Subject: Re: [ccp4bb] resubmission of pdb
 
  Hi Faisal,
 
  There is one thing I don't understand:
 
  Some time back i had submitted a coordinate in PDB but because of
  unacceptance of the manuscript we had to retract the submission
 
  Why would you need to retract your deposited structure just because the
  paper describing the structure didn't get accepted?
 
 
  Venlig hilsen
  Folmer Fredslund
 
  On Jan 31, 2014 10:04 PM, Faisal Tarique faisaltari...@gmail.com
 wrote:
 
 
      Dear all
 
      Dear Dr. PDB,
 
      Some time back i had submitted a coordinate in PDB but because of
  unacceptance of the manuscript we had to

Re: [ccp4bb] resubmission of pdb

2014-02-01 Thread MARTYN SYMMONS

I have recently had the same problem. But generally, the PDB will usually allow 
a further 6 months hold for review or modifications to an already submitted 
paper. 

But what I wanted to say was that the correct term is 'withdrawal' if the entry 
is removed pre-release - 'retraction' carries a pejorative connotation. Even 
after release, pulling an entry would be called obsoleting (status OBS) without 
superseding. So some structures have been 'obsoleted' owing to retraction of a 
published paper. (Superseding is when a better structure replaces the original 
- this process is tracked by the PDB.)

Most pre-release 'withdrawn' entries are of course subsequently released after 
re-submission. But the PDB does not seem to track these connections - although 
they maintain a list of withdrawn entries - which means ids cannot really be 
recycled. 

Interestingly, before release entries can be 'replaced' which means a new 
structure can take the place (and 4 letter code) of the old one - this would 
have to have the same meta-data - so source and expression - but could have 
different resolution, space group, coordinates, and small molecules. Changes in 
these could for example be motivated by referees' comments on the submitted 
paper or maybe the authors got lucky with a better crystal. But this 
pre-release replacement could also be potentially used to 'sex up' a structure 
- for example by adding a 'novel' small molecule 'overlooked' in the original 
deposition. Such changes are tracked privately by the PDB but are not 
publically available... even after release.

Even more interestingly, the ligand definitions such as bond orders can be 
modified _after_ release (as in the recent R12 case I noticed*)... I think this 
is owing to the lack of clear rules on small molecule changes - which means the 
PDB should be considered of limited value as a definitive record of small 
molecule chemistry.

Cheers - M
*https://www.mail-archive.com/ccp4bb@jiscmail.ac.uk/msg33403.html
 






 


 From: Robbie Joosten robbie_joos...@hotmail.com
To: CCP4BB@JISCMAIL.AC.UK 
Sent: Saturday, 1 February 2014, 12:48
Subject: Re: [ccp4bb] resubmission of pdb
  

Hi Folmer,

Perhaps because of the one year limit of keeping PDB entries in the 'HPUB'
status. 

So when a PDB entry is retracted before release, is the PDBid recycled after
a while? 

Cheers,
Robbie

 -Original Message-
 From: CCP4 bulletin board [mailto:CCP4BB@JISCMAIL.AC.UK] On Behalf Of
 Folmer Fredslund
 Sent: Saturday, February 01, 2014 10:33
 To: CCP4BB@JISCMAIL.AC.UK
 Subject: Re: [ccp4bb] resubmission of pdb
 
 Hi Faisal,
 
 There is one thing I don't understand:
 
 Some time back i had submitted a coordinate in PDB but because of
 unacceptance of the manuscript we had to retract the submission
 
 Why would you need to retract your deposited structure just because the
 paper describing the structure didn't get accepted?
 
 
 Venlig hilsen
 Folmer Fredslund
 
 On Jan 31, 2014 10:04 PM, Faisal Tarique faisaltari...@gmail.com
wrote:
 
 
     Dear all
 
     Dear Dr. PDB,
 
     Some time back i had submitted a coordinate in PDB but because of
 unacceptance of the manuscript we had to retract the submission. During
 this procedure i got few zipped file from the annotator such as 1.
 rcsb0.cif-public.gz,  2. rcsb0.pdb.gz and  3. rcsb0-
 sf.cif.gz..Now i want to submit the same ..My question is what is the best
 way to do it again..??
     Should we start  from the beginning through ADIT Deposition tool
 and resubmit it with a new PDB id or there is some way to submit again
those
 zip files which the annotator sent us after retraction..May you please
suggest
 what could be the easiest way to submit our structure to PDB without much
 efforts.
 
 
     --
     Regards
 
     Faisal
     School of Life Sciences
     JNU

Re: [ccp4bb] How to show the ligands (both as sticks and sphere) as shown here in Pymol

2014-01-20 Thread MARTYN SYMMONS

Not a pymol approach but I've noticed this dual mode display with transparency 
is the default in Molsoft ICM browser if you toggle on  ball and sticks 
together with spacefill for the same atoms.
Molsoft ball-and-stick also shows the bond order or aromaticity of ligand atoms 
- which I guess is a motivation for combining sticks with spacefill. Can export 
higher res png of the graphics too. 
  all the best
   Martyn   



 From: Wei Shi wei.shi...@gmail.com
To: CCP4BB@JISCMAIL.AC.UK 
Sent: Monday, 20 January 2014, 4:40
Subject: [ccp4bb] How to show the ligands (both as sticks and sphere) as shown 
here in Pymol
 


Hi all,

Please see attached Fig where they show the ligand both as sticks and spheres 
at 
the same time. Does anyone happen to know how to display the ligand like this? 
Thank you so much! 


Best,
Wei

Re: [ccp4bb] Problematic PDBs

2013-11-17 Thread MARTYN SYMMONS

Hmm
So the backstory for the problematic ligand R12 in this thread is that a 
sharp-eyed worker at the wwPDB recently spotted that there was an error 
compared with the original 1999 paper. Correcting the R12 ligand is a friendly 
gesture from the PDB as it appears that the error must have been the authors' - 
the atom correcting the R12 ligand has been inserted by the PDB staff rather 
than retreived from a deposited structure.
 
It is a shame the same helpful approach is not always applied. One current 
example I spotted is a separate 'problematic ligand' 5AX which has been added 
by the wwPDB to at least four other authors' entries, starting in 2006 with the 
latest in 2009. 
 
5AX is basically a fragment ligand which the PDB software produces if a NAG has 
wandered too far from its Asn sidechain during refinement. If 5AX is generated 
during the PDB processing of a deposition, then it really should be highlighted 
for the authors as a geometric issue - rather than, as in these cases, being 
simply added to the coordinates. 
 
Reading the authors' papers for the 5AX-containing entries makes it clear that 
they never expected anything other than NAG to appear in their deposited 
coordinates.
 
And given its artifactual production during deposition, 5AX should never have 
'escaped into the wild'. 
 
So if a retrospective fix can be applied to R12 (which similar in lacking an 
atom) then it seems to me that, in fairness, a clean up of the 5AX entries 
should be arranged. 
 
Yours (not holding his breath),
Martyn
 
 

  


 From: Rachel Kramer Green kra...@rcsb.rutgers.edu
To: CCP4BB@JISCMAIL.AC.UK 
Sent: Wednesday, 6 November 2013, 16:49
Subject: Re: [ccp4bb] Problematic PDBs
  


Dear Martyn,

wwPDB staff regularly reviews and remediates PDB data and related
  dictionaries such as the Chemical Component Dictionary (CCD).

As part of our on-going remediation efforts, the chemical
  components in the archive are regularly reviewed to ensure the
  correctness and the completeness of the chemical representation.
  Such reviews show that in some cases, the author has failed to
  provide a complete description of the chemistry. To address any
  such errors, the definitions are corrected. The chemical name and
  formula are changed in the PDB file, but the coordinates are not
  changed.

In the case of entry 3CBS, issues were found with the chemical
  component definition for its ligand R12. The methyl group was not
  in the deposited coordinates and it was missing from the original
  definition. In addition, the bond order in one of the
  carbon-carbon bonds was incorrectly defined. The CCD definition
  for R12 was updated in 2011 to add the methyl group and to correct
  the bond order based on information in the primary citation. The
  coordinates for this PDB entry were not changed. Therefore, in
  accordance with wwPDB policy, the file was not obsoleted.

Sincerely,
Rachel Green


 


  
Rachel Kramer Green, Ph.D. 
RCSB PDB 
kra...@rcsb.rutgers.edu 
  
  
Twitter: https://twitter.com/#!/buildmodels 
Facebook: http://www.facebook.com/RCSBPDB 
   
On 10/21/2013 6:28 AM, MARTYN SYMMONS wrote:
 
As a postscript it might be worth mentioning one problematic ligand that 
suggested to me a way to correct some of the errors mentioned in this thread 

R12 is indicated as 9-(4-HYDROXY-2,6-DIMETHYL-PHENYL)-3 in the  most 
recent Coot monomer library. But in the PDB ligand description it is 
9-(4-hydroxy-2,3,6-trimethylphenyl)-3,7-dimethylnona-2,4,6,8-tetraenoic acid 
with an additional carbon C16. To make a long story short this ligand was 
originally deposited missing this extra methyl goup in 1999 (as part of 3CBS) 
and then apparently updated in 2011 by the PDB. 

 
(the relevant lines in the cif are 
snip 
R12 C16 C16 C 0 1 N N N ?      ?      ?      -6.631 1.502  0.990  C16 R12 44  
R12 H1  H1  H 0 1 N N N ?      ?      ?      -6.602 1.511  2.080  H1  R12 45  
R12 H23 H23 H 0 1 N N N ?      ?      ?      -6.422 2.503  0.613  H23 R12 46  
R12 H24 H24 H 0 1 N N N ?      ?      ?      -7.619 1.186  0.656  H24 R12 47  
snip  

 
with the ? ? ? indicating that refined coordinates were not available at the 
time of the update. There was initially an explanation line at the end of the 
cif: 

 
snip 
R12 Other modification 2011-10-25 RCSB CS 'add missing methyl group, 
re-define bond order based on publication' 
snip 

 
But this has mutated for some reason (premature stop codon?) over the past 
year to the following. 

 
snip   
R12 Other modification 2011-10-25 RCSB  
snip 

 
Obviously the full correct ligand could not have been incorporated into the 
PDB entry coordinates without these undergoing a full obsolete - supersede 
process (somewhat embarrassing perhaps as one author is now a wwPDB PI ;) 

 
But it is frustrating for users of the PDB that in such cases easily 
correctable

Re: [ccp4bb] Comparison of Water Positions across PDBs

2013-11-04 Thread MARTYN SYMMONS

Thank you for that, Rachel

Even though the tone of your comment does not suggest that you want to carry on 
a dialogue about this, I thought I would reply in any case - since dialogue is 
what this forum is supposed to be about.

Thing is,  I was sort of looking for an explanation of why the rule was adopted 
that waters were to be renumbered from N to C terminus. If this is not 
functioning to put the waters in register across a set of related structures 
then it seems somewhat arbitrary. And other schemes might be suggested to be 
better for the usability and interpretation of the structural data. 

In some cases there are only a few waters but in many structures the wwPDB 
partners renumber hundreds. And this process makes it difficult for authors to 
check the final deposited structure against the output of their refinement. 

I have to say that I agree with other contributors to this thread. It would be 
much better to let the refinement program authors agree on a default water 
numbering scheme. And then maintain that through deposition. 

I thought of six possible schemes before breakfast... one of my favourites was 
to order by B-factor - which might appeal to crystallographers. Another was to 
give priority to those in the coordination sphere of any metal ions - these 
actually get priority in the PDB as they are included in the LINKS records 
above the coordinates. These coordinated waters are often refined together with 
the metals and so it would make sense to move them closer to their friendly 
ion.    

And of course one other clearly suitable option would be to leave the waters in 
the authors' preferred order - chosen with help from their refinement suite. 
This is what happens during deposition with the residues of the polymers - 
(provided the authors chainids are suitably chosen). Following your link the 
rule for polymers is that: 'If the coordinate residue numbers, as provided by 
the author, are unique and sequential within a particular chain ID, the 
residues will not be renumbered.' 

I'm presuming that if the authors have a preferred suitable set of water 
numbers then that would be maintained similarly?

Perhaps that is what is happening in the cases I notice that do not follow 
wwPDB rules?

On Friday I was looking at TIRAP structures and in 3ub2 the protein construct 
starts at residue 78 and its final residue is 221 - but the associated DTT is 
labelled back at residue 1 in the same chain. Then the first ten out of eleven 
waters are residues 2 to 11 but then oddly the eleventh water is residue 222. 
Is there a difference in this C-terminal water compared with the N-terminal 
ones? I imagined it was perhaps maintained to fit in with the associated 
publication - or maybe started out life modelled as a metal ion - unfortunately 
I can find no mention of it in the paper. 

But, regardless of this distracting feature,  surely this entry does not 
conform to the expected numbering scheme you mentioned as the wwPDB standard: 
polymer - heterogen - water? 

Yours perplexedly
 Martyn 



From: Rachel Kramer Green kra...@rcsb.rutgers.edu
To: CCP4BB@JISCMAIL.AC.UK 
Sent: Friday, 1 November 2013, 20:18
Subject: Re: [ccp4bb] Comparison of Water Positions across PDBs
 


In PDB format files, each polymer is assigned a unique chain ID. Chain IDs for 
all bound moieties and waters are assigned based on their proximity (number of 
contacts) to the nearest polymer. Once the polymers and non-polymer residues 
associated with them are assigned chain IDs, they are also assigned unique 
residue numbering with the order polymer residues, ligands and then waters. 

Please see: http://www.wwpdb.org/procedure.html#toc_4

The wwPDB has established this rule to improve the usability and
  interpretation of the structural data. Assigning the same chain ID
  for all moieties associated with a polymer enables rapid and
  uniform identification of feature analysis.

Sincerely,
Rachel Green

 


 
Rachel Kramer Green, Ph.D.
RCSB PDB
kra...@rcsb.rutgers.edu
 
 
Twitter: https://twitter.com/#!/buildmodels
Facebook: http://www.facebook.com/RCSBPDB
 
On 10/30/2013 8:09 AM, Eugene Krissinel wrote:

This is to be answered by PDB people, who definitely read BB :) Would be nice 
to have a tool common between CCP4/Phenix and the PDB which sorts this out 
Eugene On 30 Oct 2013, at 12:09, Andreas Förster wrote: 
Dear all, this water discussion is flowing increasingly towards a place where 
I feel a bit out of my depth. What is the convention for numbering water 
molecules?  Is there preference for: - putting waters into a separate chain (W 
for water or S for solvent)?
- splitting waters according to the peptide chains in the structure?
- appending all waters to chain A? Thanks. Andreas On 30/10/2013 11:57, MARTYN 
SYMMONS wrote: 
At deposition the PDB runs a script that renumbers authors'  waters
according to a scheme based on the residue

Re: [ccp4bb] Comparison of Water Positions across PDBs

2013-11-04 Thread MARTYN SYMMONS

Thanks Bernhard 
  you have helpfully distinguished between the two processes - there is 
certainly a movement of waters to symmetry replacements closer to a chain - and 
that gets documented in Remark 525 of the PDB file returned to authors - 
although then it is stripped out, I think, before the entry is released.

 But generally a renumbering is applied to all the waters - these are not 
actually moved but are re-ordered. And of course the number count of them may 
change - in order to accommodate any waters that are swapped in or out of the 
chain during the symmetry operation.

 Currently I don't think authors are given access to an audit of what is 
happening - they can of course check their favourite waters by a 
superimposition. Still have to say best way to avoid errors would be to check 
symmetry and re-order at the end of refinement, pre-deposition. 

All the best 
  Martyn 



 From: Bernhard Rupp hofkristall...@gmail.com
To: CCP4BB@JISCMAIL.AC.UK 
Sent: Monday, 4 November 2013, 14:50
Subject: Re: [ccp4bb] Comparison of Water Positions across PDBs
 


As far as confusion as a result of PDB renumbering is concerned: It was useful 
to run the old REM525 standalone program (I think I got it from PDBe/Kim 
Hendrick) at the end of solvent building. It does what the PDB did with water 
renumbering when creating the REMARK 525 (probably based on ccp4 contact with 
additions). Is there an updated standalone PDB tool available one can use these 
days to avoid at least that issue? 
 
Thx, BR
 
From:CCP4 bulletin board [mailto:CCP4BB@JISCMAIL.AC.UK] On Behalf Of MARTYN 
SYMMONS
Sent: Montag, 4. November 2013 15:17
To: CCP4BB@JISCMAIL.AC.UK
Subject: Re: [ccp4bb] Comparison of Water Positions across PDBs
 
Thank you for that, Rachel
 
Even though the tone of your comment does not suggest that you want to carry on 
a dialogue about this, I thought I would reply in any case - since dialogue is 
what this forum is supposed to be about.
 
Thing is,  I was sort of looking for an explanation of why the rule was adopted 
that waters were to be renumbered from N to C terminus. If this is not 
functioning to put the waters in register across a set of related structures 
then it seems somewhat arbitrary. And other schemes might be suggested to be 
better for the usability and interpretation of the structural data. 
 
In some cases there are only a few waters but in many structures the wwPDB 
partners renumber hundreds. And this process makes it difficult for authors to 
check the final deposited structure against the output of their refinement. 
 
I have to say that I agree with other contributors to this thread. It would be 
much better to let the refinement program authors agree on a default water 
numbering scheme. And then maintain that through deposition. 
 
I thought of six possible schemes before breakfast... one of my favourites was 
to order by B-factor - which might appeal to crystallographers. Another was to 
give priority to those in the coordination sphere of any metal ions - these 
actually get priority in the PDB as they are included in the LINKS records 
above the coordinates. These coordinated waters are often refined together with 
the metals and so it would make sense to move them closer to their friendly 
ion.    
 
And of course one other clearly suitable option would be to leave the waters in 
the authors' preferred order - chosen with help from their refinement suite. 
This is what happens during deposition with the residues of the polymers - 
(provided the authors chainids are suitably chosen). Following your link the 
rule for polymers is that: 'If the coordinate residue numbers, as provided by 
the author, are unique and sequential within a particular chain ID, the 
residues will not be renumbered.' 
 
I'm presuming that if the authors have a preferred suitable set of water 
numbers then that would be maintained similarly?
 
Perhaps that is what is happening in the cases I notice that do not follow 
wwPDB rules?
 
On Friday I was looking at TIRAP structures and in 3ub2 the protein construct 
starts at residue 78 and its final residue is 221 - but the associated DTT is 
labelled back at residue 1 in the same chain. Then the first ten out of eleven 
waters are residues 2 to 11 but then oddly the eleventh water is residue 222. 
Is there a difference in this C-terminal water compared with the N-terminal 
ones? I imagined it was perhaps maintained to fit in with the associated 
publication - or maybe started out life modelled as a metal ion - unfortunately 
I can find no mention of it in the paper. 
 
But, regardless of this distracting feature,  surely this entry does not 
conform to the expected numbering scheme you mentioned as the wwPDB standard: 
polymer - heterogen - water? 
 
Yours perplexedly
 Martyn 




From:Rachel Kramer Green kra...@rcsb.rutgers.edu
To: CCP4BB@JISCMAIL.AC.UK 
Sent: Friday, 1 November 2013, 20:18
Subject

Re: [ccp4bb] Comparison of Water Positions across PDBs

2013-10-30 Thread MARTYN SYMMONS

At deposition the PDB runs a script that renumbers authors'  waters according 
to a scheme based on the residue they are nearest from N to C terminus along 
each chain. This renumbering started  when waters were assigned to 
macromolecular chains rather than getting a chain id of their own.  I have 
failed to find the rationale explained in any PDB documents - but it could be 
motivated by this sort of consideration when waters from different chains or 
entries are to be compared. Having said that I do not know if there are any 
cases where this approach has successfully matched waters. ..


However an associated step which is certainly a help is that, in the case of 
multiple chains, the crystal symmetry is applied to replace waters with their 
symmetry equivalent position if it is closer to a different chain.

I believe a freely available program implementing a similar approach is 
WATERTIDY in CCP4 which might be a good place to start.  It gives a pretty 
complete output, detailing residues actually H-bonded to the waters, and you 
could parse that for further analysis and comparisons.

Best wishes.
  Martyn   


 From: Joel Sussman joel.suss...@weizmann.ac.il
To: CCP4BB@JISCMAIL.AC.UK 
Sent: Wednesday, 30 October 2013, 6:49
Subject: Re: [ccp4bb] Comparison of Water Positions across PDBs
 


For detailed examination of this topic, see: 


Koellner, G., Kryger, G., Millard, C. B., Silman, I., Sussman, J. L.  Steiner, 
T. (2000). 
Active-site gorge and buried water molecules in crystal structures of 
acetylcholinesterase from Torpedo californica. Journal of Molecular Biology 
296, 713-735.

http://www.ncbi.nlm.nih.gov/pubmed/10669619

best regards,
Joel

On 30 Oct 2013, at 01:35, Ed Pozharski epozh...@umaryland.edu wrote:

http://www.ccp4.ac.uk/html/watertidy.html


On 10/29/2013 04:43 PM, Elise B wrote:

Hello,

I am working on a project with several (separate) structures of the same 
protein. I would like to be able to compare the solvent molecules between the 
structures, and it would be best if the waters that exist in roughly the same 
position in each PDB share the
 same residue number. Basically, I want to compare solvent molecule coordinates 
and assign similar locations the same name in each structure.

What would be the best strategy for re-numbering the water molecules such 
that those with similar coordinates in all the structures receive the same 
residue number? I'd appreciate any suggestions.

Elise Blankenship



-- 
Oh, suddenly throwing a giraffe into a volcano to make water is crazy?
   Julian, King of Lemurs

Re: [ccp4bb] Problematic PDBs

2013-10-21 Thread MARTYN SYMMONS

As a postscript it might be worth mentioning one problematic ligand that 
suggested to me a way to correct some of the errors mentioned in this thread
 
R12 is indicated as 9-(4-HYDROXY-2,6-DIMETHYL-PHENYL)-3 in the  most recent 
Coot monomer library. But in the PDB ligand description it is 
9-(4-hydroxy-2,3,6-trimethylphenyl)-3,7-dimethylnona-2,4,6,8-tetraenoic acid 
with an additional carbon C16. To make a long story short this ligand was 
originally deposited missing this extra methyl goup in 1999 (as part of 3CBS) 
and then apparently updated in 2011 by the PDB.

(the relevant lines in the cif are
snip
R12 C16 C16 C 0 1 N N N ?      ?      ?      -6.631 1.502  0.990  C16 R12 44 
R12 H1  H1  H 0 1 N N N ?      ?      ?      -6.602 1.511  2.080  H1  R12 45 
R12 H23 H23 H 0 1 N N N ?      ?      ?      -6.422 2.503  0.613  H23 R12 46 
R12 H24 H24 H 0 1 N N N ?      ?      ?      -7.619 1.186  0.656  H24 R12 47 
snip 

with the ? ? ? indicating that refined coordinates were not available at the 
time of the update. There was initially an explanation line at the end of the 
cif:

snip
R12 Other modification 2011-10-25 RCSB CS 'add missing methyl group, 
re-define bond order based on publication'
snip

But this has mutated for some reason (premature stop codon?) over the past year 
to the following.

snip  
R12 Other modification 2011-10-25 RCSB 
snip

Obviously the full correct ligand could not have been incorporated into the PDB 
entry coordinates without these undergoing a full obsolete - supersede process 
(somewhat embarrassing perhaps as one author is now a wwPDB PI ;)

But it is frustrating for users of the PDB that in such cases easily 
correctable errors are not actually updated by the authors. Would it not be 
helpful if there were a mechanism to make and track useful improvements in 
deposited structures? - Perhaps suggested by members of the community to the 
authors. 

These changes could be considered as 'corrigenda' and could be documented and 
tracked - complete with an explanation of the reasoning behind the change and 
attributing the motivation and origin of the improvement.

This would be a good way for the wider scientific community (who maybe do not 
read this bulletin board) to access the best current model without the authors 
suffering the full process of retracting and redepositing their PDB entry. The 
test for obsoleting would then be the same as for a paper - that the change 
invalidates a fundamental interpretation of the data. 

All the best
  Martyn 



 From: Pavel Afonine pafon...@gmail.com
To: CCP4BB@JISCMAIL.AC.UK 
Sent: Sunday, 20 October 2013, 19:49
Subject: Re: [ccp4bb] Problematic PDBs
 


Hello,

just for the sake of completeness: this paper lists a bunch of known 
pathologies (I would not be surprised if they've been remediated by now):

http://www.phenix-online.org/papers/he5476_reprint.pdf


Pavel



On Thu, Oct 17, 2013 at 6:51 AM, Lucas lucasbleic...@gmail.com wrote:

Dear all,

I've been lecturing in a structural bioinformatics course where graduate 
students (always consisting of people without crystallography background to 
that point) are expected to understand the basics on how x-ray structures are 
obtained, so that they know what they are using in their bioinformatics 
projects. Practices include letting them manually build a segment from an 
excellent map and also using Coot to check problems in not so good structures.

I wonder if there's a list of problematic structures somewhere that I could 
use for that practice? Apart from a few ones I'm aware of because of (bad) 
publicity, what I usually do is an advanced search on PDB for entries with 
poor resolution and bound ligands, then checking then manually, hopefully 
finding some examples of creative map interpretation. But it would be nice to 
have specific examples for each thing that can go wrong in a PDB construction.

Best regards,
Lucas

Re: [ccp4bb] Docking models into low-res SAD map

2013-09-19 Thread Martyn Symmons


Hi Oliver
  a few years ago I used the ccp4 FFFEAR program from Kevin Cowtan for 
this sort of domain search. It was very good but also quite 
labour-intensive. I have not followed up since to see if it was updated 
or the functionality was included in Buccaneer.


cheers
Martyn


--
Martyn F. Symmons
Dept Veterinary Medicine
Cambridge

On 18/09/2013 23:48, Oliver Clarke wrote:

Hi all,

Can anyone recommend software to dock previously solved domains into a (very) 
low-resolution experimentally phased map?

I am working on a rather marginal case where this would be useful - very large 
assembly (500kDA per ASU), native goes to 6.9A, and the best derivative goes to 
8A with SAD phases from a heavy atom cluster to 11. Unfortunately no NCS is 
present.

The map is (for the resolution!) not too bad - the solvent boundary is clear 
after SHELXE, and I can manually place a previously solved structure and get a 
reasonably convincing fit (and the fit I obtain agrees with that previously 
determined using CryoEM by another group).

There are a couple of other domains that I would like to place, and I believe I 
have some idea of where they are, but manual fitting is somewhat subjective, so 
I'd like an automated routine for doing the same - something like the 
functionality that ADP_EM provides when working with CryoEM maps. Does 
something like this exist? I tried using ADP_EM but it gives a segfault when 
used with a crystallographic density map.

I have tried using MOLREP to look for the model in the map to no avail, and 
PHASER didn't work either.

On another note, if anyone has any tips for density modification/phase 
extension in this resolution range I would love to hear them - haven't had a 
whole lot of luck with PARROT, DM, or SOLOMON, the maps seem stuck at ~10A 
despite data going to 8. I tried using SHARP with the SPHCLUSTER tag on, but it 
gave no improvement over what I obtain treating the cluster as a super atom in 
SHELXD.

Many thanks in advance!

Oliver.

Re: [ccp4bb] What kind of reflection data to deposit to PDB

2013-09-17 Thread Martyn Symmons


Sorry to contradict,

But the mmCIF data model certainly does not seem to require hkl unique 
within the reflection data.


Like CIF the mmCIF formalism has been developed to allow a complete 
description of a diffraction experiment and the data arising from it. 
There is a full description at 
http://mmcif.pdb.org/dictionaries/mmcif_pdbx_v40.dic/Categories/diffrn_refln.html 
(I am grateful to Rachel Kramer Green at the RCSB for pointing out these 
links to the dictionary and the papers describing its development).


Having chosen mmCIF for the archive and then not using its flexibility 
seems a bit like having your cake and NOT eating it.


It is strange to hear on a discussion board that recently considered the 
advantages of depositing complete image data, that a case will have to 
be made for allowing the deposition of full unmerged datasets.


++Martyn


On 16/09/2013 14:03, Gerard DVD Kleywegt wrote:

Dear all,

At present, unmerged data cannot be handled properly as part of a PDB 
deposition. One reason for this is that changes to the mmCIF/PDBx data 
model will be required (at the moment, hkl must be unique within the 
reflection data, which is logical for merged data but precludes 
handling of unmerged data). There are other (easier to resolve) issues 
to work out, e.g. having to do with file naming and distribution via 
the wwPDB ftp archive.


The wwPDB partners are presently focusing all efforts on rolling out 
the new joint deposition and annotation system. Once this system is 
reasonably stable we will look into accepting/validating/distributing 
new kinds of data. This concerns not only unmerged Is for X-ray but 
also unassigned NOE peak lists for NMR. We will seek the advice of the 
corresponding wwPDB VTFs (Validation Task Forces) to help define the 
data items that need to be captured, how the data should be processed 
by wwPDB, and what kind(s) of validation is/are required.


--Gerard

---
Gerard J. Kleywegt, PDBe, EMBL-EBI, Hinxton, UK
ger...@ebi.ac.uk . pdbe.org
Secretary: Pauline Haslam pdbe_ad...@ebi.ac.uk




On Thu, 5 Sep 2013, Raji Edayathumangalam wrote:


Hi Folks,

Sorry for the non-ccp4 post.

I am trying to determine what is the best form of unmerged reflection 
data

to deposit to the PDB. I have single wavelength anomalous data for my
structure and I have two flavors of scaled files from the same exact 
set of
diffraction images: (1) data indexed and scaled in p1, and (2) data 
indexed
in p222, scaled in Scalepack using the no merge original index 
option and
converted to .mtz since the unit cell in the header of the output 
.sca file

was missing.

The space group for the dataset is p212121.

Please could you let me know what might be the best approach.

Many thanks and cheers,
Raji

--
Raji Edayathumangalam
Instructor in Neurology, Harvard Medical School
Research Associate, Brigham and Women's Hospital
Visiting Research Scholar, Brandeis University




Best wishes,

--Gerard

**
   Gerard J. Kleywegt

http://xray.bmc.uu.se/gerard mailto:ger...@xray.bmc.uu.se
**
   The opinions in this message are fictional.  Any similarity
   to actual opinions, living or dead, is purely coincidental.
**
   Little known gastromathematical curiosity: let z be the
   radius and a the thickness of a pizza. Then the volume
of that pizza is equal to pi*z*z*a !
**

Re: [ccp4bb] What kind of reflection data to deposit to PDB

2013-09-09 Thread MARTYN SYMMONS

Hi there,
To my knowledge the PDB does not include unmerged data in entries - you are 
required to deposit the data that the coordinates were refined against. 

I believe you can deposit other data which will be archived and can be 
requested by other users - problem is that none of the distribution web sites 
fesses up to the existence of these extra datasets as part of the entries - so 
users of your structure will be ignorant of them.

You could ask for an authors' remark to be included in the appropriate part of 
your coordinate header to alert users to the archived material. 

Currently you can upload data as mtz files and this format would of course be 
most that would be the most convenient. Of course the mmcif format could 
include the deposition of unmerged data.

Currently some distributed mmcif SF files in fact have multiple datasets - so 
for example relating to other crystals or other wavelengths. So you could 
simply provide your unmerged data as an mmcif block to include at the end of 
the dataset used for refinement. I'm guessing eds and pdbredo just pick up the 
first block as representative of the entry. But you can include other stuff 
after it. Again it would be useful to ask for explanatory remarks to be 
included for users of the data.

For example, the authors of the excellent structure 3ne5 submitted a 'phasing 
dataset' with their SFs. This appears (after 'Format standardization' 
2013-08-28) at the end of the native refinement dataset and I imagine refers to 
a SeMet crystal. (Although the values are so small I think they are perhaps the 
estimated intensities of just the anomalous-substructure? If you are 
interested, below is a bit from that file starting with the end of the native 
data - I imagine the negative sigmas indicate absent reflections).

You can check the mmcif dictionary at 
http://mmcif.rcsb.org/dictionaries/mmcif_pdbx.dic/Categories/refln.html

to see whether you can find an appropriate category for your data. I hope the 
PDB will be happy to spend the time required to negotiate with you an 
appropriate representation (although I was told that time constraints often 
mitigate against this).

all the best
   Martyn 


snip
1 1 1   47    1   25 o    306.1  133.1 1034.6586 1520.8234 
1 1 1   47    1   28 o    348.2  137.8 1795.0702 1421.0972 
#END
data_r3ne5Asf
# 
_cell.entry_id      3ne5 
_cell.length_a      160.076 
_cell.length_b      160.076 
_cell.length_c      680.069 
_cell.angle_alpha   90.000 
_cell.angle_beta    90.000 
_cell.angle_gamma   120.000 
# 
_diffrn.id                  1 
_diffrn.crystal_id          1 
_diffrn.ambient_temp        ? 
_diffrn.crystal_treatment   ? 
_diffrn.details             '  SeMet phasing set' 
# 
_diffrn_radiation_wavelength.id           1 
_diffrn_radiation_wavelength.wavelength   0.97920 
# 
_entry.id   3ne5 
# 
_exptl_crystal.id   1 
# 
_reflns_scale.group_code   1 
# 
_symmetry.entry_id               3ne5 
_symmetry.space_group_name_H-M   'H 3 2' 
# 
#
loop_
_refln.crystal_id
_refln.wavelength_id
_refln.scale_group_code
_refln.index_h
_refln.index_k
_refln.index_l
_refln.status
_refln.pdbx_I_plus
_refln.pdbx_I_plus_sigma
_refln.pdbx_I_minus
_refln.pdbx_I_minus_sigma
1 1 1    1    0  -29  o       13.6      1.1        0.0     -1.0 
1 1 1    1    0  -26  o        4.3      0.4        0.0     -1.0 
1 1 1    1    0  -23  o       13.1      0.8        0.0     -1.0 
1 1 1    1    0  -20  o      141.7      5.0        0.0     -1.0 
1 1 1    1    0  -17  o      305.0     10.6        0.0     -1.0 
1 1 1    2   -1   15  o      189.0      6.2      178.5      6.8
snip



 From: Raji Edayathumangalam r...@brandeis.edu
To: CCP4BB@JISCMAIL.AC.UK 
Sent: Friday, 6 September 2013, 3:42
Subject: [ccp4bb] What kind of reflection data to deposit to PDB
 


Hi Folks,

Sorry for the non-ccp4 post.

I am trying to determine what is the best form of unmerged reflection data to 
deposit to the PDB. I have single wavelength anomalous data for my structure 
and I have two flavors of scaled files from the same exact set of diffraction 
images: (1) data indexed and scaled in p1, and (2) data indexed in p222, scaled 
in Scalepack using the no merge original index option and converted to .mtz 
since the unit cell in the header of the output .sca file was missing. 

The space group for the dataset is p212121. 

Please could you let me know what might be the best approach.

Many thanks and cheers,
Raji

-- 
Raji Edayathumangalam
Instructor in Neurology, Harvard Medical School
Research Associate, Brigham and Women's Hospital
Visiting Research Scholar, Brandeis University

Re: [ccp4bb] Fractional coordinate shift with two-character chain names?

2013-09-02 Thread MARTYN SYMMONS

Hi Martyn 
   can you give me a reference where the rationale for this preference for 
mmcif is set out? I tried a pubmed search but cannot find anything. 
  thanks
   regards
   Martyn 



 From: Martyn Winn martyn.w...@stfc.ac.uk
To: CCP4BB@JISCMAIL.AC.UK 
Sent: Monday, 2 September 2013, 10:24
Subject: Re: [ccp4bb] Fractional coordinate shift with two-character chain 
names?
 

Hi,

There have been many discussions along these lines over the years.  The problem 
is understood well. The favoured solution is the use of mmCIF (as posted 
several times on the BB). But I would rather leave it to those officially 
involved to comment on the pros and cons. On the practical side, pdbcur will 
work equally well with mmCIF files.

Cheers
Martyn

*
*   Dr. Martyn Winn
*   STFC Daresbury Laboratory, Daresbury, Warrington, WA4 4AD, U.K.
*   Tel: +44 1925 603455 (DL)   or   +44 1235 567865 (RcaH)
*   E-mail: martyn.w...@stfc.ac.ukmailto:martyn.w...@stfc.ac.uk       Skype: 
martyn.winn
*


From: CCP4 bulletin board [mailto:CCP4BB@JISCMAIL.AC.UK] On Behalf Of Dirk 
Kostrewa
Sent: 02 September 2013 09:38
To: ccp4bb
Subject: Re: [ccp4bb] Fractional coordinate shift with two-character chain 
names?

Dear Martyn, Pavel and other interested,

I think, an official extension by the PDB to two characters for the chain names 
and 5 digits for residues would really help. I'm currently working on a 
structure with 6x15 chains (through NCS) - it is huge and only a few programs 
can handle this by extending the PDB format.
The last PDB format revision 
3.3http://www.wwpdb.org/documentation/format33/sect9.html#ATOM still only 
allows one character for the chain name and four digits for the residue number.
More bigger structures will be published in the future and an official 
human-readable extended PDB format would really help.

Cheers,

Dirk.

Am 30.08.13 18:14, schrieb MARTYN SYMMONS:
Hold your horsemen!
Does not this option save us from 'formatagedon'?
We currently only have single letters or numbers for chains. But we could 
easily agree to switch to double letters. And long chains can be a sequence of 
letter number permutations such as A1, A2, A3 etc (actually I notice single 
numbers are allowed for the PDB - although are deprecated until all the letters 
have been used).
We could allow the first character to be a number as well - so 11 12 13 as a 
valid sequence.for a single polymer.
Conversely we could expand the atom identifier to include letters as is the 
case with most computing identifiers - however not many programs seem to pay 
attention to the atom 'numbers' in any case.

Cheers
  Martyn

Martyn Symmons (not Winn)
Cambridge


From: Dirk Kostrewa 
kostr...@genzentrum.lmu.demailto:kostr...@genzentrum.lmu.de
To: CCP4BB@JISCMAIL.AC.UKmailto:CCP4BB@JISCMAIL.AC.UK
Sent: Friday, 30 August 2013, 15:36
Subject: Re: [ccp4bb] Fractional coordinate shift with two-character chain 
names?

Hi Martyn,

excellent - this worked!

Many thanks!

Cheers,

Dirk.

Am 30.08.13 16:04, schrieb Martyn Winn:
 IIRC the CCP4 library (i.e. mmdb) can handle 2-character chain names. There 
 may be something specific in pdbset which interferes. You can try pdbcur as 
 an alternative. Something like:

 pdbcur xyzin toxd_AA.pdb xyzout toxd_out.pdb eof
 translate * frac 0 0.2 0
 end
 eof

 I just tried it on a little example, and it works for me.

 Cheers
 Martyn

 -Original Message-
 From: CCP4 bulletin board 
 [mailto:CCP4BB@JISCMAIL.AC.UKmailto:CCP4BB@JISCMAIL.AC.UK] On Behalf Of
 Dirk Kostrewa
 Sent: 30 August 2013 14:41
 To: ccp4bb
 Subject: [ccp4bb] Fractional coordinate shift with two-character chain
 names?

 Dear CCP4ers,

 I want to apply a fractional coordinate shift along a polar b-axis with
 coordinates that have non-standard two-character chain names, such as
 AA, AB, and so forth. Unfortunately, neither the old USF moleman2
 nor the actual CCP4 pdbset can handle these chain names. To my
 knowledge, only COOT and PHENIX can cope with them.
 Before I start writing my own little jiffy, is there a quick way to use
 COOT or PHENIX to apply a fractional coordinate shift, or could you
 tell me, which other program I can use in this special case?

 Best regards,

 Dirk.

 --

 ***
 Dirk Kostrewa
 Gene Center Munich
 Department of Biochemistry
 Ludwig-Maximilians-Universität München
 Feodor-Lynen-Str. 25
 D-81377 Munich
 Germany
 Phone:     +49-89-2180-76845
 Fax:     +49-89-2180-76999
 E-mail:    kostr...@genzentrum.lmu.demailto:kostr...@genzentrum.lmu.de
 WWW:    www.genzentrum.lmu.dehttp://www.genzentrum.lmu.de
 ***

--

***
Dirk Kostrewa
Gene Center Munich
Department of Biochemistry
Ludwig-Maximilians-Universität München
Feodor-Lynen-Str. 25
D-81377 Munich
Germany
Phone:     +49-89-2180-76845
Fax:     +49-89-2180

Re: [ccp4bb] Fractional coordinate shift with two-character chain names?

2013-08-30 Thread MARTYN SYMMONS

Hold your horsemen!
Does not this option save us from 'formatagedon'?
We currently only have single letters or numbers for chains. But we could 
easily agree to switch to double letters. And long chains can be a sequence of 
letter number permutations such as A1, A2, A3 etc (actually I notice single 
numbers are allowed for the PDB - although are deprecated until all the letters 
have been used). 
We could allow the first character to be a number as well - so 11 12 13 as a 
valid sequence.for a single polymer.
Conversely we could expand the atom identifier to include letters as is the 
case with most computing identifiers - however not many programs seem to pay 
attention to the atom 'numbers' in any case. 

Cheers

  Martyn 

Martyn Symmons (not Winn)
Cambridge



 From: Dirk Kostrewa kostr...@genzentrum.lmu.de
To: CCP4BB@JISCMAIL.AC.UK 
Sent: Friday, 30 August 2013, 15:36
Subject: Re: [ccp4bb] Fractional coordinate shift with two-character chain 
names?
 

Hi Martyn,

excellent - this worked!

Many thanks!

Cheers,

Dirk.

Am 30.08.13 16:04, schrieb Martyn Winn:
 IIRC the CCP4 library (i.e. mmdb) can handle 2-character chain names. There 
 may be something specific in pdbset which interferes. You can try pdbcur as 
 an alternative. Something like:

 pdbcur xyzin toxd_AA.pdb xyzout toxd_out.pdb eof
 translate * frac 0 0.2 0
 end
 eof

 I just tried it on a little example, and it works for me.

 Cheers
 Martyn

 -Original Message-
 From: CCP4 bulletin board [mailto:CCP4BB@JISCMAIL.AC.UK] On Behalf Of
 Dirk Kostrewa
 Sent: 30 August 2013 14:41
 To: ccp4bb
 Subject: [ccp4bb] Fractional coordinate shift with two-character chain
 names?

 Dear CCP4ers,

 I want to apply a fractional coordinate shift along a polar b-axis with
 coordinates that have non-standard two-character chain names, such as
 AA, AB, and so forth. Unfortunately, neither the old USF moleman2
 nor the actual CCP4 pdbset can handle these chain names. To my
 knowledge, only COOT and PHENIX can cope with them.
 Before I start writing my own little jiffy, is there a quick way to use
 COOT or PHENIX to apply a fractional coordinate shift, or could you
 tell me, which other program I can use in this special case?

 Best regards,

 Dirk.

 --

 ***
 Dirk Kostrewa
 Gene Center Munich
 Department of Biochemistry
 Ludwig-Maximilians-Universität München
 Feodor-Lynen-Str. 25
 D-81377 Munich
 Germany
 Phone:     +49-89-2180-76845
 Fax:     +49-89-2180-76999
 E-mail:    kostr...@genzentrum.lmu.de
 WWW:    www.genzentrum.lmu.de
 ***

-- 

***
Dirk Kostrewa
Gene Center Munich
Department of Biochemistry
Ludwig-Maximilians-Universität München
Feodor-Lynen-Str. 25
D-81377 Munich
Germany
Phone:     +49-89-2180-76845
Fax:     +49-89-2180-76999
E-mail:    kostr...@genzentrum.lmu.de
WWW:    www.genzentrum.lmu.de
***

Re: [ccp4bb] modified amino acids in the PDB

2013-07-09 Thread MARTYN SYMMONS

Hi Clemens
   I guess the reason you say 'arbitrary' is because there is no explanation of 
this rule decision? 
  It would be nice if some rationalization was available alongside the values 
given. So a sentence along the lines of 'we set the number owing to the 
following considerations' ? 

  However a further layer of variation is that the rule does not seem to be 
consistently applied
 - just browsing CYS modifications:
   iodoacetamide treatment gives a CYS with only 4 additional atoms but it is 
split off as  ACM.
   However some ligands much larger than 10 residues have been kept with the 
cysteine ( for example CY7 in 2jiv and NPH in 1a18.  
   My betting is that it depends on whether something has been seen 'going 
solo' as a non-covalent ligand previously so that it pops up as an atomic 
structural match with a pre-defined three-letter code.
  This would explain for example the ACM case which you might expect to occur 
in a modified Cys.  But it has also been observed as a non-polymer ligand in 
its own right so goes on as a separate modification?
   However to be honest I am not sure I have ever seen the rationale for this 
written down. 

  'Non-polymer' heterogens can turn up either linked or not. Once they are in 
the residues they have to make a call on which kind of backbone they will 
feature in within the pdb.
  That is why there is  'D5M' for non-polymer deoxyAMP. Also known as ' DA' 
when it is 'DNA-linking' but so far not fessing up to life under a third code 
as 'RNA-linking' 

Now is perhaps the time to ask for explanations of these nomenclature features 
before they become hard-wired in the new pdb deposition system (however there 
may be time - I refer you to my previous posting ;). 
  
 Cheers
    Martyn 
  
Dr Martyn Symmons
Cambridge



 From: Michael Weyand michael.wey...@uni-bayreuth.de
To: CCP4BB@JISCMAIL.AC.UK 
Sent: Monday, 8 July 2013, 10:03
Subject: [ccp4bb] modified amino acids in the PDB
 

Dear colleagues,

We deposited protein structures with modified lysine side chains and
were surprised that the PDB treats the modification as an independent
molecule, with a “LINK” record indicating the covalent bond – instead of
defining a modified residue (that’s what we had uploaded to the PDB).
Apparently, anything attached to an amino acid is considered an
independent molecule (and the lysine just called a regular lysine) if it
comprises more than 10 atoms (see below for the PDB guidelines).

I think that’s kind of arbitrary and would give all modified residue
also modified names – i.e. individual names for all modified lysines, as
it is done for acetyl- or methyl-lysines, for example. I wonder what
other people’s opinion is?!

Best regards

Clemens



This is in accordance to the wwPDB annotation guidelines
(http://www.wwpdb.org/procedure.html#toc_2).
*Modified amino acids and nucleotides* If an amino acid or nucleotide
is modified by a chemical group greater than 10 atoms, the residue will
be split into two groups: the amino acid/nucleotide group and the
modification. A link record will be generated between the amino
acid/nucleotide group and the modification. For modified amino acids and
nucleotides that were not split will follow standard atom nomenclature.

Re: [ccp4bb] modified amino acids in the PDB

2013-07-09 Thread MARTYN SYMMONS

Hiya Mark
   once again I find myself asking why not give the authors deposited file 
alongside any other 'cleaned-up' PDB files. 
 The authors' one could be foo.pdb_0 then after PDB heterogen annotation you 
get foo.pdb_1 - if the heterogens are subsequently handled differently then you 
could have foo.pdb_2 for the 'remediated' file. 
  This latter has certainly happened as there appear to be 'orphan' heterogen 
definitions in the pdb that are not currently used by any entries - most likely 
these were 'split' when atoms were taken out and associated with polymer 
entities, or in some cases, I notice, 'lumped' when they have been included 
with residues alongside to give a new larger heterogen.
  PDB file versioning would also give transparency in cases such as occupancy 
remediation when the pdb altered all occupancies that summed to 1.0 to enforce 
a  total of 1.0 (giving holes in the authors' density presumably) which may 
mystify the sharp-eyed user.    
 Currently there are the REV_DAT lines in the header but these give only the 
titles of changed cards not any detailed explanation. Likewise REVDAT only 
starts at foo.pdb_1 in my example - still missing the authors original intent.  
  all the best
   Martyn 


 From: Mark J van Raaij mjvanra...@cnb.csic.es
To: CCP4BB@JISCMAIL.AC.UK 
Sent: Tuesday, 9 July 2013, 15:23
Subject: Re: [ccp4bb] modified amino acids in the PDB
 

- really the only complicated case would be where a group is covalently linked 
to more than one amino acid, wouldn't it? Any case where only one covalent link 
with an is present could (should?) be treated as a special amino acid, i.e. 
like selenomethionine.
- groups without any covalent links to the protein are better kept separate I 
would think (but I guess this is stating the obvious).

Mark J van Raaij
Lab 20B
Dpto de Estructura de Macromoleculas
Centro Nacional de Biotecnologia - CSIC
c/Darwin 3
E-28049 Madrid, Spain
tel. (+34) 91 585 4616
http://www.cnb.csic.es/~mjvanraaij





On 9 Jul 2013, at 12:49, Frances C. Bernstein wrote:

 In trying to formulate a suggested policy on het groups
 versus modified side chains one needs to think about the
 various cases that have arisen.
 
 Perhaps the earliest one I can think of is a heme group.
 One could view it as a very large decoration on a side
 chain but, as everyone knows, one heme group makes four
 links to residues.  In the early days of the PDB we decided
 that heme obviously had to be represented as a separate group.
 
 I would also point out that nobody would seriously suggest that
 selenomethionine should be represented as a methionine with a
 missing sulfur and a selenium het group bound to it.
 
 Unfortunately all the cases that fall between selenomethionine
 and heme are more difficult.  Perhaps the best that one must
 hope for is that whichever representation is chosen for a
 particular case, it be consistent across all entries.
 
                          Frances
 
 P.S. One can also have similar discussions about the representation
 of microheterogeneity and of sugar chains but we should leave those
 for another day.
 
 =
                 Bernstein + Sons
 *   *       Information Systems Consultants
     5 Brewster Lane, Bellport, NY 11713-2803
 *   * ***
  *            Frances C. Bernstein
  *   ***      f...@bernstein-plus-sons.com
 ***     *
  *   *** 1-631-286-1339    FAX: 1-631-286-1999
 =
 
 On Tue, 9 Jul 2013, MARTYN SYMMONS wrote:
 
 Hi Clemens
    I guess the reason you say 'arbitrary' is because there is no explanation 
of this
 rule decision? 
   It would be nice if some rationalization was available alongside the 
values given.
 So a sentence along the lines of 'we set the number owing to the following
 considerations' ? 
   However a further layer of variation is that the rule does not seem to be
 consistently applied
  - just browsing CYS modifications:
    iodoacetamide treatment gives a CYS with only 4 additional atoms but it 
is split
 off as  ACM.
    However some ligands much larger than 10 residues have been kept with the 
cysteine
 ( for example CY7 in 2jiv and NPH in 1a18.  
    My betting is that it depends on whether something has been seen 'going 
solo' as a
 non-covalent ligand previously so that it pops up as an atomic structural 
 match with
 a pre-defined three-letter code.
   This would explain for example the ACM case which you might expect to 
occur in a
 modified Cys.  But it has also been observed as a non-polymer ligand in its 
 own right
 so goes on as a separate modification?
    However to be honest I am not sure I have ever seen the rationale for 
this written
 down. 
   'Non-polymer' heterogens can turn up either linked or not. Once they are 
in the
 residues they have to make a call on which kind of backbone they will 
 feature in
 within the pdb.
   That is why there is  'D5M

Re: [ccp4bb] PDB secondary structure assignments

2013-07-04 Thread MARTYN SYMMONS

Authors can supply the secondary structure assignments to the PDB and these 
trump the PDBs own determinations and will appear in the authors' entry.

Remarks 650 (Helix) and 700 (Sheet) are then included which allow the 
documentation of the origins of the secondary structure definitions and any 
useful details from the authors on the determination methods used (presumably 
down to the trivial ' this looked prettier to me than the DSSP ones').

For the record you can check the pdb annotation manual 
(http://deposit.rcsb.org/depoinfo/download/dp.pdf) that seems to indicate the 
secondary structure is cooked up by the in-house 'maxit' program.

However there are some delightful sketch drawings (pages 92 to 93) showing how 
the annotators are supposed to work stuff out with a pencil-and-paper.

This may explain cases where the annotators have included material in the 
secondary structure lines which were presumably helpful comments that should've 
been trimmed out 

For example in  entry 3pcy:

HELIX    1   A ALA A   52  SER A   56  1ONLY TURN OF HELIX IN MOLCULE      5    
SHEET    1   I 4 GLY A  10  VAL A  15  0                                        
SHEET    2   I 4 ILE A   1  ALA A   7 -1  N  GLY A   6   O  ALA A  13           
SHEET    3   I 4 GLU A  25  ASN A  32  1  O  LYS A  26   N  ILE A   1           
SHEET    4   I 4 GLY A  67  LEU A  74 -1  O  GLU A  68   N  ASN A  31           

which is, I suppose useful, if you were looking for the 'only turn of helix in 
molcule'?

Of course we can expect these quirks to be a thing of the past when the long 
promised wwPDB 'Common Deposition and Annotation Tool' is rolled out. 

This Common Tool is apparently now 'nearing readiness for testing by external 
users' (http://www.wwpdb.org/wwpdbac-pages/wwPDBAC2012_report.pdf).

Hmm...
Previously advertised as:
'completion is on track for 2012.' 
(http://www.wwpdb.org/wwpdbac-pages/wwPDBAC_2011_Report.pdf)

or that 'substantial completion in 2011 is likely.' 
http://www.wwpdb.org/wwpdbac-pages/wwpdbac-2010-report.pdf


or perhaps more pertinently:

'In early 2010, the annotation module will be put into service,'

(http://www.wwpdb.org/wwpdbac-pages/wwpdbac-2010-report.pdf)


Perhaps as in many things it is often better to travel hopefully than to 
arrive... 

Cheers
  Martyn

Martyn Symmons
Cambridge UK



 From: Mark J van Raaij mjvanra...@cnb.csic.es
To: CCP4BB@JISCMAIL.AC.UK 
Sent: Wednesday, 3 July 2013, 16:00
Subject: Re: [ccp4bb] PDB secondary structure assignments
 

you are probably right, but the depositor can override the automatic secondary 
structure assignment by submitting his own. Perhaps this is what happened here.
I would recommend the original poster to generate them by hand by inspection of 
the structure plus density if possible - i.e. not trust any program...
Or, in case he wants to do it for a lot of structures and inspection of them 
all is not feasible, regenerate them all with the same version of his favourite 
program.


Mark J van Raaij
Lab 20B
Dpto de Estructura de Macromoleculas
Centro Nacional de Biotecnologia - CSIC
c/Darwin 3
E-28049 Madrid, Spain
tel. (+34) 91 585 4616
http://www.cnb.csic.es/~mjvanraaij





On 3 Jul 2013, at 16:46, Robbie Joosten wrote:

 Hi Miha,
 
 I thought the PDB actually uses DSSP. Perhaps it is a different version,
 there have been some new releases recently. Anyway, there is no reason why
 you should stick to the assignment of the PDB. If another program gives
 slightly different results you can use those as long as you make sure it is
 obvious which program you used (cite the program).  
 The next CCP4 release will have the official dssp (which is used to make the
 DSSP databank).
 
 Cheers,
 Robbie
 
 
 
 -Original Message-
 From: CCP4 bulletin board [mailto:CCP4BB@JISCMAIL.AC.UK] On Behalf Of
 Pavšic, Miha
 Sent: Wednesday, July 03, 2013 16:01
 To: CCP4BB@JISCMAIL.AC.UK
 Subject: [ccp4bb] PDB secondary structure assignments
 
 Dear CCP4BB members,
 
 what is the usual practice regarding secondary structure assignments when
 preparing publication figures of protein structures and topology diagrams
 from deposited PDB files? The deposited PDB files already contain such
 assignments in the header section (using PROMOTIF?). Should these
 assignments be obeyed or is it common to used other software/algorithms
 (e.g., DSSP and Stride). In my case assignments using DSSP result in
 slightly
 differ from PDB assignments in the regions of short alpha-helical
 structure
 (corresponding to stretch of 4 aa residues).
 
 Thank you for your suggestions.
 
 Regards,
 Miha
 
 
 **
 Miha Pavsic, Ph.D.
 University of Ljubljana
 Faculty of Chemistry and Chemical Technology Chair of Biochemistry Cesta v
 Mestni log 88a
 SI-1000 Ljubljana
 Slovenia
 
 e-mail miha.pav...@fkkt.uni-lj.si
 skype mihapavsic
 phone (lab) +386 1 2419 488
 fax +386 1 2419 487
 **

Re: [ccp4bb] showing electron density on an ipad

2013-05-01 Thread MARTYN SYMMONS

Dear Dave
    I have found the Molsoft ICM browser useful for showing students 
structures. There is a freely distributed version available for both ipad and 
android. I have not used it for maps but I believe it can display them. 
  SGC Oxford originally showed me this software and have great examples 
displaying their structures on their web site and in their PLoS articles. Sorry 
but  I have not used the ipad version - I think you have to pay a nominal fee 
for that.
 All the best
  Martyn    



 From: David Roberts drobe...@depauw.edu
To: CCP4BB@JISCMAIL.AC.UK 
Sent: Wednesday, 1 May 2013, 20:56
Subject: [ccp4bb] showing electron density on an ipad
 

Hello all,

So, I find an ipad is a wonderful device for teaching (any tablet really - but 
I'm partial to the ipad).  I can project it in a classroom, run pymol and a few 
other chemistry/biochemistry things, and really get the students interested in 
these subjects easily.  I actually don't have a laptop - and our classrooms are 
such that there are computers connected to the projectors but they have 
standard University software packages installed on them.

It would be very helpful if I could just display electron density using an 
ipad.  The pymol app will load a map (it is an option) - but when I take a map 
from my linux machines it doesn't work.  Any thoughts here?  Has anybody done 
this

Thanks

Dave

Re: [ccp4bb] Puzzling Structure

2013-04-30 Thread MARTYN SYMMONS

And as a footnote*
-
* I notice that, following discussion on this bulletin board, the space group 
of the 2wlv coordinate file has been updated.

 But at this point did it not occur to the PDB to change the space group in the 
structure factor file as well? So the coordinates indicate P21 21 2 but the 
structure factor file still says P21 21 21... Hmm consistency is highly 
over-rated...

Also the author's SF data listing starts with c axis reflections including 0, 
0, l odd as well as even, which is sort of a hint here



 From: MARTYN SYMMONS martainn_oshioma...@btinternet.com
To: CCP4BB@JISCMAIL.AC.UK 
Sent: Tuesday, 16 April 2013, 14:30
Subject: Re: [ccp4bb] Puzzling Structure
 


Just as a postscript.

It has been pointed out to me that the space group was correct in the uploaded 
PDB and in the mtz reflection file. And there was no editing of this data 
pre-deposition. 

I am sort of surprised how quickly people are to beat up on the authors in such 
cases since we do not have access to the full data and facts.

Having the uploaded data available would allow better checking. I'm mainly 
ignorant of the details, but isn't this the way sequence data is dealt with? - 
there are the unannotated sequence files which are gradually checked and 
assimilated (but not over-written).   

Cheers 
  Martyn 



 From: Michel Fodje michel.fo...@lightsource.ca
To: CCP4BB@JISCMAIL.AC.UK 
Sent: Monday, 15 April 2013, 17:19
Subject: Re: [ccp4bb] Puzzling Structure
 

Just to round up this topic, a bug report was submitted to PDBe concerning 
entry 2wlv and the PDB has how responded acknowledging the problem. An updated 
entry will be available in one week.

As pointed out by Savvas, it is very likely the CRYST1 record was manually 
edited prior to deposition resulting in the wrong spacegroup being parsed in by 
AutoDep and subsequent processing moving the waters around in the wrong space 
group. Since there is no logical reason for the authors to do this, it was 
probably inadvertent. I imagine somebody accidentally deleted a space between P 
21 21 2 and 18 and tried to fix
 it by adding it back, between 1 and 8.  

/Michel

-Original Message-
From: CCP4 bulletin board [mailto:CCP4BB@JISCMAIL.AC.UK] On Behalf Of
Edward A. Berry
Sent: April-14-13 9:59 AM
To: CCP4BB@JISCMAIL.AC.UK
Subject: Re: [ccp4bb] Puzzling Structure

Robbie Joosten wrote:
 Hi Martyn,

     I think the question is where the error was made - seeing the
 uploaded file would clear this up. But it seems unlikely to me that
 the depositor saw a huge R factor discrepancy at the end of refinement
and just blithely uploaded it.
 So scenario 3 :-
 PDB : we cannot reproduce your R factor with our programs Depositor
 :
 that's your problem mate - it was fine when it left me...up to you to sort
it...
     Which seems a sort of reasonable attitude to me.

 Not quite, the depositor has to give, i.e. type, the space group (example
depositions: https://www.ebi.ac.uk/pdbe- 
xdep/autodep/AutoDep?param=QovCsvhNv06Mpnr%2BvIkqqfuqeeBd8leFN
AVymZgS%2Fe7mULyfrCaTMN8jsyaGZUTyUDQyN3gMF3o%3D). Don't ask me
why, because it is clearly a source of error.


In my experience with RCSB autodep2, assuming the information was in the
uploaded pdb file, this information is already pre-filled and the depositor 
just
glances over to see it is correct. A missing or extra (1) might not be 
noticed.
So perhaps it is a parsing error, perhaps related to the different ways the
space group is represented on the CRYST1 card, and the different stringency
of different programs in interpreting the CRYST1 card.

But the validation report is included with the approval request letter, and
major discrepancies are noted in the text of the validation letter in case the
depositor is too busy to actually read the validation report.
So if the
 depositor read more than the first line or two of the letter he should
have known there was a problem.

Then there is the 2-week default release policy:

  No major issues were raised during data processing. A summary of some of
the key annotations
  in your entry is shown below. Please verify that these are correct. If we do
not hear from
  you within two weeks, we will consider this entry to have been approved by
you. The entry
  will then be released according to the release/hold instructions you have
provided.

If this 2-week default release is the rule even when there are issues, the
depositor may have put it aside to find the problem when time was available,
and waited too long and let the default release kick in.

eab

Re: [ccp4bb] Puzzling Structure

2013-04-16 Thread MARTYN SYMMONS

Just as a postscript.

It has been pointed out to me that the space group was correct in the uploaded 
PDB and in the mtz reflection file. And there was no editing of this data 
pre-deposition. 

I am sort of surprised how quickly people are to beat up on the authors in such 
cases since we do not have access to the full data and facts.

Having the uploaded data available would allow better checking. I'm mainly 
ignorant of the details, but isn't this the way sequence data is dealt with? - 
there are the unannotated sequence files which are gradually checked and 
assimilated (but not over-written).   

Cheers 
  Martyn 



 From: Michel Fodje michel.fo...@lightsource.ca
To: CCP4BB@JISCMAIL.AC.UK 
Sent: Monday, 15 April 2013, 17:19
Subject: Re: [ccp4bb] Puzzling Structure
 

Just to round up this topic, a bug report was submitted to PDBe concerning 
entry 2wlv and the PDB has how responded acknowledging the problem. An updated 
entry will be available in one week.

As pointed out by Savvas, it is very likely the CRYST1 record was manually 
edited prior to deposition resulting in the wrong spacegroup being parsed in by 
AutoDep and subsequent processing moving the waters around in the wrong space 
group. Since there is no logical reason for the authors to do this, it was 
probably inadvertent. I imagine somebody accidentally deleted a space between P 
21 21 2 and 18 and tried to fix it by adding it back, between 1 and 8.  

/Michel

-Original Message-
From: CCP4 bulletin board [mailto:CCP4BB@JISCMAIL.AC.UK] On Behalf Of
Edward A. Berry
Sent: April-14-13 9:59 AM
To: CCP4BB@JISCMAIL.AC.UK
Subject: Re: [ccp4bb] Puzzling Structure

Robbie Joosten wrote:
 Hi Martyn,

     I think the question is where the error was made - seeing the
 uploaded file would clear this up. But it seems unlikely to me that
 the depositor saw a huge R factor discrepancy at the end of refinement
and just blithely uploaded it.
 So scenario 3 :-
 PDB : we cannot reproduce your R factor with our programs Depositor :
 that's your problem mate - it was fine when it left me...up to you to sort
it...
     Which seems a sort of reasonable attitude to me.

 Not quite, the depositor has to give, i.e. type, the space group (example
depositions: https://www.ebi.ac.uk/pdbe- 
xdep/autodep/AutoDep?param=QovCsvhNv06Mpnr%2BvIkqqfuqeeBd8leFN
AVymZgS%2Fe7mULyfrCaTMN8jsyaGZUTyUDQyN3gMF3o%3D). Don't ask me
why, because it is clearly a source of error.


In my experience with RCSB autodep2, assuming the information was in the
uploaded pdb file, this information is already pre-filled and the depositor 
just
glances over to see it is correct. A missing or extra (1) might not be 
noticed.
So perhaps it is a parsing error, perhaps related to the different ways the
space group is represented on the CRYST1 card, and the different stringency
of different programs in interpreting the CRYST1 card.

But the validation report is included with the approval request letter, and
major discrepancies are noted in the text of the validation letter in case the
depositor is too busy to actually read the validation report.
So if the depositor read more than the first line or two of the letter he 
should
have known there was a problem.

Then there is the 2-week default release policy:

  No major issues were raised during data processing. A summary of some of
the key annotations
  in your entry is shown below. Please verify that these are correct. If we do
not hear from
  you within two weeks, we will consider this entry to have been approved by
you. The entry
  will then be released according to the release/hold instructions you have
provided.

If this 2-week default release is the rule even when there are issues, the
depositor may have put it aside to find the problem when time was available,
and waited too long and let the default release kick in.

eab

Re: [ccp4bb] Puzzling Structure

2013-04-13 Thread MARTYN SYMMONS

Dear Robbie

A shame then that these 'helpful' annotators did not make use of  Pavel's basic 
sanity on the space group (*mentioned below) and check back to the one listed 
in the uploaded PDB file. 

I often wonder why the PDB does not make the deposited coordinate file publicly 
available so that these sorts of issues can be checked and tracked. Depositors' 
coordinate files nowadays are the output of reputable, documented programs.  

The whole PDB data (excluding the EMDB that was recently merged into it) 
amounts to about a laptop hard drive's worth of data  - so surely space can be 
made for the deposited coordinates? (and restraint files which will be very 
useful for other workers including pdb-redo).

Structural genomic groups could also take a lead here by making their final 
refined coordinates and restraint files available alongside the PDB data. 

Having the depositors' uploaded data would help me understand other puzzling 
features of structures such as the current 4GRV.pdb which seems to have a list 
of TLS groups but contains not a single ANISOU line!...  

Cheers
  Martyn 

*In this particular case attempting to calculate R-factor using data and model 
files and making sure that the R you get is not twice as large as published one 
would entirely suffice -:)

Pavel



 From: Robbie Joosten robbie_joos...@hotmail.com
To: CCP4BB@JISCMAIL.AC.UK 
Sent: Friday, 12 April 2013, 22:57
Subject: Re: [ccp4bb] Puzzling Structure
 

Waters are moved during annotation using the perceived space group's
symmetry operation. So if the authors give the wrong space group, then the
annotation pipeline understandably messes things up. If the originally
uploaded PDB file was kept by PDBe, then the problem can be recovered quite
easily by the annotators. Perhaps the topic starter, Michel Fodje, can send
a bug report to PDBe. In my experience, the annotators are very helpful
resolving these matters.  

potential flame
Hoping that the depositors solve the problem by themselves, is probably in
vain: There are many crystallographers who do not read the CCP4BB (which is
a shame, really); they didn't notice the enormous amount of water related
bumps in their final model (which is in the validation report you get after
deposition and in REMARK 500 of the PDB file you have to approve); they also
didn't notice the huge number of symmetry-related bumps; the R-factors in
the PDB file are different from (and better than) the ones in Table 1. Also
notice that the paper was submitted on April 21st 2009 and the model was
deposited on June 29th 2009. Paper accepted on July 8th 2009. But I'm sure
the referees had a chance to properly assess the quality of the structure
model ;-)
/ potential flame

Cheers,
Robbie

P.S. It's pretty awesome that the problem was solved in less than 20 minutes
by the CCP4BB (that is, by Phoebe Rice)


 -Original Message-
 From: CCP4 bulletin board [mailto:CCP4BB@JISCMAIL.AC.UK] On Behalf Of
 Garib N Murshudov
 Sent: Friday, April 12, 2013 21:39
 To: CCP4BB@JISCMAIL.AC.UK
 Subject: Re: [ccp4bb] Puzzling Structure
 
 It is typo:
 R factor for p212121 - 0.4
                for p21212    - around 0.18
 
 Although water seem to have been moved around using p212121
 
 
 
 
 On 12 Apr 2013, at 16:33, Phoebe A. Rice wrote:
 
 
     Looks like a typo to me: if you change the CRYST space group record
 from P212121 to P21212, as the paper says it is, the packing problem goes
 away.
 
     ++
 
     Phoebe A. Rice
     Dept. of Biochemistry  Molecular Biology
     The University of Chicago
 
     773 834 1723; pr...@uchicago.edu
     http://bmb.bsd.uchicago.edu/Faculty_and_Research/
 
     http://www.rsc.org/shop/books/2008/9780854042722.asp
 
     
     From: CCP4 bulletin board [CCP4BB@JISCMAIL.AC.UK] on behalf of
 Michel Fodje [michel.fo...@lightsource.ca]
     Sent: Friday, April 12, 2013 2:17 PM
     To: CCP4BB@JISCMAIL.AC.UK
     Subject: Re: [ccp4bb] Puzzling Structure
 
     By the way, you will need to show symmetry atoms to see the
 problem.
 
 
 
         -Original Message-
 
 
         From: CCP4 bulletin board [mailto:CCP4BB@JISCMAIL.AC.UK]
 On Behalf Of
 
 
         Michel Fodje
 
 
         Sent: April-12-13 1:14 PM
 
 
         To: CCP4BB@JISCMAIL.AC.UK
 
 
         Subject: [ccp4bb] Puzzling Structure
 
 
 
         Has anyone else noticed a problem with the structure  of the
 N-terminal
 
 
         capsid domain of HIV-2  PDB 2wlv.
 
 
 
         Load it up to in coot and navigate to residue B118.
 
 
 
 
 
         /Michel.
 
 
 Dr Garib N Murshudov
 Group Leader, MRC Laboratory of Molecular Biology Francis Crick Avenue
 Cambridge Biomedical Campus Cambridge
 CB2 0QH UK
 Email: ga...@mrc-lmb.cam.ac.uk
 Web http://www.mrc-lmb.cam.ac.uk http://www.mrc-lmb.cam.ac.uk/

Re: [ccp4bb] PDBe website update: What's new

2013-03-14 Thread MARTYN SYMMONS

Ah yes a new logo...

But it might be of interest to hear in this forum what has happened to the CCP4 
logo that used to appear on the PISA and Fold pages in acknowledgement of the 
support from the BBSRC on the underlying algorithms for these services.

It is only by acknowledgement of the contribution of projects such as CCP4 that 
we can establish a continuing funding stream for the improvement of scientific 
software. I distinctly remember that this was acknowledged by the presence of 
the CCP4 logo on appropriate pages in the old PDBe website. Surely there is 
still space for that acknowledgement too?

Yours perplexedly,

Martyn 

Dr Martyn Symmons
Cambridge



 From: Gary Battle bat...@ebi.ac.uk
To: CCP4BB@JISCMAIL.AC.UK 
Sent: Thursday, 14 March 2013, 10:27
Subject: [ccp4bb] PDBe website update: What's new
 

Dear all,

Twice a year, the Protein Data Bank in Europe (PDBe; http://pdbe.org) releases 
new and improved tools and services. Our first website update of 2013 features:

1. A new weekly overview of new biology in the PDB
2. Improved searching of EMDB entries with rapid filtering of
results
3. New features for visual analysis of EMDB entries
4. Improved pages with chemical-shift-validation information for NMR
entries
5. Easy sharing of PDBe pages



1. A new weekly overview of new biology in the PDB.

You can now view instances of new biology in the PDB including Pfam
families, GO terms and UniProt entries that appear in the archive
for the first time. For example, this week’s release includes the
first structure ever from a member of the Alternative Oxidase Pfam
sequence family
(http://www.ebi.ac.uk/pdbe/searchResults.html?display=latesttab=xref). 
Note that two kinds of new biology are shown: cases where an existing Pfam 
family etc. maps to a newly released structure in the PDB for the first time, 
and cases where an existing PDB entry maps to a new entry in Pfam, GO, UniProt, 
CATH or SCOP.

2. Improved searching of EMDB entries with rapid filtering of
results.

We have made searching the EMDB easier by implementing search
filters that enable you to rapidly narrow down the set of search
results. Filters based on experimental details, journal, organism
etc. can be applied when searching (http://pdbe.org/emsearch) or browsing 
the contents of the archive (http://pdbe.org/embrowse).

3. New features for visual analysis of EMDB entries.

We have introduced several new features to aid visual analysis and
validation of EMBD entries, including:

* A volume-estimate graph that shows how the enclosed volume of a map 
varies with the contour level (e.g. http://pdbe.org/emd-5357/analysis).
* A Fourier-shell correlation (FSC) curve used to estimate the 
resolution of single particle maps (e.g. http://pdbe.org/emd-5357/analysis).
* A residue-based atom-inclusion chart that can be used to assess the 
quality of fit of PDB models to an EMDB map (e.g.
* http://pdbe.org/emd-2017/analysis).
* An image showing the overlay of any deposited masks on a map, that 
show segmentations or some particular feature of an entry (e.g. 
http://pdbe.org/emd-1206/analysis).
(Note that not all features are available or applicable for all entries.)

4. Improved pages with chemical-shift-validation information for NMR
entries.

We have improved the presentation of information on the validation
of chemical shifts with VASCO. The redesigned webpages now show
statistics on the assigned chemical shifts, any referencing
corrections and a list of atoms with unusual chemical shifts values
(e.g. http://www.ebi.ac.uk/pdbe-apps/nmr/vasco/searchEntry?pdbCode=2knr).

5. Easy sharing of PDBe pages.

Regular visitors of the PDBe website may have noticed some small
changes to the headers of our pages. The new headers include
feedback and sharing buttons on every PDBe webpage. If you want to
share what you find on one of our pages with friends or colleagues,
use the share button to post the URL on Facebook, Twitter, etc. or
to email it. If you have suggestions for improvements, please let us
know by using the feedback button.

You may also have noticed our new logo. Based on a motif that is
found at all levels of structure, we hope that our new logo will
quickly become synonymous with the provision of high‐quality
information about 3D molecular and cellular structure. Our new logo
and guidelines for its use are available from our website at 
http://pdbe.org/logo

As always, we welcome your comments and suggestions through the
feedback button at the top of every PDBe webpage.

Gary.


-- 
Gary Battle
Protein Data Bank in Europe (PDBe) http://www.facebook.com/proteindatabank 
http://twitter.com/PDBeurope

Re: [ccp4bb] strict structure based alignment

2012-07-16 Thread MARTYN SYMMONS

Hi -  SSM algorithm at PDBeFold will do this sort of thing


http://www.ebi.ac.uk/msd-srv/ssm/

I wrote a tutorial for multiple alignment to go with a Phaser story

http://www.ebi.ac.uk/pdbe-apps/quips?story=Phaserauxpage=MultiplePDBeFoldminitutorial

The last page of this tells you where to get an multiple fastA alignment of the 
superimposed structures and explains the format. 


Hope that helps.
  Martyn 


Martyn Symmons

EBI Cambridge

    





 From: Eric Pettersen p...@cgl.ucsf.edu
To: CCP4BB@JISCMAIL.AC.UK 
Sent: Monday, 16 July 2012, 2:17
Subject: Re: [ccp4bb] strict structure based alignment
 

On Jul 13, 2012, at 4:00 PM, Christian Roth wrote:

I want align a couple or protein structures by secondary structure matching to 
one target and want get a kind of aminoacid alignment file e.g. what residue 
fit 
the other, without adjustments due to sequence based alignments. 
I tried Strap, but as far as I understood it, it takes also the sequence into 
account. I tried also Rapido, but this does only a pairwise comparison. 
Superpose does align it nicely (ccp4 based or Coot based) but there seems to 
be no option to print the sequence alignment in a file and it is again  just a 
pairwise comparison .
Is there an other program which does something similar?

If you use UCSF Chimera (www.cgl.ucsf.edu/chimera), you can use the MatchMaker 
tool to superimpose the structures.  MatchMaker allows you to adjust the weight 
of sequence similarity vs. secondary structure matching, so you can just make 
the sequence similarity 0% and the secondary structure 100%.  With the 
structures superimposed, you can use the Match-Align tool to generate a 
sequence alignment based solely on the proximity of residues to one another in 
space.  Be warned that Match-Align will be very slow for 10+ structures, but 
is fine for half a dozen or so.  The generated alignment will be displayed in a 
window that will have a File menu where you can save the alignment to a 
variety of common formats.

--Eric

Eric Pettersen
UCSF Computer Graphics Lab

[ccp4bb] studentship posting

2011-12-16 Thread MARTYN SYMMONS

Posted on behalf of Dr Callaghan - please contact her directly.


4 year Ph.D. Studentship on RNase structure and metabolic control.
School of Biological Sciences.
University of Portsmouth.

Supervisor: Dr Anastasia Callaghan 

Co-supervisor: Dr Sarah Newbury (Sussex Medical School)

Understanding how metabolism is controlled within a cell is fundamentally 
important and is directly applicable to medical, environmental and 
biotechnological advances. 

Our studies have recently identified that a molecule of central metabolism 
interacts with an RNase and affects its ability to destroy mRNA. This project 
will unravel the details of this newly discovered mechanism and investigate 
whether it represents a conserved metabolite-RNase communicative link in 
prokaryotes and eukaryotes. 

Training will be provided in a range of molecular biology, biochemical, 
biophysical, and structural techniques. 

Please see: www.sebnet.org.uk  www.findaphd.com

Informal enquires welcome: anastasia.callag...@port.ac.uk

Application deadline: 31st January 2012

Re: [ccp4bb] [off topic] Control of crystals' direction and position in the drop.

2011-11-15 Thread MARTYN SYMMONS

Hi there 

 one thing I did to get more space for needles in a drop was to make 
'book end' coverslips. Your take a plastic covership and with a razor blade you 
cut obliquely into the plastic to make a 'flap' of the plastic. Then you lever 
it up to vertical to give a projecting vertical surface. You can then apply the 
drop to the angle between the flap and the coverslip. This supports the drop so 
that it does not flatten so much during equilibration. Making the cut takes a 
bit of trial and error but the plastic coverslips are not that expensive. It 
certainly gave me some chunkier needles and gave them space to grow upwards. 
Otherwise they flattened down as the droplet flattened with equilibration and 
often they 'glued' themselves to the horizontal surface.


    Hope it helps
    Martyn 


Martyn Symmons
Cambridge




From: Frederic VELLIEUX frederic.velli...@orange.fr
To: CCP4BB@JISCMAIL.AC.UK
Sent: Monday, 14 November 2011, 22:24
Subject: Re: [ccp4bb] [off topic] Control of crystals' direction and position 
in the drop.

Hi,

Your email mentions drop.

What about trying another technique where you do not have drops, such as the 
liquid interface diffusion method (in capillaries), or the use of dialysis ? 
Crystallisation 
under oil (injection of the 2 components under oil) could also be tried.

Fred

 Message du 14/11/11 22:15
 De : Nian Huang 
 A : CCP4BB@JISCMAIL.AC.UK
 Copie à : 
 Objet : [ccp4bb] [off topic] Control of crystals' direction and position in 
 the drop.
 
 Dear All,
 Does anybody find a way to control a crystal's positioning in the drop? I
 have needle shaped crystals. What I found out is that the vertical
 positioned crystals always grow much thicker than the crystals laying
 flatly. But unfortunately, it is a completely random event and only 1%
 crystals can appear vertically. I have tried different formats of plates
 and equilibration techniques w/o much success. Any suggestion is highly
 appreciated.
 
 Nian Huang, Ph.D.
 UT Southwestern Medical Center

Re: [ccp4bb] [off topic] Control of crystals' direction and position in the drop.

2011-11-15 Thread MARTYN SYMMONS

Attached is a picture of making the bookend setup - step b. shows the razor 
levering up the flap from the plastic coverslip  - be careful not to cut right 
through (I have done that and the drop dries extra quick in such a case) also 
be careful to apply the protein/well droplet as shown with to the shiny side of 
the lifted flap. (Apologies for the attachment). 

hope your crystals go well.

Martyn 


From: Nian Huang huangn...@gmail.com
To: CCP4BB@JISCMAIL.AC.UK
Sent: Tuesday, 15 November 2011, 17:44
Subject: Re: [ccp4bb] [off topic] Control of crystals' direction and position 
in the drop.


Thank you guys. I basically tried almost everything that I can find in the 
hampton catalogue and in this bulletin, seeding, hanging, sitting, sandwitch 
drops (which made things worse), temperature, gel, oil batch, and additive 
screens. Manipulating the crystal with fiber increases the chance to make it 
grow twin. The book end coverslip sounds a fantasic idea. It might be just 
going to work.

Best,

Nianattachment: bookend.jpg

Re: [ccp4bb] Protein sequencing service?

2010-11-25 Thread MARTYN SYMMONS

I have had excellent service from Mike Weldon (m.a.wel...@bioc.cam.ac.uk) at 
PNAC University of Cambridge Biochemistry. He will accept outside orders 
subject to capacity being available.

http://www.bioc.cam.ac.uk/pnac/

Cheers
  Martyn 

Martyn Symmons
EBI - Cambridge
 
 On 24/09/2010 4:46, Rex Palmer wrote:
  Can anyone please recommend a UK protein sequencing
 service. Our protein
  is dimeric with reported molecuar weight of about
 85kDa.Our funds are
  somewhat limited.
  Thanks in advance.
  Rex Palmer
  Birkbeck College, London
  RexPalmer2010.homestead.com
 
 --         Andreas Förster,
 Research Associate
         Paul Freemont  Xiaodong
 Zhang Labs
 Department of Biochemistry, Imperial College London
             http://www.msf.bio.ic.ac.uk

Re: [ccp4bb] protein turns brown

2010-09-24 Thread MARTYN SYMMONS

and possibly the opposite to reduction which is oxidation - do you have cys 
residues? Perhaps your DTT or TCEP got exhausted? Remember you can add up to 
10mM (not the traditional token 1mM). But remember to neutralize TCEP. Other 
possibility is adventitious metals as these give strong charge transfer bands 
on binding to cys.

So make sure you include EDTA or similar chelator (after metal affinity 
obviously). Oxidation is more likely if your protein unfolds - so try to keep 
it happy with nicely buffered pH and any ligands that might stabilize it.
   Good luck. 
 Martyn
Martyn Symmons
Cambridge

--- On Fri, 24/9/10, vikrant saa powervikr...@yahoo.co.in wrote:

From: vikrant saa powervikr...@yahoo.co.in
Subject: Re: [ccp4bb] protein turns brown
To: CCP4BB@JISCMAIL.AC.UK
Date: Friday, 24 September, 2010, 10:50


sometime it does happen becoz of protein aggregation, or reducing environment.
but it may be your protein color as well that visible during concentration 
 
 
Vikrant


 
 
 






From: sandeep toskgu...@rediffmail.com
To: CCP4BB@JISCMAIL.AC.UK
Sent: Fri, 24 September, 2010 2:04:54 PM
Subject: [ccp4bb] protein turns brown

Dear all,

I have purified protein from E.coli. expression system. the protein has been 
purified with three independant columns. Now during concentration step using 
amicon, the protein shows brown colour. what could be the reason.

best regards and Thanks,
sandy

Re: [ccp4bb] Effect of NCS on estimate of data:parameter ratio

2010-09-23 Thread MARTYN SYMMONS

Dear All
one thing I remembered from what Gerard pointed out was the difference 
in the XPLOR/CNS formalism between strict and restrained which is not a 
continuum. Restrained was obviously when you had multiple copies and they were 
restrained with a weight (which was like a force constant) to be similar when 
superimposed. So if you increase the force constant then they can move during 
refinement but they all try to move together when they move. 

And the other extreme is strict where there was no force applied at all but 
only a single copy of the chain and the ASU is built by applying the NCS 
symmetry. The atoms are free to move but, unlike the case with restrained where 
there is superimposition on the fly, in the strict case there is no automatic 
update of the superimposition matrices. So every move gets religiously copied 
to all the chains when the ASU is made. At this point I guess the copies can 
bump and so apply a force on each other but that is a local, and likely to be 
perturbing, force. 

best wishes
 Martyn 

Martyn Symmons
Cambridge



--- On Thu, 23/9/10, Ian Tickle ianj...@gmail.com wrote:

 From: Ian Tickle ianj...@gmail.com
 Subject: Re: [ccp4bb] Effect of NCS on estimate of data:parameter ratio
 To: CCP4BB@JISCMAIL.AC.UK
 Date: Thursday, 23 September, 2010, 11:21
 Hi Gerard  Pavel
 
 Isn't this the proviso I was referring to, that one cannot
 in practice
 use an infinite weight because of rounding errors in the
 target
 function.  The weight just has to be 'big enough' such
 that the
 restraint residual becomes sufficiently small that it's no
 longer
 significant.
 
 In numerical constrained optimisation the method of
 increasing the
 constraint weights (a.k.a. 'penalty coefficients') until
 the
 constraint violations are sufficiently small is called the
 'penalty
 method', see http://en.wikipedia.org/wiki/Penalty_method .  The
 method
 where you substitute some of the parameters using the
 constraint
 equations is called (you guessed it!) the 'substitution
 method', see
 http://people.ucsc.edu/~rgil/Optimization.pdf . 
 There are several
 other methods, e.g. the 'augmented Lagrangian method' is
 very popular,
 see 
 http://www.ualberta.ca/CNS/RESEARCH/NAG/FastfloDoc/Tutorial/html/node112.html
 .  As in the penalty method, the AL method adds
 additional parameters
 to be determined (the Lagrange multipliers, one per
 constraint)
 instead of eliminating some parameters using the constraint
 equations;
 however the advantage is that it removes the requirement
 that the
 penalty coefficient be very big.
 
 The point about all these methods of constrained
 optimisation is that
 they are in principle only different ways of achieving the
 same
 result, at least that's what the textbooks say!
 
 And now after the penalties and substitutions it's time to
 blow the whistle ...
 
 Cheers
 
 -- Ian
 
 On Wed, Sep 22, 2010 at 10:00 PM, Pavel Afonine pafon...@lbl.gov
 wrote:
   I agree with Gerard. Example: it's unlikely to
 achieve a result of
  rigid-body refinement (when you refine six
 rotation/translation parameters)
  by replacing it with refining individual coordinates
 using infinitely large
  weights for restraints.
  Pavel.
 
 
  On 9/22/10 1:46 PM, Gerard DVD Kleywegt wrote:
 
  Hi Ian,
 
  First, constraints are just a special case of
 restraints in the limit
  of infinite weights, in fact one way of
 getting constraints is simply
  to use restraints with very large weights
 (though not too large that
  you get rounding problems). These
 'pseudo-constraints' will be
  indistinguishable in effect from the 'real
 thing'.  So why treat
  restraints and constraints differently as far
 as the statistics are
  concerned: the difference is purely one of
 implementation.
 
  In practice this is not true, of course. If you
 impose infinitely strong
  NCS restraints, any change to a thusly restrained
 parameter by the
  refinement program will make the target function
 infinite, so effectively
  your model will never change. This is very
 different from the behaviour
  under NCS constraints and the resulting models in
 these two cases will in
  fact be very easily distinguishable.
 
  --Gerard
 
 
 **
                            Gerard J.
  Kleywegt
    Dept. of Cell  Molecular Biology
  University of Uppsala
                    Biomedical Centre  Box
 596
                    SE-751 24 Uppsala
  SWEDEN
 
     http://xray.bmc.uu.se/gerard/  mailto:ger...@xray.bmc.uu.se
 
 **
    The opinions in this message are fictional.
  Any similarity
    to actual opinions, living or dead, is purely
 coincidental.
 
 **

Re: [ccp4bb] PEG in the pdb?

2010-08-12 Thread MARTYN SYMMONS

Dear Klaus
 depends how many missing residue you want advertised in the 
header of your deposited file. The pdb will put in a special Remark 610 and 
Remark 615 to indicate any missing residues from your heterogen list. However 
the policy is not to list every missing (or zero occupancy) atom. 
  I have fitted fragments of pegs into density - so I know there are defined 
hetero residues for several sizes of small fragments. But if you name a PEG as 
a larger fragment than you can see then, as each PEG would be only one residue, 
effectively you get only one line in a remark 610 for each PEG with missing 
bits. 
  Zero occupancy is generally a deprecated way of dealing with missing density 
as it is confusing for less experienced user of the coordinates. I think zero 
occupancy can be useful during refinement as the atoms help fill space (or for  
example satisfy NCS restraint format requirement) but then these atoms can be 
stripped out before deposition. They should in any case never be included in 
B-factor refinement as they will skew the statistics and possibly the B-factor 
restraint model.   
  It seems that large numbers of missing heterogen atoms were anticipated when 
the format was drawn up - hence the absence of a requirement to list all 
heterogen atoms that are missing. 
  Hope that helps. 
    Martyn 

EBI, Cambridge



--- On Thu, 12/8/10, Klaus Sengstack sengstack-kl...@yahoo.de wrote:

From: Klaus Sengstack sengstack-kl...@yahoo.de
Subject: [ccp4bb] PEG in the pdb?
To: CCP4BB@JISCMAIL.AC.UK
Date: Thursday, 12 August, 2010, 9:16

Hi everybody,

I just solved the structures of an enzyme an some variants. In the active site 
cavity of each variant I found one or two fragments of PEG1000 bound. I used 
PEG1000 in the crystallization condition. Among the enzyme variants the number 
of non-hydrogen atoms of these PEG fragments varies between 7 and 19 atoms. Now 
I want to deposit the structures in the pdb and my question is, if I have to 
define each fragment as a single ligand (what would be a lot of work) or can I 
define them as PEG1000 molecules? Thanks.

K.S.

Re: [ccp4bb] PEG in the pdb?

2010-08-12 Thread MARTYN SYMMONS

That's a good point, Ed

Based on the formula: HO CH2-(CH2-O-CH2)n-CH2OH, PEG MW = 44n+62, a table of n 
to MW goes like below which gives an idea of what range is possible. Someone 
maybe knows what polydispersity can be expected from the synthetic process. As 
you say though, a specific range could partition out of the bulk to the protein 
surface. And the main point of the original post was that only a subset of 
atoms in the bound PEG may be ordered enough to see. 
 n   MW
 5, 282.
 6, 326.
 7, 370.
 8, 414. 
 9, 458. 
 10, 502.
 11, 546.
 12, 590.
 13, 634.
 14, 678.
 15, 722.
 16, 766.
 17, 810.
 18, 854.
 19, 898. 
 20, 942.
 21, 986. 
 22, 1030. 
 23, 1074. 
 24, 1118.
 25, 1162.
 26, 1206. 
 27, 1250. 
 28, 1294. 
 29, 1338. 
 30, 1382.
 31, 1426. 
 32, 1470. 
 33, 1514. 
 34, 1558.
 35, 1602.
 36, 1646.
 37, 1690.
 38, 1734.
 39, 1778.
 40, 1822.
 41, 1866. 
 42, 1910. 
 43, 1954.
 44, 1998.
 45, 2042.
 46, 2086.
 47, 2130. 



--- On Thu, 12/8/10, Ed Pozharski epozh...@umaryland.edu wrote:

 From: Ed Pozharski epozh...@umaryland.edu
 Subject: Re: [ccp4bb] PEG in the pdb?
 To: CCP4BB@JISCMAIL.AC.UK
 Date: Thursday, 12 August, 2010, 17:35
 PEG solutions contain fragments of
 all sizes - it is the average size
 (however defined by the manufacturer) that is 1000. 
 So technically it
 is incorrect to claim that you have PEG1000 molecules bound
 to your
 protein, it is most likely much shorter fragments that can
 penetrate the
 channels in protein crystals.
 
 It's not a lot of work to generate monomer libraries for
 peg fragments
 of different length (and some are available from standard
 monomer
 libs).  
 
 I always wondered why PEG is not defined in the standard
 libraries as a
 polymer - perhaps because it is rarely needed.  Or is
 it?
 
 Ed.
 
 On Thu, 2010-08-12 at 08:16 +, Klaus Sengstack wrote:
  Hi everybody,
  
  I just solved the structures of an enzyme an some
 variants. In the
  active site cavity of each variant I found one or two
 fragments of
  PEG1000 bound. I used PEG1000 in the crystallization
 condition. Among
  the enzyme variants the number of non-hydrogen atoms
 of these PEG
  fragments varies between 7 and 19 atoms. Now I want to
 deposit the
  structures in the pdb and my question is, if I have to
 define each
  fragment as a single ligand (what would be a lot of
 work) or can I
  define them as PEG1000 molecules? Thanks.
  
  K.S.
  
  
 
 -- 
 I'd jump in myself, if I weren't so good at whistling.
                
            
    Julian, King of Lemurs

Re: [ccp4bb] How to distinguish microcrystalline, quasicystalline and precipitation?

2010-06-10 Thread MARTYN SYMMONS

Dear Joy
  - microcrystalline in most cases would be birefringent - this is 
difficult to observe these days in our plastic boxed crystallization world with 
colours everywhere. 

But if you really want to know you can pipette the droplet onto a glass 
coverslip and dim the room lights. Set the polarizers _exactly_ to extinction. 
Then you can see if the precipitate glows. 

Of course it could be a cubic space group crystal which I heard does not show 
BR.

And it could be micro-spherulites which are indeed quasicrystalline - these are 
strongly bireringent - maybe _too_ strongly is the clue here! If any get to a 
reasonable size then you can see a X-like pattern in the centre of these since 
they appear to be ordered in spherical shells of unfolded protein. 

One other clue can be the density - most protein crystals are not far off the 
droplet density. So if the precipitate settles strongly into a circle at the 
bottom of the droplet then I would say this is a bad sign :(

Hope this helps.

regards
    Martyn Symmons

EBI, Hinxton, Cambs. UK 

 

--- On Thu, 10/6/10, joybeiyang joybeiy...@gmail.com wrote:

From: joybeiyang joybeiy...@gmail.com
Subject: [ccp4bb] How to distinguish microcrystalline, quasicystalline and 
precipitation?
To: CCP4BB@JISCMAIL.AC.UK
Date: Thursday, 10 June, 2010, 7:06



 
 
#yiv1007907758 BLOCKQUOTE {
MARGIN-TOP:0px;MARGIN-BOTTOM:0px;MARGIN-LEFT:2em;}
#yiv1007907758 OL {
MARGIN-TOP:0px;MARGIN-BOTTOM:0px;}
#yiv1007907758 UL {
MARGIN-TOP:0px;MARGIN-BOTTOM:0px;}

 
Hi everyone, I am preparing a crystallization 
manual for our group, however, I found that it is very difficult to 
distinguish 
microcrystalline, quasicrystalline and precipitation, especially when the 
precipitation was shiny, like the grit on the beach. Is there a way to 
distinguish the three? 
 
Comments and suggestions will be greatly 
appreciated!




Joy

[ccp4bb] Fw: [ccp4bb] degradation of protein durring freez thaw

2010-04-23 Thread MARTYN SYMMONS

Sounds like good advice - I suppose that since pure water un-freezes last, you 
could have protein concentration inhomogeneities? So in areas where the protein 
concentration is raised it could start to feel like crashing out.


All the best
Martyn

Martyn Symmons
Cambridge



- Original Message 
From: Frank Niesen dr.frank.nie...@gmail.com
To: CCP4BB@JISCMAIL.AC.UK
Sent: Friday, 23 April, 2010 10:40:59
Subject: Re: [ccp4bb] degradation of protein durring freez thaw

An aspect that is sometimes overlooked is that the need to avoid mini 
freeze-thaw cycles does not only call for a quick, snap-freeze process for 
samples (in thin-walled PCR tube and not too large volume; 50-100 ul I'd 
recommend), but - in turn - also for quick thawing: I prefer holding the PCR 
tube under running cold water straight after removal from the freezer, and 
stick into the ice bucket only after I see that all of the sample is thawed.
Best,
Frank

Re: [ccp4bb] Phasing statistics

2010-04-15 Thread MARTYN SYMMONS

Sorry - I had not twigged that this was a SAD discussion. 

In this case DM is as you say gonna save you from bimodal ambiguities - as we 
all know that is why the FOMs get so much better during DM - but that seems 
fine to me as it is pretty much new information coming in from the solvent 
flattening, histogram matching, probably some averaging. 

So that improvement is not 'artifactual' but in fact part of the experimental 
detail of just how the structure was solved. I guess FOMs are really only 
useful in the Crick and Blow centroid case.

Still think any and all info on the experiment(s) is useful to see - if only 
for people to plan their own experiments building on past experience.

Best wishes and regards
Martyn 

Martyn Symmons
Cambridge

 



- Original Message 
From: Ian Tickle ianj...@gmail.com
To: CCP4BB@JISCMAIL.AC.UK
Sent: Wednesday, 14 April, 2010 15:42:51
Subject: Re: [ccp4bb] Phasing statistics

I have to say that I don't share James' enthusiasm for the FOM as a
useful statistic, even for experimental phases, and particularly not
in the SAD (and SIR) cases, which after all is what this thread is
supposed to be about.   This is even without delving further into the
murky issues of wildly inflated FOM estimates from DM that he and
others have raised.

My point is that the definition of FOM is that it's the *expected*
cos(phase error), as opposed to *actual* cos(phase error) and the
'expected' bit makes all the difference!  It's essentially the
difference between precision and accuracy, i.e. the FOM tells you how
precise the phase estimate is and not at all how accurate it is: for
the latter you need to calculate the errors relative to the phases of
the final model, as James suggested.  The FOM is roughly related
(anti-correlated to be precise) to the variance: this is clear if we
take the case of small phase deviations, then cos(x) expands to
(1-x^2/2) and the variance is x^2, where x is the deviation from
the mean (NOT the same thing as the error!).  So just like the
variance it measures the 'peakedness' of the distribution: a FOM=1
corresponds to variance=0, i.e. an infinitely sharp distribution
(delta function).

Now a particular problem arises in SAD (and SIR) because then in
general we always get a bimodal distribution: i.e. 2 separate peaks.
If these peaks happen to separated by 180 deg then the FOM is 0
(cos(90)=0).  A large separation is most likely to occur when the
anomalous contribution is large, when of course you expect the phase
estimates to be optimal, for example see Fig 2(a) in Wang et al., Acta
Cryst. (2007). D63, 751–758.  Conversely if the anomalous contribution
is small, so the phase estimates are poor, the separation between the
2 estimates is small and the FOM is close to 1!  So we apparently have
a situation where the *better* are the phases, the *lower* is the FOM!
What the Figure in the Wang paper ignores of course are the
experimental errors which will tend to broaden the distribution and
lower the FOM in the case where the anomalous contribution is small
relative to the errors.  Even so it means that the FOM is not a good
measure of phase quality.  Much better IMO is the phasing power (mean
heavy-atom amplitude / mean P-weighted lack of closure error).  This
essentially measures the degree to which the phase ambiguity is
capable of being successfully resolved by DM methods.

One other thing puzzles me: why do many programs that purport to
calculate average phase differences (relative to model phases say) use
statistics such as, well, average |phase difference| when we already
have a perfectly good measure in the FOM!, i.e. average cos(phase
difference)?  Then you would be able to directly compare the FOMs from
experimental phases with those from model phases.  Even better still
would be the average log likelihood: if the phase probability is exp(A
cos(delta_phi)) then the log likelihood is simply A cos(delta_phi) and
you just average that, i.e. it's essentially the averaged FOM-weighted
cos(phase difference), so that the average is weighted according to
the reliability of the phase estimates.  A poor phase estimate is
likely to be associated with a small F which is not going to
contribute much to the map anyway, so it makes no sense to give poor
phases the same weight as good phases in the average.

Cheers

-- Ian

On Tue, Apr 13, 2010 at 6:47 PM, James Holton jmhol...@lbl.gov wrote:
 Probably the only phasing stat that I pay any attention to these days is the
 Figure of Merit (FOM). This is because, the _definition_ of FOM is that it
 is the cosine of the phase error (or at least your best estimate of it).
  FOM=1 is perfect phases and FOM=0 is random phases, and a reasonable cutoff
 value for FOM is 0.5 (see Lunin  Woolfson, Acta D, 1993).  Yes, there are
 ways to get various programs to report very inaccurate values for FOM (such
 as running DM for thousands of cycles), and yes, there are often legitimate
 reasons to run these programs in this way

Re: [ccp4bb] Phasing statistics

2010-04-14 Thread MARTYN SYMMONS

I agree this is very interesting -

The 'real' FOM is a great idea (we should start a campaign for real FOMs!) 

What the 'real' FOM is trying to get at is the quality of the initial 
experimental map. This could also be subsequently calculated for NCS-averaged 
and density-modified maps subsequently to show how the building was achieved.

 Gamma correction and real space free residuals can be used to avoid 
overfitting (DM has these I know, prob other progs too). But the point that 
overfitting can help the crystallographer build a correct structure is a good 
one. Conversely I have heard of structures that are not just correct but also 
wonderfully original, built by talented crystallographers into initially 
apparently difficult-to-interpret maps. So we maybe need to celebrate what can 
be done; not use this as necessarily a criticism of the final structure.

One of the things I heard from F. VonD was 'don't get hung up on statistics - 
look at maps'. Any map is going to have ropey bits as well as easy bits - so 
the only way to convey that would not be one number like a FOM.  But more like 
a residue-by-residue real-space Rfactor of the _final_ model in the _original_ 
map?

In the real world ;) I don't see that happening anytime soon

all the best
   Martyn

Martyn Symmons
Cambridge





- Original Message 
From: Soisson, Stephen M stephen_sois...@merck.com
To: CCP4BB@JISCMAIL.AC.UK
Sent: Tuesday, 13 April, 2010 19:09:12
Subject: Re: [ccp4bb] Phasing statistics

This is an interesting thread, and perhaps I should not dive in on such
a heady topic, BUT, I do want to point out my own particular bias
regarding FOM that is not entirely consistent with James' point of view.
In my experience, the FOM obtained after density modification runs are
almost always extremely optimistic - I have seen relatively high
apparent FOM after density modification runs (0.7) that had nearly
uninterpretable maps. As such, I am much more interested in knowing
about the FOM from the phasing calculations themselves and NOT after
density modification.  

That said, applying arbitrary cut-offs to what would be deemed an
acceptable FOM, after phasing calculations, to generate maps that are
interpretable is not really a good thing to do.  For instance, I just
had a structure where the FOM was 0.35 after phasing (a rubbish
structure perhaps?), BUT, the data are highly redundant and the solvent
content in the high 70% range.  The post density modified maps are
stunningly good.  One could easily imagine many other scenarios (e.g.
NCS) where the modified maps and apparent FOM would be decoupled.

So, I do agree with James's suggestion that perhaps we should be
retrospectively calculating a real FOM between the final model and the
actual maps you built into (after whatever you did to get them).  This
seems like a very good idea indeed.

More personal biases revealed:  I actually look at, and use, the Cullis
R values on anomalous and isomorphous data to help determine how much
signal is in the data.  Simplified, these numbers are your Average
estimated lack-of-closure divided by your average observed difference.
That's an important thing to know, and I find them quite useful.

Steve

-Original Message-
From: CCP4 bulletin board [mailto:ccp...@jiscmail.ac.uk] On Behalf Of
James Holton
Sent: Tuesday, April 13, 2010 1:48 PM
To: CCP4BB@JISCMAIL.AC.UK
Subject: Re: [ccp4bb] Phasing statistics

Probably the only phasing stat that I pay any attention to these days is

the Figure of Merit (FOM). This is because, the _definition_ of FOM is 
that it is the cosine of the phase error (or at least your best estimate

of it).  FOM=1 is perfect phases and FOM=0 is random phases, and a 
reasonable cutoff value for FOM is 0.5 (see Lunin  Woolfson, Acta D, 
1993).  Yes, there are ways to get various programs to report very 
inaccurate values for FOM (such as running DM for thousands of cycles), 
and yes, there are often legitimate reasons to run these programs in 
this way.   But, there are also very wrong things one can do to get low 
Rmerge, Rcryst, and especially Rfree.  It is simply a matter of knowing 
(and reporting) what you are doing.


If you are worried that your favorite estimate of FOM is inaccurate, 
then you can always turn to your most accurate phases:  those of your 
final, refined model (the one that you have convinced yourself is 
right and ready to publish).  Taking these as the true phases, the 
true FOM can always be obtained by comparing the final-model phases to

those of your initial map (using PHISTATS or SFTOOLS).  This is by no 
means standard practice, but perhaps it should be?


Anyway, FOM is _supposed_ to be the cosine of the phase error, and is 
therefore the most relevant statistic when it comes to how good your 
phases were when you started building.  This is why it is important as a

reviewer to know what it is.  If I am faced with a structure that was 
built into a MAD map with initial FOM = 0.8 to 2

Re: [ccp4bb] Phasing statistics

2010-04-12 Thread MARTYN SYMMONS

Hiya Frank

Well my take on it would be that they did a certain experiment - which includes 
phasing datasets - and we have an expectation to see all the steps reported. 
Good structures are not the 'be all' and 'end all'. Reporting data is not 
merely to convince readers that this is not a 'made up' structure. It is like 
in mathematics when you don't just report the correct answer but your 'show 
your workings' as my old maths teacher used to say. So maybe phasing statistics 
are pretty important in that respect. 

And it would be good to have datasets with experimental phases deposited. I 
should've done this with my 1e3h years ago as the experimental phases (which, 
as I recall, you were a friendly advisor on ;) were included in the refinement 
target (CNS mlhl). SeMet phases are usually good - but ones from other heavy 
atoms it might be nice to see too. There used to be a heavy atom database for 
such like things - not sure of the current status of that.

see you
 all the best
   Martyn 

Martyn Symmons
Cambridge












From: Frank von Delft frank.vonde...@sgc.ox.ac.uk
To: CCP4BB@JISCMAIL.AC.UK
Sent: Monday, 12 April, 2010 12:24:50
Subject: Re: [ccp4bb] Phasing statistics

I fully agree, for high quality data.

What though if the data are not impeccable and the structure
necessarily ropey?  E.g. 4A phases and anisotropic diffraction.  By
what metrics do we then judge the results?

(I don't know the answer, btw, but our membranous colleagues surely
spend quite a bit of time with that question...)

phx.


On 12/04/2010 12:10, Anastassis Perrakis wrote: 
Hi -
 


A year or so ago, I have asked as a referee somebody to provide
for a paper the statistics for their heavy atom derivative dataset,
 
and for the phasing statistics. For some good reasons, they were
unable to do that, and they (politely) asked me
'what would it change if you knew these, isn't the structure we
present impeccable?'. Well, I think they were right.
Their structure was surely correct, surely high quality. After
that incident and giving it some thought, 
I fail to see why should one report e.g. PP or Rcullis, or why
will I care what they were if the structure has a convincing Rfree and
is properly validated. 
If someone wants to cheat at the end of the day, its easy to
provide two numbers, but its hard to provide a good validated model
that agrees with the data.
(and, yes, you can also make up the data, but we have been
there, haven't we?!?)


So, my question to that referee, likely being a ccp4bb
aficionado that is reading this email, or to anyone else really, is:


What would it help to judge the quality of the structure or the
paper if you know PP, Rcullis and FOM?


Best -


A.


PS Especially since you used SHELXE for phasing these statistics
are utterly irrelevant, and possibly you could advice the referee to
read a bit about how SHELXE works ... or go to one of the nice courses
that George teaches ...


On Apr 12, 2010, at 10:37, Eleanor Dodson wrote:

You can feed the SHELX sites into phaser_er or CRANK both of
which will 
give this sort of information.

Or mlphare if you know how to set it up..

Eleanor


Harmer, Nicholas wrote:

Dear CCP4ers,



I've been asked by a referee to provide the
phasing statistics for a SAD dataset that I used to solve a recent
structure. Whilst I have been able to find a figure-of-merit for the
data after phasing, I can't work out how to get any other statistics
(e.g. phasing power or an equivalent or Rcullis). Does anyone know a
good route to obtaining useful statistics to put in the paper for SAD
data?



The structure solution was carried out
using SHELX C/D/E and then ARP/wARP.



Thanks in advance,



Nic Harmer



=

Dr. Nic Harmer

School of Biosciences

University of Exeter

tel: +44 1392 725179




P please don't print this e-mail unless you really need to
Anastassis (Tassos) Perrakis, Principal
Investigator / Staff Member
Department of Biochemistry (B8)
Netherlands Cancer Institute, 
Dept. B8, 1066 CX Amsterdam, The Netherlands
Tel: +31 20 512 1951 Fax: +31 20 512
1954 Mobile / SMS: +31 6 28 597791

Re: [ccp4bb] 2,3-butanediol

2010-03-25 Thread MARTYN SYMMONS

One thing I found with 2,3-Butanediol was for some crystallizations you can 
make it up a cryo solution with it at 12% and then put a thin layer it on top 
of the crystal droplet in a sitting drop well. Then you can loop the crystal 
and pull it up through the cryo layer - then out and to N2(l).  Just how brief 
this is depends on your dexterity and also how well the crystal is engaged in 
the loop. My crystals were not very happy about the cryo but I could make it 
brief enough to get them out intact.

Good luck,

Martyn 

Martyn Symmons
MBU Cambridge




- Original Message 
From: Sean Seaver s...@p212121.com
To: CCP4BB@JISCMAIL.AC.UK
Sent: Wednesday, 24 March, 2010 18:47:38
Subject: Re: [ccp4bb] 2,3-butanediol

Dear all,
I would like to know which is the more suitable method of using 2,3-Butanediol 
as cryoprotectant? Addition in to reservoir buffer or crystal soaking or both 
can be tried?

Cheers
Gauri
--
I am sure both could be tried, but have found quite few papers mentioning brief 
soaking with 2,3-butanediol:

Purification, crystallization and X-ray diffraction analysis of the 
extracellular part of the human Fc receptor for IgA, Fc[alpha]RI (CD89)
http://scripts.iucr.org/cgi-bin/paper?cnor=bw5002
5-10 seconds

Purification, crystallization and X-ray analysis of Hibiscus chlorotic ringspot 
virus
http://scripts.iucr.org/cgi-bin/paper?cnor=gr2202
described as 'briefly'

Crystallization and preliminary X-ray analysis of recombinant human acid 
[beta]-glucocerebrosidase, a treatment for Gaucher's disease
http://scripts.iucr.org/cgi-bin/paper?cnor=pu0086
described as 'briefly'

Crystallization and preliminary X-ray crystallographic studies of recombinant 
human betaine-homocysteine S-methyltransferase
http://scripts.iucr.org/cgi-bin/paper?cnor=gr2115
'quickly transferred'

Purification, crystallization and X-ray analysis of Hibiscus chlorotic ringspot 
virus
http://scripts.iucr.org/cgi-bin/paper?cnor=pu0115
5% increments from 5 to 30% for 3 min at each step

Hope that helps,

Sean

Re: [ccp4bb] crystallization of a macromolecular complex

2010-03-05 Thread MARTYN SYMMONS

I agree this 3:1, 4:1, ... 10:1 etc is a good approach - especially for 
screening - as it samples more concentration conditions than the plain vanilla 
1:1 set-up. One generally good thing too is that the precipitant also starts at 
lower concentration - so less chance of setting up a load of trials and 
watching them immediately crash out (although obviously not a problem in this 
particular case!) .

I think the reason for differences from pre-prepared concentrated protein are 
often to do with other components that come in with the protein - so anything 
in the protein solution like buffer, salt, reductant, detergent, metal ions, 
etc will be concentrated up alongside the protein. 

cheers
Martyn 

Martyn Symmons, MBU Cambridge.

As an aside I heard that the 1:1 derives from the days when pipettes could not 
be trusted with such small volumes. The 1:1 ratio is the best chance to get a 
droplet with reproducible relative concentrations regardless of any systematic 
error in the exact volumes dispensed.

 







From: Jacob Keller j-kell...@md.northwestern.edu
To: CCP4BB@JISCMAIL.AC.UK
Sent: Friday, 5 March, 2010 17:02:15
Subject: Re: [ccp4bb] crystallization of a macromolecular complex

 

 
20 g/L is the same as 20 mg/ml, isn't it?  That does 
  not seem particularly high to me.
 
Why not try 200 g/L?
 
As a variant to 
this, you could just try doing 10:1 prot:ppt drops, and allowing the protein 
concentration to increase in situ. In my limited experience, however, this is 
not the same as setting up drops 1:1 with more concentrated protein, for 
reasons 
unknown to me. I wonder whether that is the experience of others on the BB as 
well?
 
JPK

Re: [ccp4bb] Per-residue RMSD for multiple structures

2010-02-23 Thread MARTYN SYMMONS

I did this recently for an analysis of the subunit variation in AcrB. I took 
the LSQMAN multi-RMSDs among the three chains after improved superimposition 
and edited them into the format that Rob Campbell's data2bfactor.py expected. 
And then I put them into the B-factor column of one subunit. Finally I invoked 
Rob Campbell's color_b.py in pymol to colour the molecular surface by them 
(Fig. 4 in http://www.pnas.org/content/106/17/7173/suppl/DCSupplemental). But I 
guess a backbone could be similarly. 

There are probably other ways to do this but Rob Campbell's scripts are a boon 
and a blessing

cheers,
Martyn 

Martyn Symmons
MRC-MBU



- Original Message 
From: Gerard DVD Kleywegt ger...@xray.bmc.uu.se
To: CCP4BB@JISCMAIL.AC.UK
Sent: Tuesday, 23 February, 2010 14:57:02
Subject: Re: [ccp4bb] Per-residue RMSD for multiple structures?

Thanks Stephen!

I was going to suggest that, but I was afraid of the self-appointed CCP4BB 
Gestapo that has been seen goose-stepping in this neighbourhood recently 
(Tassos recently accused me of becoming mellow and diplomatic in my dotage, so 
I hope I've set the record straight now). However, since this solution is 
neither CCP4 nor Phenix, we may get away with this heinous act of 
bulletin-board heresy... On the other hand, I've learned that it is often more 
expedient to beg for forgiveness than to ask for permission.

I would add that:

- I assume that the sequences and numbering are identical

- you should put the structures in one big PDB file and read it into LSQMAN

- since LSQMAN doesn't do true multiple-structure alignment, you could 
pre-align them, e.g. with SSM/PDBeFold

- if you didn't, you could indeed use the MCentral and MAlign commands to align 
them

- my favourite plot would be the CD plot (but then again, it would, wouldn't 
it?) - see for instance: 
http://xray.bmc.uu.se/cgi-bin/gerard/image_page.pl?image=usf/pics/cdplot_1ldn.gif
 - which is also produced with the MPlot command - 
http://xray.bmc.uu.se/usf/lsqman_man.html#S82 - a normal MPlot would look like 
this: 
http://xray.bmc.uu.se/cgi-bin/gerard/image_page.pl?image=usf/pics/mplot_1ldn.gif

- the output file of the normal MPlot command is in a form that can be quickly 
converted into an O datablock for those handy with an editor and familiar with 
O datablocks, and could then be used to ramp a model inside O

- you may also want to consider showing how the (main-chain or side-chain) 
torsion angles differ between the structures, e.g. by plotting the circular 
variance of phi and psi - see for instance 
http://xray.bmc.uu.se/cgi-bin/gerard/image_page.pl?image=usf/pics/vmain_1ldn.gif
 - as described here: http://xray.bmc.uu.se/usf/lsqman_man.html#S83 - or a 
multiple-model Ramachandran plot like this 
http://xray.bmc.uu.se/cgi-bin/gerard/image_page.pl?image=usf/pics/mrama_1ldn.gif
 (with the MRama command). The advantage is that no superposition is required 
at all and that any domain movements won't debeautify your results

--dvd




On Tue, 23 Feb 2010, Stephen Graham wrote:

 I am pretty sure you can do this using LSQMAN from Gerard (BluRay?) Kleywegt.
 
 The pertinent commands are MCENTRAL to determine the 'most
 representative structure' (i.e. the one to align upon and show in the
 figure), MALIGN to do the alignment and then MPLOT to calculate a
 'multi-RMSD' for each residue (see manual for details - set the
 'cut-off for printing' to 0 to get all values).
 
 Regards depiction, I think pymol can also represent structures as
 sausages based on their B values:
 cartoon putty
 show cartoon
 
 HTH,
 
 Stephen
 
 On 23 February 2010 01:31, Ethan Merritt merr...@u.washington.edu wrote:
 Hi all,
 
 I am comparing 4 very similar (1.5A rmsd) large (750 residues) structures,
 but struggling to find a way to generate a figure that conveys where they
 are most alike and where they diverge.
 
 Simply drawing a superimposed set of backbone traces results in what looks
 like colored spaghetti.  I don't think that's going to work.
 
 So I had the idea of drawing a single backbone trace, or ribbon diagram,
 and coloring by the RMSD of the four C-alphas at each residue position.
 But I can't find a program that will output this as a table of numbers
 I can use.  All of the multiple structure superposition programs must
 have this information internally.  After all, that's what they are 
 minimizing.
 But do any of the programs provide an option to write it out?
 
 I can get pairwise per-residue deviations by doing SSM superposition in Coot,
 but that doesn't get me to an RMSD for all four structures jointly.
 
Ethan
 
 
 --
 Ethan A Merritt
 Biomolecular Structure Center
 University of Washington, Seattle 98195-7742
 
 
 
 
 -- Dr Stephen Graham
 1851 Research Fellow
 Cambridge Institute for Medical Research
 Wellcome Trust/MRC Building
 Addenbrooke's Hospital, Hills Road
 Cambridge, CB2 0XY, UK
 Phone: +44 1223 762 638
 


Best wishes,

--Gerard

Re: [ccp4bb] smeared spot in diffraction

2010-01-21 Thread MARTYN SYMMONS

One other thought following from Kelly's point about room temp check is that 
most likely these are thin plates or needles. And I think it was Frank VonD who 
pointed out that some of these thin crystals can be physically stressed by the 
shape of the frozen cryo formed in the cryoloop. So choose loops with care - 
maybe go with litholoops?
 
Martyn
 
Martyn Symmons
MRC-MBU 
Cambridge




- Original Message 
From: Kelly Daughtry kddau...@bu.edu
To: CCP4BB@JISCMAIL.AC.UK
Sent: Thursday, 21 January, 2010 14:27:58
Subject: Re: [ccp4bb] smeared spot in diffraction

Fengxia,
To me the diffraction appears as if the crystals did not freeze well.
So the best option seems to be annealing (as already suggested) or to
try different cryo-protectants.

Did you perform a room-temperature mount? If so, were the spots nice
and round (suggesting the cryo is the cause of the smears)?

I would suggest growing your crystals in the presence of cryo as well.
Glycerol, PEG 400, all the usual suspects.

May I suggest 2-methyl-2,4-pentanediol (MPD). I have had success
adding to well (10 - 20 %) but not in crystal drop, then using 20% as
a cryo (MPD acting to slow diffusion to give better ordered crystals).

Kelly Daughtry

***
Kelly Daughtry
PhD Candidate
Department of Physiology and Biophysics
Boston University School of Medicine
590 Commonwealth Ave
R 390
Boston MA, 02215
(P) 617-358-5548
***



On Thu, Jan 21, 2010 at 9:07 AM, Brad Bennett bradbennet...@gmail.com wrote:
 Hi Fengxia-
 How about increasing the PEG% so you don't have to add as much other
 cryoprotectant?

 Also, have you tried annealing the crystals? I've had success with this when
 I've had samples diffract similarly. The simplest way is by blocking the
 cryostream for 2-3 seconds (repeat 1-2 times) and then shoot. More involved
 way is by actually dismounting your crystals back into cryo buffer for 20-30
 seconds, then remounting and shooting.

 HTH-
 Brad

 On Thu, Jan 21, 2010 at 1:54 AM, Fengxia Liu fengxia@gmail.com wrote:

 Dear all,

 I am trying to diffract one semet-protein, it gave me some clear spots and
 some smeared spots in one diffration, you can find maps attached.
 Mother liquor condition is 0.1MTris-HCL pH8.5+10% PEG4k + 0.2M Li2SO4, I
 have tried mother liquor+25%glycerol, paratone, paratone+ mineral oil, but
 they all gave me such pattern.
 Does anyone know how to solve this? I appreciate any advice.

 Thank you in advance.

 Best wishes,
 Fengxia

Re: [ccp4bb] where I have been going wrong in crystallization?

2009-12-22 Thread MARTYN SYMMONS

Thanks to those who replied.

The humour comes of course from a slight feeling that there is
some truth in the 'respectful' version. Proteins are wonderful subtle
machines so perhaps we should be careful and thoughtful when we expose them to 
the
horrors of most crystallization conditions!

Which takes me to the original point of my search - which was for a dialysis 
method for lysozyme. I was trying to set up a micro dialysis format for my 
protein but wanted to test it first.  I am using spectrapore 8K dialysis 
membrane and wondered if that would retain lysozyme enough to get crystals (I'd 
like to stick with this cut-off as the physical and sealing properties of other 
membranes might be different). Or if someone has a higher MW cheap alternative 
protein/condition?

Best wishes - agus 'Bliadhna Mhath Ur Dhuibh Uile' as we say in gaelic. 

  Martyn
-
Martyn Symmons
Cambridge

  



- Original Message 
From: David Briggs drdavidcbri...@gmail.com
To: CCP4BB@JISCMAIL.AC.UK
Sent: Monday, 21 December, 2009 17:26:15
Subject: Re: [ccp4bb] where I have been going wrong in crystallization?

Hi Martyn,

this recipe tends to work for me...

Lysozyme: 50 mg/ml in 0.1 M Sodium Acetate pH 4.8
Reagent: 8% w/v Sodium Chloride, 0.1 M Sodium Acetate pH 4.8
Additional Reagents: Index Reagent 8, 22, 28, 31, 34, 40, 58, 59, 69, 86, 88

Mix equal amounts of lysozyme with reagent, incubate at 4 or 22
degrees Celsius. Batch or vapor diffusion works fine.

== Direct copy/paste from http://hamptonresearch.com/experiments.aspx ==

HTH and Happy Christmas.

Cheers,

Dave


David C. Briggs PhD

University of Manchester E-mail:
david.c.bri...@manchester.ac.uk

Twitter: @xtaldave
Skype: DocDCB




2009/12/21 MARTYN SYMMONS martainn_oshioma...@btinternet.com:
 Dear All
  checking out the Lysozyme crystallization methods on the web I liked the 
 Rigaku Instructions that I found:
 (http://www.rigaku.com/protein/crystallization.html)

 ...create a drop of 3ul lysozyme solution, and 3 ul of well solution, 
 respectfully, for a total drop size of 6ul...

 So perhaps sometimes I am just not respectful enough to deserve crystals ?

good wishes to all
   regards,
 Martyn
 ---
 Martyn Symmons
 MRC-MBU Cambridge UK
 'Chan fhiosrach mur feòraich.'
 Gaelic proverb -
  Nothing asked, nothing learned.

[ccp4bb] where I have been going wrong in crystallization?

2009-12-21 Thread MARTYN SYMMONS

Dear All
  checking out the Lysozyme crystallization methods on the web I liked the 
Rigaku Instructions that I found:
(http://www.rigaku.com/protein/crystallization.html)

...create a drop of 3ul lysozyme solution, and 3 ul of well solution, 
respectfully, for a total drop size of 6ul...

So perhaps sometimes I am just not respectful enough to deserve crystals ?

good wishes to all 
   regards,
 Martyn 
---
Martyn Symmons
MRC-MBU Cambridge UK 
'Chan fhiosrach mur feòraich.'
Gaelic proverb -
  Nothing asked, nothing learned.

Re: [ccp4bb] 3D search for peptide conformers?

2009-12-15 Thread MARTYN SYMMONS

I think SPASM by Gerard Kleywegt is a great tool for this sort of thing - you 
can set the similarity matrix threshold to make it more or less sequence 
independent (it still calculates the sequence match which is good for looking 
for key residues in the motif like structurally required GLYs). You can also 
restrict it to just CA positions. And you can set the thresholds for the search 
to narrow down on the basis of rmsd.

J Mol Biol. 1999 Jan 29;285(4):1887-97.
Recognition of spatial motifs in protein structures.
Kleywegt GJ.

You can download the program and a search library - currently july 2008 pdb I 
think (culled to 99% seq ident. is the best one). But could be something more 
recent available?

Good luck
Martyn

Martyn Symmons - Cambridge

From: Patrick Loll pat.l...@drexel.edu
To: CCP4BB@JISCMAIL.AC.UK
Sent: Monday, 14 December, 2009 20:02:53
Subject: [ccp4bb] 3D search for peptide conformers?

I have a 10-residue stretch of a protein that adopts an interesting 
conformation; I'd like to know if this conformation occurs in other proteins. 
I'd welcome suggestions for tools that will allow me to to search for this 
peptide conformation in the PDB. 

I naturally thought of DALI, but it seems to require a minimum length of 30 
residues. 

Thanks,

Pat

---
Patrick J. Loll, Ph. D.  
Professor of Biochemistry  Molecular Biology
Director, Biochemistry Graduate Program
Drexel University College of Medicine
Room 10-102 New College Building
245 N. 15th St., Mailstop 497
Philadelphia, PA  19102-1192  USA

(215) 762-7706
pat.l...@drexelmed.edu

Re: [ccp4bb] Refining residues as rigid bodies

2009-12-03 Thread MARTYN SYMMONS

Discussing with Fred Vellieux offlist I suggested the following: would it be 
useful to have a sort of 'simulated annealing' of the B-factor values? - in 
refinement I often try resetting them to higher values (Moleman has a function 
for this I think) and then see which ones refine back down nicely - but it 
would be good to have some sort of randomization method - that would be a test 
of how much the values are inherited from the modelling early on - say from the 
MR probe model for example. 

We could do multiple B-factor refinements with different starting kicks to the 
values and look for the best final Rfree or some measure of map quality?

Martyn

Martyn F. Symmons
Cambridge
'Chan fhiosrach mur feòraich.'
Gaelic proverb - Nothing asked, nothing learned.

 






From: Pavel Afonine pafon...@lbl.gov
To: CCP4BB@JISCMAIL.AC.UK
Sent: Wednesday, 2 December, 2009 22:52:45
Subject: Re: [ccp4bb] Refining residues as rigid bodies

Hi,

you can do similar thing (that is resulting in similar outcome) in
phenix.refine by increasing the weight on ADP restraints term. Example:
increase wu or decrease wxu_scale. Although I believe a regular
refinement of individual isotropic ADPs should normally work just fine
at 3A resolution in phenix.refine. 

Pavel.


On 12/1/09 11:18 PM, Frederic VELLIEUX wrote: 
If the problem is B-factor refinement, you can do that easily at low resolution 
with CNS. You just give tight restraints.

You modify the file bindividual.inp

This section:
{* target sigma values for restrained B-factor refinement *}

{* mainchain bonds *}
{===} bsig_main=1.5;
{* mainchain angles *}
{===} asig_main=2.0;

{* sidechain bonds *}
{===} bsig_side=2.0;
{* sidechain angles *}
{===} asig_side=2.5;

I cannot remember in which direction the values have to go. I think up. I have 
done this with a very low resolution structures (4.5 A?) a few years ago, you 
get smoothly varying B values, very tightly restrained. I do not think we 
published that structure, we obtained a higher resolution structure later (I 
don't think the referees would have been very happy seeing B factor refinement 
at low resolution). But it worked.

Fred.

  
Message du 01/12/09 23:51
De : Jason C Porta 
A : CCP4BB@JISCMAIL.AC.UK
Copie à : 
Objet : Re: [ccp4bb] Refining residues as rigid bodies


Basically my reasoning for doing this is a low data-to-parameter ratio, which 
makes B-factor refinement unfeasible. So far I have had nice results with 
breaking the complex into rigid subdomains. So i was basically just thinking 
of a way I could refine the structure best, without using too many parameters.

I see now how this can be done in Phenix. I will give it a try.

Thanks for all of your suggestions!

Re: [ccp4bb] Refining residues as rigid bodies

2009-12-02 Thread MARTYN SYMMONS

IMHO this is a bad idea as the restraints can give undue weight to poorly 
ordered residues in order to favour the tighter distribution that you are 
specifying. 

I have found bindividual.inp to perform at low resolution reasonably well, and 
I think it is more realistic than group B-factors. The overall weighting on 
these B-factor restraints is usually determined automatically by CNS I think.

The target sigmas in the distributed bindividual.inp  file are too tight to 
start with and I would always follow previous suggestions on the bb - mainchain 
bond 2.0, mainchain angle 3.0, sidechain bond 4.5, sidechain angle 6.0. You can 
check the log file afterwards to see if the refinement is actually getting 
close to the values you set. Then you can manually set the B-factor restraint 
weighting if you are not happy with the program's choice. That would be a 
better way to restrain than changing the target distributions?

I think the PHENIX restrained B-factor refinement is very well thought out - 
based on ideas from Ian Tickle's analysis. And I would think that should be 
good at low resolution too - so perhaps worth a try? You can always do just 
B-factor refinement in PHENIX after geometric refinement in CNS if you want.

regards
  Martyn 

Martyn F. Symmons
Cambridge UK
'Chan fhiosrach mur feòraich.'
Gaelic proverb - Nothing asked, nothing learned. 









From: Frederic VELLIEUX frederic.velli...@orange.fr
To: CCP4BB@JISCMAIL.AC.UK
Sent: Wednesday, 2 December, 2009 8:56:01
Subject: Re: [ccp4bb] Refining residues as rigid bodies

Dear Bulletin Board,

I received this information from Axel Bruenger, who rightly corrected me on the 
modification to be made to the bindividual.inp file:

As an fyi, the sigmas should be made smaller to get a narrower B-factor 
differences.

My previous post:

 You modify the file bindividual.inp
 
 This section:
 {* target sigma values for restrained B-factor refinement *}
 
 {* mainchain bonds *}
 {===} bsig_main=1.5;
 {* mainchain angles *}
 {===} asig_main=2.0;
 
 {* sidechain bonds *}
 {===} bsig_side=2.0;
 {* sidechain angles *}
 {===} asig_side=2.5;
 
 I cannot remember in which direction the values have to go. I think up.

So the values have to go down. The precise values I used are somewhere on a DAT 
tape and we have no DAT reader here anymore...

Fred.

Re: [ccp4bb] off topic; alignment keeping upper and lower case

2009-09-16 Thread MARTYN SYMMONS

Dear All
  - I am looking for a multiple alignment program that will work 
with upper and lower case letters in the sequences and preserve them through to 
the output. I used to use PILEUP but that is no longer supported here. 
   Any suggestions much appreciated.
   regards 
Martyn 

Martyn F. Symmons 
Department of Pathology
University of Cambridge

'Gun fhaillin 'san fhairge..'
(said of Griomasaigh boats)

[ccp4bb] unknown PyMOL script author

2009-01-14 Thread MARTYN SYMMONS

Dear CCP4ers
Do you recognize your Python code in the script below? I got this from a script 
in the PyMOL wiki last year but now I can't locate it or identify the author. I 
have used this syntax extensively in some structural analysis that I am now 
writing up and I would like to credit the originator (PyMOL and the wiki both 
get a citation obviouslybut I checked and neither Warren Delano nor Jason 
Vertrees is the author). 
 
for a in cmd.index(A//37/CB):
for b in cmd.index(B//LYS/NZ):
cmd.select(s1,%s`%d%a)
cmd.select(s2,%s`%d%b)
if cmd.select((s1|s2) and not ?skip):
if cmd.dist(tmp,s1,s2)  20.0 :
  print '',cmd.iterate(s1|s2,print 
'',segi,resn,resi,name),round(cmd.dist(tmp,s1,s2),3)

Thanks
Best wishes
  Martyn 

Martyn Symmons
Department of Pathology
University of Cambridge

Re: [ccp4bb] Poor electron density - polyAla or PolyGly?

2008-10-16 Thread MARTYN SYMMONS

Dear Joe
  one thing that has helped me a lot with similar problems is G. 
Kleywegt's SPASM - you make your best guess of the mainchain then add a total 
guess of the sidechains (say with mutate in COOT - choosing the top rotamer). 
Then you cut out the loop and run this in SPASM as a search model looking for 
similar motifs in the pdb. You specify a low cut-off for the substitution 
matrix - to get lots of hits - and use high values (3.0-4.5A) for the initial 
positional errors for the residues - this means you get maybe 100 possible 
structurally similar loops from other proteins. Then you can sort the output on 
BLOSUM similarity score or on RMSD to your initial guess. The BLOSUM search 
criterion is the reason that you add the sidechains. If you get no hits with CA 
and SC (sidechain) then you can switch to CA only.  Or try with a more 
restricted bit of your loop. Or look through the output just for loops that 
share a structurally conserved Gly or Pro with
 your sequence.

If I get any possible hits I then superimpose them  on the electron density to 
see if they give hints how the sidechains might lie and also how the backbone 
H-bonding might run. There are lots of small motifs with conserved sidechain to 
mainchain H-bonding (there is a database of these in Glasgow I think) and so 
you pick up hints from similar loops more often than you might think - from 
completely unrelated proteins. 

I have a script to run SPASM if you want it. 

  Another approach is RAPPER which builds multiple a priori loops very quickly 
and sees if they fit the density. There used to be a RAPPER server... not sure 
if it is still active.
   
Good luck. 
  Martyn 





- Original Message 
From: Joe Smith [EMAIL PROTECTED]
To: CCP4BB@JISCMAIL.AC.UK
Sent: Wednesday, 15 October, 2008 4:11:01 PM
Subject: [ccp4bb] Poor electron density - polyAla or PolyGly?

Hello,
I have been building a protein model (resolution 2.2A) which has one
small loop of 6 residues having poor density. I cannot see any side
chain in this region but I can see relatively poor main-chain density
which at least clearly indicate the loop conformation. I am trying my
best to build polyAla chain in this region. But 3-4 residues end up
coming in disallowed region of Ramachandran plot. I think the problem
is - in this region i can not even build the main chain atoms with
high confidence.
Can I build polyGly in this region? or should I just leave this
region? I just don't feel like leaving this region empty.
Any suggestions in this regards is highly appreciated!
regards
Joe

[ccp4bb] Protein-protein docking

2008-02-12 Thread MARTYN SYMMONS

I would recommend giving Dave Ritchie's Hex program a go - it is blisteringly 
quick. It's limited to local searches so it is best if you have a reasonable 
idea where you expect the docking surface to be (mutations, NMR shift data 
etc). We used it to refine solutions from searches with crosslinking 
restraints. It works nicely interactively if you explore the receptor surface 
by centring the search on different residues. 

Best wishes
Martyn 

Martyn Symmons
Department of Pathology 
University of Cambridge

Re: [ccp4bb] oxidised cys

2007-04-13 Thread Martyn Symmons

A possible modification for cysteine that adds extra density is 
S-(dimethylarsenic) cysteine (CAS). Requires DTT and cacodylate buffer 
conditions however. And does not crosslink so far as I know. 
   
Has been seen in a number of structures from cacodylate conditions - eg. one of 
the Xrcc4 structures 

cheers
  Martyn 

Martyn Symmons
Department of Pathology 
University of Cambridge





 Message Received: Apr 12 2007, 06:06 PM
 From: Flip Hoedemaeker [EMAIL PROTECTED]
 To: [EMAIL PROTECTED]
 Cc: 
 Subject: Re: [ccp4bb] oxidised cys
 
 I've actually seen something like this on disulfides (or at least I think
 so, I havent seen your density obviously), turned out it was model bias in
 MR, if I used a different template for MR the feature went away. This was
 high resolution stuff (~1.0 Å).
 
 Flip
 
 -Original Message-
 From: CCP4 bulletin board [mailto:[EMAIL PROTECTED] On Behalf Of
 [EMAIL PROTECTED]
 Sent: Tuesday, April 10, 2007 20:44
 To: [EMAIL PROTECTED]
 Subject: Re: [ccp4bb] oxidised cys
 
 Hi Stefano,
 
 How certain are you that this link is truly what you think it is? If I
 understand what you're saying - you want to create a (thioperoxythio) link
 - this chemistry should be hideously unstable. Can you explain this using
 disorder, or perhaps the residual density is a symmetry artifact?
 
 Regards,
 
 Artem
 
  Dear all
  in my structure I think I can see an oxidised Cys in cys-SO. Refining
  cys-SO
  I observe a residual density between the oxigen of one oxidised cys and
  the
  one of the other molecule in AU.
  I'd like to try to refine it as cys-SO-OS-cys. I didn't find an example of
  it in the pdb database. Could anyone tell me whether there are other
  cases?
  I guess I just didn't find them.
  Second question:
  How could I explain to refmac that there is the OO bond?
  I tried to write a line similar to the one for SSBOND in the pdb header
  OOBOND   1 CEA A   42CEA D   42
  but refmac couldn't care less...
 
  thanks in advance
 
  Stefano
 
  _
  Express yourself instantly with MSN Messenger! Download today it's FREE!
  http://messenger.msn.click-url.com/go/onm00200471ave/direct/01/

Re: [ccp4bb] oxidised cys

2007-04-13 Thread Martyn Symmons

Dear John
 I think this is a thiol-specific reaction - where it has happened to 
Cys residues the Met residues appear normal. I wondered if anyone had ever used 
this on purpose as a heavy atom derivative. Arsenic has quite a good anomalous 
signal too I think. 
  All the best
  Martyn 

Martyn Symmons
Department of Pathology 
University of Cambridge




 Message Received: Apr 13 2007, 01:37 PM
 From: John Walker [EMAIL PROTECTED]
 To: [EMAIL PROTECTED]
 Cc: [EMAIL PROTECTED]
 Subject: Re: [ccp4bb] oxidised cys
 
 Can methionine be modified with these two reagents in a similar manner?
 
 Cheers,
 
 John
 
 -- 
 John R. Walker, Ph.D.
 Structural Genomics Consortium
 University of Toronto
 Toronto, Ontario
 Canada
 
 On 4/13/07, Martyn Symmons [EMAIL PROTECTED] wrote:
  A possible modification for cysteine that adds extra density is 
  S-(dimethylarsenic) cysteine (CAS). Requires DTT and cacodylate buffer 
  conditions however. And does not crosslink so far as I know.
 
  Has been seen in a number of structures from cacodylate conditions - eg. one 
  of the Xrcc4 structures
 
  cheers
Martyn
 
  Martyn Symmons
  Department of Pathology
  University of Cambridge
 
 
 
 
  
   Message Received: Apr 12 2007, 06:06 PM
   From: Flip Hoedemaeker [EMAIL PROTECTED]
   To: [EMAIL PROTECTED]
   Cc:
   Subject: Re: [ccp4bb] oxidised cys
 
   I've actually seen something like this on disulfides (or at least I think
   so, I havent seen your density obviously), turned out it was model bias in
   MR, if I used a different template for MR the feature went away. This was
   high resolution stuff (~1.0 Å).
 
   Flip
 
   -Original Message-
   From: CCP4 bulletin board [mailto:[EMAIL PROTECTED] On Behalf Of
   [EMAIL PROTECTED]
   Sent: Tuesday, April 10, 2007 20:44
   To: [EMAIL PROTECTED]
   Subject: Re: [ccp4bb] oxidised cys
 
   Hi Stefano,
 
   How certain are you that this link is truly what you think it is? If I
   understand what you're saying - you want to create a (thioperoxythio) link
   - this chemistry should be hideously unstable. Can you explain this using
   disorder, or perhaps the residual density is a symmetry artifact?
 
   Regards,
 
   Artem
 
Dear all
in my structure I think I can see an oxidised Cys in cys-SO. Refining
cys-SO
I observe a residual density between the oxigen of one oxidised cys and
the
one of the other molecule in AU.
I'd like to try to refine it as cys-SO-OS-cys. I didn't find an example of
it in the pdb database. Could anyone tell me whether there are other
cases?
I guess I just didn't find them.
Second question:
How could I explain to refmac that there is the OO bond?
I tried to write a line similar to the one for SSBOND in the pdb header
OOBOND   1 CEA A   42CEA D   42
but refmac couldn't care less...
   
thanks in advance
   
Stefano
   
_
Express yourself instantly with MSN Messenger! Download today it's FREE!
http://messenger.msn.click-url.com/go/onm00200471ave/direct/01/

70 matches

Mail list logo