Re: [ccp4bb] [phenixbb] Calculate average B-factor?

2014-10-06 Thread Tim Gruene
Dear Jose,

the question came up again because I did not receive an answer to my
question. The thread discussed benefits and malefits of PDB vs. mmCIF,
which was not my question.
This time, Nat Echolls gave a very reasonable answer (at least for
phenix) on the phenixbb, i.e., that there are no plans to abandon the
PDB format (as working format), but very likely a smooth transition will
take place - I guess this will be more slowly than the enforcement of
the PDB to upload PDBx/mmCIF files for archiving. I agree that for
archiving mmCIF is a reasonable format, but I guess less than 1% of all
structures in the PDB hit the limits of the PDB format.

I greatly appreciate Nat's answer and I would appreciate an answer from
the responsibles for the other refinement programs.

Best,
Tim

On 10/05/2014 08:05 PM, Jose Manuel Duarte wrote:
 Thanks Frances for the explanation. Indeed mmCIF format is a lot more
 complicated and grep can be a dangerous tool to use with them. But for
 most cases it can do the job and thus it maintains some sort of
 backwards compatibility. I can't agree more that using specialised tools
 (for either PDB files or mmCIF files) that deal with the formats
 properly is the best solution (see for instance
 http://mmcif.wwpdb.org/docs/software-resources.html for some of the
 mmCIF readers).
 
 In any case I find it most surprising that this topic came yet again to
 this BB, when it was thoroughly discussed last year in this thread:
 
 https://www.jiscmail.ac.uk/cgi-bin/webadmin?A2=ind1308L=ccp4bbD=0P=26939
 
 I'm not sure why this kind of urban legends on the evilness of the mmCIF
 format keep coming back to the list...
 
 As explained there and elsewhere endless times, the PDB format is
 inadequate to represent the complexity of macromolecules and has been
 needing a replacement for a long time. The decision to move on to mmCIF
 has been made and in my opinion the sooner we move forward the better.
 
 Cheers
 
 Jose
 
 
 
 On 05.10.2014 15:52, Frances C. Bernstein wrote:
 mmCIF is a very general format with tag-value pairs, and loops
 so that tags do not need to be repeated endlessly.  It was
 designed so that there is the flexibility of defining new terms
 easily and presenting the data in any order and with any kind
 of spacing.

 I understand that there are 10+ files in cyberspace prepared
 by the PDB and that they all have the 'same' format.

 It is tempting to write software that treats these files as fixed
 format and hope that all software packages that generate coordinate
 files will use the same fixed format.  But that loses the generality
 and flexibility of mmCIF, and software written that way will fail
 when some field requires more characters or a new field is added.
 There are software tools to allow one to read and extract data from
 any mmCIF file; using these is more complicated than using grep but
 using these assures that one's software will not fail when it encounters
 a date file that is not exactly what the PDB is currently producing.

 Note that mmCIf was defined when the limitations of the fixed-format
 PDB format became apparent with large structures.  Let's not repeat
 the mistakes of the past.

 Frances

 =
 Bernstein + Sons
 *   *   Information Systems Consultants
 5 Brewster Lane, Bellport, NY 11713-2803
 *   * ***
  *Frances C. Bernstein
   *   ***  f...@bernstein-plus-sons.com
  *** *
   *   *** 1-631-286-1339FAX: 1-631-286-1999
 =

 On Sun, 5 Oct 2014, Tim Gruene wrote:

 Hi Jose,

 I see. In the example on page
 http://mmcif.wwpdb.org/dictionaries/mmcif_pdbx_v40.dic/Categories/atom_site.html,

 it is in field 12, though, and I would have thought that mmCIF allows
 line breaks.

 But as long as all developers writing PDBx/mmCIF with their programs
 follow the PDB constraints (``styling plans'' in their FAQ), everything
 is fine.

 Cheers,
 Tim

 On 10/05/2014 01:13 PM, Jose Manuel Duarte wrote:
 Well, if you simply replace that beauty by this one:

 grep ^ATOM filename.cif | awk '{print $15}' | awk '{s+=$1;} END
 {print
 s/NR;}'

 You will achieve exactly the same result (the b-factors are in the 15th
 field of the _atom_site section in deposited mmCIF files). I'm not an
 expert in awk, but I'm sure that can be made even shorter ;)

 It is important to keep in mind that mmCIF files are designed to be
 usable with grep-like tools, so I don't see any problems in moving
 forward to that format. Whilst I see a lot of problems in staying with
 the classic PDB format.

 Cheers

 Jose



 On 05.10.2014 11:18, Tim Gruene wrote:
 Hi all,

 reading this beauty I would like to ask a question to the respective
 developers:
 Will the PDB format remain the working format for the users and only
 upon deposition will it be converted to PDBml for archiving
 purposes, or
 are the 

[ccp4bb] Faculty Position in Structural Biology at Purdue

2014-10-06 Thread Mesecar, Andrew D
Dear Colleagues,

I wanted to draw your attention to the following position announcement (Below) 
advertised in Science on Friday, October 3rd.

  http://jobs.sciencecareers.org/job/343203/position-in-structural-biology/


PURDUE
UNIVERSITY

Faculty Position in
Structural Biology

 The Purdue University Department of Biological Sciences in conjunction 
with the Center for Cancer Research and the Walther Cancer Foundation are 
seeking an outstanding scientist with a proven track record of excellence in 
the science of cancer structural biology to join our faculty as a Walther 
Professor in Structural Biology. A Walther Professor is expected to conduct 
research in the area of structural biology to address fundamental questions in 
cancer biology; teach undergraduate and/or graduate students, and participate 
in ongoing programs in the Department of Biological Sciences and the Center for 
Cancer Research. Preference will be given to candidates utilizing modern 
cryo-EM approaches combined with other structural approaches such as X-ray 
crystallography to determine structures of cancer-relevant macromolecules and 
macromolecular complexes, and/or X-ray crystallography of cancer-relevant drug 
targets as part of a structure-based drug design program. Candidates should 
hold the rank of associate or full professor and have a PhD in Biological 
Sciences or related field, have an excellent track record of publications and 
extramural funding and a strong commitment to excellence in teaching.

 The Department of Biological Sciences offers a dynamic research 
environment in structural biology research and education. The Markey Center for 
Structural Biology at Purdue is recognized worldwide for its leadership in 
structural biology of viruses, membrane proteins, receptors, signaling 
proteins, enzymes and nucleic acids in addition to methods development in X-ray 
crystallography, cryo-electron microscopy and NMR. The Purdue Center for Cancer 
Research is among an elite group of NCI-designated Cancer Centers nationwide 
and one of only seven centers focused exclusively on basic and translational 
research. The Walther Professor will have laboratory space in the newly 
constructed Hockmeyer Hall of Structural Biology, which houses a Titan Krios 
cryo-TEM, X-ray generators and detectors, and crystallization and imaging 
robots. Other state-of-the-art shared resources across Purdue such as a Bruker 
Avance-III 800 MHz NMR and other advanced biophysical instrumentation are 
available through the Bindley Biosciences and Birck Nanotechnology Centers.

 Applications must be submitted electronically to 
http://hiring.science.purdue.eduhttp://hiring.science.purdue.edu/ as a PDF 
file that includes; a detailed curriculum vitae, names and addresses of three 
referees, a 2 - 3 page summary of research interests, and a one-page statement 
of teaching experience and interests. Inquiries should be directed 
tosea...@bio.purdue.edumailto:sea...@bio.purdue.edu or Structural Biology 
Search Committee, Department of Biological Sciences, Purdue University, 915 W. 
State St., West Lafayette, IN 47907-2054. Confidential review of applications 
will begin October 1, 2014 and will continue until the position is filled. 
Further information about the Department is available at 
http://www.bio.purdue.edu/.


A background check will be required for employment in this position.  Purdue 
University is an ADVANCE institution and a dual career friendly employer.

Purdue University in EEO/AA Employer. All individuals, including minorities, 
women, individuals with disabilities, and protected veterans are encouraged to 
apply.

Andrew D. Mesecar
Walther Professor of Cancer Structural Biology
 Deputy Director, Purdue University Center for Cancer Research
Departments of Biological Sciences and Chemistry
Hockmeyer Hall of Structural Biology
Room 311
240 S. Martin Jischke Drive
West Lafayette, IN 47907-1971
Ear Mail:  765-494-1924
E-Mail: amese...@purdue.edu


[ccp4bb] REMINDER: apply now for the first DLS/CCP4 data analysis workshop

2014-10-06 Thread David Waterman
Dear all,

Please be aware that the application period for the first DLS/CCP4 data
analysis workshop closes on the 31st October. Successful applicants will be
notified shortly after that date.

As a reminder, the workshop is intended for PhD students, postdocs and
early career scientists who are currently working on a project in MX, and
who are able to bring crystals or data with them. The workshop will consist
of a mixture of lectures and tutorials, plus hands-on practical sessions,
in which the students will work alongside the leading software developers
and scientists on their own data. Applicants please note that you must
provide the e-mail address of a supervisor who will write a letter in
support of your application.

More details may be found here:
http://www.ccp4.ac.uk/schools/DLS-2014/index.php

The online application form is at Diamond's Events site:
http://www.diamond.ac.uk/Home/Events/2014/Diamond-CCP4-Data-Collection-and-Analysis-workshop.html

Many thanks for your interest

-- David Waterman


Re: [ccp4bb] [phenixbb] Calculate average B-factor?

2014-10-06 Thread Dale Tronrud
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1



On 10/6/2014 4:20 AM, Tim Gruene wrote:
 Dear Jose,
 
 the question came up again because I did not receive an answer to 
 my question. The thread discussed benefits and malefits of PDB vs. 
 mmCIF, which was not my question. This time, Nat Echolls gave a 
 very reasonable answer (at least for phenix) on the phenixbb,
 i.e., that there are no plans to abandon the PDB format (as
 working format), but very likely a smooth transition will take
 place - I guess this will be more slowly than the enforcement of
 the PDB to upload PDBx/mmCIF files for archiving. I agree that for
 archiving mmCIF is a reasonable format, but I guess less than 1% of
 all structures in the PDB hit the limits of the PDB format.
 
That's odd. I've found that just about every structure I've worked
on in the last couple decades has not been able to be expressed in the
PDB format without loss of information. A primary example? Try
expressing a pair of side chains that have alternative conformation in
a PDB file. Okay, one conformation is A and the other is B. That
allows me a total of twelve pairs of side chains before I run out of
upper case letters. Most people hack their model by reusing A and
B but of course that is ambiguous about where you mean the A's are
the same and where they are different. A realistic model of the
surface of a protein cannot be expressed in the PDB format.

How many models are refined with TLS B factors? There still is no
way to describe TLS in the PDB format. Don't tell me it's stuffed in
REMARK! What kind of a file format is that?

I believe that 100% of the models that we should be building can't
be described in the PDB file format, and that has been true for a
great many years.

Dale Tronrud

 I greatly appreciate Nat's answer and I would appreciate an answer 
 from the responsibles for the other refinement programs.
 
 Best, Tim
 
 On 10/05/2014 08:05 PM, Jose Manuel Duarte wrote:
 Thanks Frances for the explanation. Indeed mmCIF format is a lot 
 more complicated and grep can be a dangerous tool to use with 
 them. But for most cases it can do the job and thus it maintains 
 some sort of backwards compatibility. I can't agree more that 
 using specialised tools (for either PDB files or mmCIF files) 
 that deal with the formats properly is the best solution (see
 for instance http://mmcif.wwpdb.org/docs/software-resources.html
 for some of the mmCIF readers).
 
 In any case I find it most surprising that this topic came yet 
 again to this BB, when it was thoroughly discussed last year in 
 this thread:
 
 https://www.jiscmail.ac.uk/cgi-bin/webadmin?A2=ind1308L=ccp4bbD=0P=26939



 
I'm not sure why this kind of urban legends on the evilness of the mmCIF
 format keep coming back to the list...
 
 As explained there and elsewhere endless times, the PDB format is
 inadequate to represent the complexity of macromolecules and has
 been needing a replacement for a long time. The decision to move
 on to mmCIF has been made and in my opinion the sooner we move
 forward the better.
 
 Cheers
 
 Jose
 
 
 
 On 05.10.2014 15:52, Frances C. Bernstein wrote:
 mmCIF is a very general format with tag-value pairs, and loops
  so that tags do not need to be repeated endlessly.  It was 
 designed so that there is the flexibility of defining new terms
 easily and presenting the data in any order and with any kind
 of spacing.
 
 I understand that there are 10+ files in cyberspace 
 prepared by the PDB and that they all have the 'same' format.
 
 It is tempting to write software that treats these files as 
 fixed format and hope that all software packages that generate 
 coordinate files will use the same fixed format.  But that 
 loses the generality and flexibility of mmCIF, and software 
 written that way will fail when some field requires more 
 characters or a new field is added. There are software tools
 to allow one to read and extract data from any mmCIF file;
 using these is more complicated than using grep but using
 these assures that one's software will not fail when it
 encounters a date file that is not exactly what the PDB is
 currently producing.
 
 Note that mmCIf was defined when the limitations of the 
 fixed-format PDB format became apparent with large structures. 
 Let's not repeat the mistakes of the past.
 
 Frances
 
 =  
 Bernstein + Sons *   *   Information Systems Consultants 
 5 Brewster Lane, Bellport, NY 11713-2803 *   * ***
  *Frances C. Bernstein *   *** 
 f...@bernstein-plus-sons.com *** * *   *** 1-631-286-1339 
 FAX: 1-631-286-1999 
 =
 
 On Sun, 5 Oct 2014, Tim Gruene wrote:
 
 Hi Jose,
 
 I see. In the example on page 
 http://mmcif.wwpdb.org/dictionaries/mmcif_pdbx_v40.dic/Categories/atom_site.html,



 
it is in field 12, though, and I would have thought that mmCIF allows
 line breaks.
 
 But as 

[ccp4bb] Chicago Symposium in Honor of the International Year of Crystallography

2014-10-06 Thread Constance Jeffery
Dear Colleagues,

On Friday October 10th, the University of Illinois at Chicago is hosting a 
symposium and poster session to celebrate the 100th year of crystallography.  
(Location: Herman Auditorium, MBRB 900 S. Ashland Ave., Chicago)

Please send a copy of the abstract for your poster to cjeff...@uic.edu by 
5pm Tuesday, Oct. 7.  Please indicate in the email if you are an undergraduate, 
graduate student, or postdoc, to be entered in the poster contest. *
Please note the posters are not limited to projects involving X-ray 
crystallography.  For example, projects involving NMR, Molecular Dynamics, 
Bioinformatics, and related methods are also welcome.   

The schedule of the symposium will be as follows, with lunch provided for those 
who register:
9:30am Registration
10:00-10:15  Welcome and IYCR Overview (C. Jeffery, University of Illinois at 
Chicago)
10:15-11:00  Prof. Phoebe Rice (University of Chicago)
11:00-11:45  Prof. Andrejz Joachimiak (Argonne National Labs)
12noon-1:00  Lunch
1:15 - 2:00Prof. Bernie Santarsiero (University of Illinois at Chicago)
 2:00 - 2:45   Prof. Heather Pinkett (Northwestern University)
2:45 - 4:30pm Poster session

For more information and to register, please visit the webpage:

http://www.uic.edu/labs/smxrd/IYCR/Welcome.html

***To help us plan for lunch and the reception, please indicate on the Doodle 
poll if you will be able to attend (check box) or not (click on Cannot make 
it button).  Here is the link:

http://doodle.com/qzuswpz2qpzy7dgm

We hope to see you all there!
-Constance Jeffery
Associate Professor
Dept. Biological Sciences
UIC




Re: [ccp4bb] Molecular Replacement model preparation

2014-10-06 Thread Phil Jeffrey
That document is fairly old and is in dire need of revision to reflect 
the modern arsenal of programs.


Nevertheless:
Putting the hinge axis along Z was a trick told to me by Steven Sheriff 
back in the days when we worked on Fab structures - which after all are 
classical examples of hinged molecules.  One would search with separate 
domain fragments - split either side of the hinge - and the 
Z-orientation trick makes it easier to spot pairs of peaks from each 
search model that are related to each other.  In the Fab world we 
searched with Fv models (VH:VL heterodimer) and CH1:CL constant region 
heterodimeric models.  Peaks related solely by hinge motion would have 
similar alpha and beta angles and potentially different gamma (Crowther 
convention Eulerian angles).  Historical note: this was back in the days 
when it was possible to remember the names of all the Fab fragments that 
were in PDB and their respective IDs.


This ploy was more important in the days before Phaser or Molrep, which 
will now gleefully try a long list of rotation function peaks for you 
quite quickly, so manually parsing the list of rotation function peaks 
is rather unnecessary.  And perhaps counter-productive.



Split your molecule apart at the hinge, giving fragment1 and fragment2. 
 Attempt to find both fragments independently.  Choose the one that 
gives the best results: TFZ score or LLG score or discrimination between 
possible space groupr or whatever you like.  Then, attempt to find the 
*other* fragment in the context of that first solution.



Phil Jeffrey
Princeton






On 10/5/14 3:34 AM, Luzuokun wrote:

Dear all,
I’m doing molecular replacement using Phaser. My protein is predicted to
have two domain with a “hinge” linking them. The model sequence identity
is 0.27. But the MR result is poor. I’ve tried other programme (Molrep,
MrBump, Balbes,,,_.) But no improvement was observed. I think that this
is due to the “open” or “closed” conformation around the hinge. I was
told that I could place the Z axis along the hinge
(http://xray0.princeton.edu/~phil/Facility/Guides/MolecularReplacement.html),
  could anyone tell me more details about how to do next?

Thanks!
Lu Zuokun


[ccp4bb] Traveling Salesman with Simulated Annealing

2014-10-06 Thread Sean Seaver
Hi Everyone,

I thought this was a good basic introduction (help those that click it 'on' or 
'off' cause they've been told to do it) / visualization of simulated annealing 
using a map as the basis.

http://toddwschneider.com/posts/traveling-salesman-with-simulated-annealing-r-and-shiny/

Take Care,

Sean Seaver, PhD

P212121
http://store.p212121.com/


Re: [ccp4bb] Molecular Replacement model preparation

2014-10-06 Thread Scott Thomas Walsh
Hi Phil,

Thank you for the input.  I would like to get CCP4ers input.  I deal with 
multiple domain
cytokine receptors in a manner very similar to antibody molecules.

Have people have more correct solutions searching for 1 domain sequentially and 
then other
domains OR searching for multiple domains all at once?  I am curious to hear 
peoples’ experiences on
this topic?

Cheers,

Scott


Scott T. R. Walsh, PhD
Assistant Professor
University of Maryland
IBBR/CBMG
3127E CARB-2
9600 Gudelsky Drive
Rockville, MD  20850  USA
phone: (240) 314-6478
fax: (240) 314-6225
email: swals...@umd.edu


On Oct 6, 2014, at 2:11 PM, Phil Jeffrey pjeff...@princeton.edu wrote:

 That document is fairly old and is in dire need of revision to reflect the 
 modern arsenal of programs.
 
 Nevertheless:
 Putting the hinge axis along Z was a trick told to me by Steven Sheriff back 
 in the days when we worked on Fab structures - which after all are classical 
 examples of hinged molecules.  One would search with separate domain 
 fragments - split either side of the hinge - and the Z-orientation trick 
 makes it easier to spot pairs of peaks from each search model that are 
 related to each other.  In the Fab world we searched with Fv models (VH:VL 
 heterodimer) and CH1:CL constant region heterodimeric models.  Peaks related 
 solely by hinge motion would have similar alpha and beta angles and 
 potentially different gamma (Crowther convention Eulerian angles).  
 Historical note: this was back in the days when it was possible to remember 
 the names of all the Fab fragments that were in PDB and their respective IDs.
 
 This ploy was more important in the days before Phaser or Molrep, which will 
 now gleefully try a long list of rotation function peaks for you quite 
 quickly, so manually parsing the list of rotation function peaks is rather 
 unnecessary.  And perhaps counter-productive.
 
 
 Split your molecule apart at the hinge, giving fragment1 and fragment2.  
 Attempt to find both fragments independently.  Choose the one that gives the 
 best results: TFZ score or LLG score or discrimination between possible space 
 groupr or whatever you like.  Then, attempt to find the *other* fragment in 
 the context of that first solution.
 
 
 Phil Jeffrey
 Princeton
 
 
 
 
 
 
 On 10/5/14 3:34 AM, Luzuokun wrote:
 Dear all,
 I’m doing molecular replacement using Phaser. My protein is predicted to
 have two domain with a “hinge” linking them. The model sequence identity
 is 0.27. But the MR result is poor. I’ve tried other programme (Molrep,
 MrBump, Balbes,,,_.) But no improvement was observed. I think that this
 is due to the “open” or “closed” conformation around the hinge. I was
 told that I could place the Z axis along the hinge
 (http://xray0.princeton.edu/~phil/Facility/Guides/MolecularReplacement.html),
  could anyone tell me more details about how to do next?
 
 Thanks!
 Lu Zuokun


[ccp4bb] How to rename nucleic acid residues in PDB files

2014-10-06 Thread Sasha Pausch
Hello All,

I would like to know how to change the nomenclature of bases (for example
from Ad to DA for adenosine triphosphate) in PDB file.


Re: [ccp4bb] How to rename nucleic acid residues in PDB files

2014-10-06 Thread William G. Scott
 On Oct 6, 2014, at 12:04 PM, Sasha Pausch sashapau...@gmail.com wrote:
 
 Hello All,
 
 I would like to know how to change the nomenclature of bases (for example 
 from Ad to DA for adenosine triphosphate) in PDB file.

perl -pi -e ‘s|Ad|DA|g’  your.pdb