from:"James Stroud"

Re: [ccp4bb] Equation Editor woes with Office 2011 for Mac

2015-05-20 Thread James Stroud

I didn’t see the following solution in any other responses. It’s probably the 
most reasonable one given the constraints of collaboration and publishing.

In the absence of using the best software, I found it practical to write the 
equations in MathType and save them as MathType PDF equations and then add 
these equations to the document. It is a portable, cross-platform-ish solution. 
Others only need to install a MathType player, which is free. The advantage is 
that if your equation gets hosed in the document, you still have the original, 
editable equation in the PDF. In such cases, you must re-embed it in your 
document, but it’s better than fully rewriting it.

With that said, if you want to work behind a full-featured word processor and 
have access to the wonders of TeX typesetting, LibreOffice (OpenOffice) + 
TexMaths is the best for the author during preparation of a manuscript. At this 
point it is bug free (to my experience), embeds vector equations (SVG) or 
raster (PNG), is editable, and looks spectacular both when editing and when 
publishing/printing.

The downside is that you have to collaborate with people you can’t force into 
using the best software. Worse, journals seem to use proprietary publishing 
software and they want MathType or equation editor with Microsoft word, hence 
my first solution.

James




On May 18, 2015, at 5:10 AM, Keller, Jacob kell...@janelia.hhmi.org wrote:

 There is the possibility of using one of the open-source versions, like 
 openOffice, but those I guess also have their issues.
 
 JPK
 
 -Original Message-
 From: CCP4 bulletin board [mailto:CCP4BB@JISCMAIL.AC.UK] On Behalf Of Randy 
 Read
 Sent: Monday, May 18, 2015 4:11 AM
 To: CCP4BB@JISCMAIL.AC.UK
 Subject: [ccp4bb] Equation Editor woes with Office 2011 for Mac
 
 Rather off-topic, but maybe someone on the list has found a way to work 
 around this!
 
 There's a problem with the Equation Editor in Office 2011 for Mac (i.e. the 
 one that is based on a stripped-down version of MathType, which you get with 
 Insert-Object-Microsoft Equation).  You can insert an equation, re-open it 
 and edit it several times, and then suddenly (and seemingly randomly) the 
 equation object will be replaced by a picture showing the equation, which can 
 no longer be edited.  I'm writing a rather equation-heavy paper at the 
 moment, and this is driving me crazy.
 
 This seems to be a known bug, which has existed from the release of Office 
 2011.  Apparently it happens, unpredictably, when an AutoSave copy of the 
 document is saved, so you can avoid it by turning off the AutoSave feature.  
 The last time this drove me crazy, several years ago, I did try turning off 
 AutoSave.  For a while, I was very good about manually saving frequently, but 
 I got into bad habits and eventually Word crashed after I had worked for 
 several hours on a grant proposal without manually saving.  So I turned 
 AutoSave back on.
 
 At the moment, the least-bad solution seems to be to turn off AutoSave while 
 I'm working on a document with lots of equations and then (hopefully) 
 remember to turn it back on after that document is finished.  But it would be 
 great if someone has come up with a better cure for this problem.
 
 No doubt someone will suggest switching from Word to LaTeX, but I need to be 
 able to collaborate on paper-writing, and even though I might be willing to 
 invest the effort in learning LaTeX, I can't really expect that of my 
 collaborators.  Most people in our field do use Microsoft Word, regardless of 
 its failings.  I've also tried using the professional version of MathType, 
 but that requires your collaborators to install it as well - and I don't 
 think that cured the equation to picture problem anyway.
 
 Thanks!
 
 -
 Randy J. Read
 Department of Haematology, University of Cambridge
 Cambridge Institute for Medical ResearchTel: +44 1223 336500
 Wellcome Trust/MRC Building Fax: +44 1223 336827
 Hills RoadE-mail: 
 rj...@cam.ac.uk
 Cambridge CB2 0XY, U.K.   
 www-structmed.cimr.cam.ac.uk

[ccp4bb] [RANT] Reject Papers describing non-open source software

2015-05-12 Thread James Stroud

I hereby call on the broadest community of academics and researchers, including 
scientists, historians, economists, sociologists, psychologists, and whoever 
else has ever published a paper or read from the literature thereof, to reject 
any and all papers that describe new software that itself is not released under 
an open source model.

I further declare that this post is designed to ruffle feathers and incite 
incendiary conversation, to provoke all-caps and evoke multiple exclamation 
marks with interposed “1”s where anger prevents one from properly holding the 
shift key.

My rationale for this post: I have just spent a week installing software for 
structural biology (not crystallography) only to find that some of the key 
utilities needed were described in a recent publication but were not OSS. The 
authors have decided to stop supporting the software but have not retracted 
their paper, which is completely irrelevant without the availability of the 
software package they describe.

Let’s hammer this one out and come to the rational conclusion that non-OSS 
software should not be awarded publications.

James

Re: [ccp4bb] [RANT] Reject Papers describing non-open source software

2015-05-12 Thread James Stroud


If the software were open source, then funding would not matter. It could be 
supported (or at least used) by the user base. Withdrawing the use of published 
software because of funding issues represents a problem that can be remedied by 
enforcing (at the referee level) open source licenses.

We allow (or rather can’t prevent) the use of other published information long 
after the funding that produced said knowledge dries up. The situation should 
be no different with software.

James



On May 12, 2015, at 1:08 PM, Gloria Borgstahl gborgst...@gmail.com wrote:

 Did they stop supporting it due to lack of renewed funding and having to cut 
 staff that had the knowledge?
 I'm pretty sure you only know part of the story.
 
 
 On Tue, May 12, 2015 at 11:48 AM, James Stroud xtald...@gmail.com wrote:
 I hereby call on the broadest community of academics and researchers, 
 including scientists, historians, economists, sociologists, psychologists, 
 and whoever else has ever published a paper or read from the literature 
 thereof, to reject any and all papers that describe new software that itself 
 is not released under an open source model.
 
 I further declare that this post is designed to ruffle feathers and incite 
 incendiary conversation, to provoke all-caps and evoke multiple exclamation 
 marks with interposed “1”s where anger prevents one from properly holding the 
 shift key.
 
 My rationale for this post: I have just spent a week installing software for 
 structural biology (not crystallography) only to find that some of the key 
 utilities needed were described in a recent publication but were not OSS. The 
 authors have decided to stop supporting the software but have not retracted 
 their paper, which is completely irrelevant without the availability of the 
 software package they describe.
 
 Let’s hammer this one out and come to the rational conclusion that non-OSS 
 software should not be awarded publications.
 
 James

Re: [ccp4bb] [RANT] Reject Papers describing non-open source software

2015-05-12 Thread James Stroud

On May 12, 2015, at 12:29 PM, Roger Rowlett rrowl...@colgate.edu wrote:

 Was the research publicly funded? If you receive funds from NSF, for example, 
 you are expected to share and make widely available and usable software and 
 inventions created under a grant (section VI.D.4. of the Award and 
 administration guide). I don't know how enforceable that clause is, however.

The funding shouldn’t matter. I suggest that a publication that has the purpose 
of describing non-open source software should be summarily rejected by 
referees. In other words, the power is in our hands, not the NSF’s.

Re: [ccp4bb] [RANT] Reject Papers describing non-open source software

2015-05-12 Thread James Stroud


Making the software OSS solves many problems. For example, it solves the 
problem of interdependency. Many tools these days aggregate the functions of 
several different software packages into pipelines. If one step in that 
pipeline is unavailable then the pipeline can be rendered unusable.

Most OSS licenses are only restrictive in the sense of trying to supersede the 
license in a derivative product. They do not restrict the authors unless the 
institution for which they work does not permit OSS licenses, which is usually 
done in the name of monetizing a product. In that case it doesn’t matter 
whether the product is published or not, except that the publication is free 
advertisement. If the software eventually becomes unavailable, then the 
publication is false advertisement and should be retracted. If it is monetized, 
then pay the researches with the profit, which can buy more things than simply 
having one's name on an article.

In terms of re-implementing tools. That is a colossal waste of time. First the 
idea that OSS software would be “too expensive” is not logical. Second, if OSS 
software does not do what you wish, you modify it, not re-implement it.

I don’t want to this to degenerate into a discussion of the merits of OSS. I 
would rather focus on the merits of not publishing articles whose purpose it is 
to describe closed source software. If software is described in the context of 
algorithms, then the description of the software should be removed from the 
paper. The algorithms should stand on their own.

James


On May 12, 2015, at 1:19 PM, Robbie Joosten robbie_joos...@hotmail.com wrote:

 I strongly disagree with rejecting paper for any other reasons than
 scientific ones. A paper describing software should properly describe the
 algorithms to ensure the reproducibility. The source should be available for
 inspection to ensure the program does what was claimed, for all I care this
 can be under the Ms-RSL license or just under good-old copyright. The
 program should preferably be available free for academic users, but if the
 paper is good you should be able to re-implement the tool if it is too
 expensive or doesn't exactly do what you want so it isn't entirely
 necessary. 
 Making the software open source (in an OSS sense) does not solve any
 problems that a good description of the algorithms doesn't do well already.
 OSS does not guarantee long-term availability, a paper will like outlive the
 software repository. OSS licenses (not the BSD license) can be so
 restrictive that you end up having to re-implement the algorithms anyway. So
 not having an OSS license should not be a reason to reject the paper about
 the software.
 
 Cheers,
 Robbie

Re: [ccp4bb] [RANT] Reject Papers describing non-open source software

2015-05-12 Thread James Stroud

On May 12, 2015, at 3:08 PM, Douglas Theobald dtheob...@brandeis.edu wrote:
 On May 12, 2015, at 4:50 PM, tom.p...@csiro.au tom.p...@csiro.au wrote:
 I like the open source model, but there are problems with this model along 
 with the others. One major problem is that the authors are expected to 
 support their software after others have gone in and modified it, which is 
 unrealistic.  
 
 I’ve never heard of such a thing.  Even if it does exist, there are plenty of 
 open source models that don’t require that.  

When I fork “wiki-source” software, I usually just put it into a repository 
that others can’t modify, unless the license restricts that. /snark

Re: [ccp4bb] Loop clashing on symmetry axis

2014-06-10 Thread James Stroud

You may want to drop the symmetry to see what is happening with the loop.

James

On Jun 10, 2014, at 11:48 AM, Nicholas Keep n.k...@mail.cryst.bbk.ac.uk wrote:

 Am refining a structure in P3(1)21 with three copies in the ASU. It is pretty 
 close to completion R/Rfree 18/21 with 2A data.
 However in one copy a section that is clear in the other two is poor as it 
 meets itself on a two fold.
 
 My interpretation is that one copy of the loop is in a visible conformation 
 that is on the two fold axis and the symmetry related copy is disordered so 
 it does not clash. Presumably close to half the time the ordered copy is from 
 one side and the rest from the other to give the symmetric density.
 
 
 If you push the structure into the density then the symmetry related copy 
 clashes.
 
 Is this conformation refineable in refmac5 or phenix if the occupancy is set 
 to zero?
 
 Should I allow clashes to push both copies either side of the density as is 
 currently happening?
 
 Any tips on dealing with this?
 
 Best wishes
 Nick
 
 -- 
 Prof Nicholas H. Keep
 Executive Dean of School of Science
 Professor of Biomolecular Science
 Crystallography, Institute for Structural and Molecular Biology,
 Department of Biological Sciences
 Birkbeck,  University of London,
 Malet Street,
 Bloomsbury
 LONDON
 WC1E 7HX
 
 email n.k...@mail.cryst.bbk.ac.uk
 Telephone 020-7631-6852  (Room G54a Office)
  020-7631-6800  (Department Office)
 Fax   020-7631-6803
 If you want to access me in person you have to come to the crystallography 
 entrance
 and ring me or the department office from the internal phone by the door

Re: [ccp4bb] mmCIF as working format?

2013-08-07 Thread James Stroud

On Aug 7, 2013, at 1:06 PM, Ed Pozharski wrote:
 On 08/07/2013 01:51 PM, James Stroud wrote:
 In the long term, the MM structure community should perhaps get its 
 inspiration from SQL
 For this to work, a particular interface must monopolize access to structural 
 data.

Not necessarily, although the alternative pathway might be more idealistic and 
hence unrealistic.

All that needs to happen is that the community agree on

1. What is the finite set of essential/useful attributes of macromolecular 
structural data.
2. What is the syntax of (a) accessing and (b) modifying those attributes.
3. What is the syntax of selecting subsets of structural data based on those 
attributes.

The resulting syntax (i.e. language) itself should be terse, easy to learn, 
easy to use, and preferably easy to implement.

If such a standard is created, then I believe awk-ing/grep-ing/sed-ing/etc PDBs 
and mmCIFs would quickly become historical.

James

Re: [ccp4bb] mmCIF as working format?

2013-08-07 Thread James Stroud

On Aug 7, 2013, at 2:35 PM, Ed Pozharski wrote:
 If I understand your proposal and reference to SQL correctly, you want some 
 scripting language that sounds like simple English.

I didn't say anything about being English-like. English and other natural 
languages are ill-adapted to describing the well-defined operations one might 
perform on a data structure.

 Is the advantage over existing APIs here that one does not need to learn 
 Python, C++, (or, heaven forbid, FORTRAN)?

Anyone can learn Python in an hour and a half. That's not an issue (except for 
whitespace nuts). If one wants to use Python to modify PDB structural data, I 
recommend starting with the tutorial I wrote for CCTBX: 
http://cctbxwiki.bravais.net/CCTBX_Wiki#Working_with_pdb_Files

The advantage of a language over an API is that an API requires coding overhead 
and must (by the definition of API) be part of an Application. SQL has no 
such requirement and neither would an ideal language for *selecting* and 
*modifying* macromolecular structural data. In SQL, one can make selections and 
modifications without importing libraries, defining a main function, declaring 
variables, etc. Low overhead is probably the reason so many crystallographers 
(myself not included) are fluent in the likes of awk.

 I.e. programs would look like this
 
 ---
 GRAB protein FROM FILE best_model_ever.cif;
 SELECT CHAIN A FROM protein AS chA;
 SET chA BFACTORS TO 30.0;
 GRAB data FROM FILE best_data_ever.cif;
 BIND protein TO data;
 REFINE protein USING BUSTER WITH TLS+ANISO;
 DROP protein INTO FILE better_model_yet.cif;
 ---
 
 Not necessarily a bad idea but now through the fog of time I remember 
 something oddly reminiscent... ah, CNS! (for those googling for it it's not 
 the central nervous system :).

Although a little too much like natural language, it is not a bad idea. But, 
where is the link describing the layer of CNS that looks like that? In my 
X-Plor 3.1 manual (Yale University Press, 1987) I see nothing remotely like 
what you describe. CNS, according to the most recent tutorial for 1.3, looks 
like this:

topology
evaluate ($counter=1)
evaluate ($done=false)
while ( $done = false ) loop read
   if ( exist_topology_infile_$counter = true ) then
  if ( BLANK%topology_infile_$counter = false ) then
 @@topology_infile_$counter
  end if
else
   evaluate ($done=true)
end if
evaluate ($counter=$counter+1)
end loop read
end

This example makes a point about the problems of APIs. Namely, they require 
loops and tests, and lack a true selection mechanism, except perhaps for the 
scripting layer of CNS. But even with CNS, once you have a selection, you must 
loop over it to modify the data.

Although it is likely the best library for working with structural data, 
CCTBX requires a loop just to change a specific chain ID (to the best of my 
knowledge):

pdb_inp = pdb.input(file_name=best-model.pdb)
hierarchy = pdb_inp.construct_hierarchy()
for model in hierarchy.models():
  for chain in model.chains():
if chain.id == A:
  chain.id = B

I don't intend to pick on CCTBX specifically (because the CCTBX developers have 
specific needs to which they program), but loop/test mechanisms are awkward for 
selecting and modifying structural data, and get much more awkward as 
selections get more complex (e.g. selecting the C-alpha of every alanine of 
chain A, etc.).

James

Re: [ccp4bb] Extracting .pdb info with python

2013-06-06 Thread James Stroud


On Jun 5, 2013, at 10:41 PM, Nat Echols wrote:

 On Thu, Jun 6, 2013 at 2:37 PM, GRANT MILLS gdmi...@students.latrobe.edu.au 
 wrote:
 My script seems to miscount the columns and read the two as one column, does 
 anyone know how to avoid this? (PS, I've googled this like crazy but I either 
 don't understand or the link is irrelevant)
 
 You should resist the temptation to write your own PDB parser; that way lies 
 pain and suffering.  There are multiple free libraries for Python that can be 
 used for this task - I recommend either CCTBX or BioPython (probably the 
 latter if you don't need to do very much with the models).
 
 -Nat


Resist the urge to duplicate the work of others should be the first rule of 
programming if its not already (although it's the hardest rule to follow).

I wrote a PDB tutorial for cctbx, which has probably the most powerful python 
PDB parsing library:

http://cctbxwiki.bravais.net/CCTBX_Wiki#CCTBX_Basics


James

Re: [ccp4bb] how to make the crystal thicker

2013-05-28 Thread James Stroud

I think there are some mitegen loops that are flat. They will probably be 
useful for your crystals given what you say about the 2.2 Å anisotropic 
diffraction, which is probably caused by deformation of your plates in 
conventional loops.

Some others on the list may have a more specific recommendation about this 
brand of loops. Also, you may want to check the mitegen website for specific 
products and applications.

James


On May 28, 2013, at 7:18 PM, 姜艳 wrote:

 Dear professors,
 I get my crystal in 0.1M Tris, PH7.5, 200mM (NH4)2SO4, and 20% PEG3350, 
 however, it is very thin. From one side, the diffraction is perfect, about 
 2.2A, but from the other side, diffraction is too bad, the spots look like a 
 thread! Process cannot be done by HKL2000.
 As top guns of this field, could you  give me some suggestion to make the 
 crystal thicker? I will be grateful for your kind help.
 Best,
 Jiang Yan
 
 Institute of Biophysics, Chinese Academy of Sciences
 Beijing, Chaoyang District

Re: [ccp4bb] DNA structures superimpose

2013-04-12 Thread James Stroud

The concept of a domain in a DNA structure is a bit obscure. Could you be 
more explicit about what you are trying to do? Do you mean protein domains in a 
protein-DNA complex?

James



On Apr 12, 2013, at 12:18 AM, Veerendra Kumar (Dr) wrote:

 Dear CCP4 members,
 Is there any program to superimpose the DNA structures? I also want to 
 measure the relative domain rotation angle. I tried using DynDom but it does 
 not work for me. 
 Can someone suggest a program which can output the rotation angles? 
 
 Thank you 
 
 Best Regards
 
 Veerendra kumar
 
 CONFIDENTIALITY:This email is intended solely for the person(s) named and may 
 be confidential and/or privileged.If you are not the intended 
 recipient,please delete it,notify us and do not copy,use,or disclose its 
 content.
 
 Towards A Sustainable Earth:Print Only When Necessary.Thank you.

Re: [ccp4bb] CCP4 Update victim of own success

2013-04-12 Thread James Stroud

On Apr 12, 2013, at 12:44 PM, eugene.krissi...@stfc.ac.uk wrote:

 What, I'm afraid, people rarely realise these days, is that their desktops 
 are, essentially, GUIs to various OS features, so they obviously use GUI more 
 frequently than they think :) After all, this is all matter of habits and 
 training, and the reality is that people get more and more GUI-oriented these 
 days, like it or not. Whether to fight the reality or try to use it for 
 benefit is, certainly, every developer's own choice. I still remember payroll 
 officers saying that hand calculators (and even their predecessors) were much 
 more convenient and robust than modern software, but do not hear this for 
 some 15 years already ...
 
 Eugene


There is nothing intrinsically wrong with a GUI. The problem, for those who 
like to take advantage of automation, is that a GUI is a bottleneck if it can't 
be automated in some way. In my opinion, the best automation is based on an API 
(in my favorite scripting language, of course).

The original unix tools were built with automation in mind, using pipes, 
redirects, and file I/O to control the flow of information. This automation 
wasn't designed because the architects couldn't conceive of a GUI, but because 
it was and is efficient.

A utility can easily lock a user into a GUI. The usual cruel method is by 
spawning a dialog box the forces the user to acknowledge whatever triviality 
the program is trying to communicate (instead of sending the information to a 
log file where it belongs). If a program locks the user into a GUI, by whatever 
means, then the program can't be automated. And if it can't be automated it 
becomes a source of inefficiency. I think this is James Holton's point.

Most of CCP4 can be automated by using the utilities from the command line, 
taking advantage of pipes, redirects, and file I/O as envisioned by the unix 
architects. CCP4 can also be run via a GUI, which I take advantage of in 
certain situations. It's a great model for user interaction. Long may it live 
(and may modal dialog boxes die a horrible and quick death).

James



 
 On 12 Apr 2013, at 19:09, James Holton wrote:
 
 
 I agree with Nat.  There are good GUIs and bad GUIs, just like there are good 
 command-line programs and bad command-line programs.  Bad programs are easy 
 to write and good ones are hard.  Conservation of work I think.
 
 -James Holton
 MAD Scientist
 
 On 4/12/2013 10:38 AM, Nat Echols wrote:
 On Fri, Apr 12, 2013 at 10:27 AM, James Holton 
 jmhol...@lbl.govmailto:jmhol...@lbl.gov wrote:
 But, when it comes to GUIs, I have always found them counterproductive.  In 
 my humble opinion, the purpose of computers and other machines is to DO work 
 for me, not create work for me, and I already have enough buttons to push 
 each day.
 
 This is a very defensible position with regards to your normal workflow (or 
 mine) - but beamline scientists (or software developers) are not very 
 representative of crystallographers as a group.  I've seen a lot of reflexive 
 anti-GUI mentality from users who don't fall into either category, presumably 
 because a senior postdoc or PI told them real crystallographers use the 
 command line, when in reality they'd be better served by figuring out on 
 their own what workflow is most efficient for them.
 
 -Nat
 
 
 
 -- 
 Scanned by iCritical.

Re: [ccp4bb] CCP4 Update victim of own success

2013-04-11 Thread James Stroud


On Apr 11, 2013, at 6:17 AM, eugene.krissi...@stfc.ac.uk 
eugene.krissi...@stfc.ac.uk wrote:

 Sorry that this was unclear. We assume that updater is used primarily from 
 ccp4i, where nothing changed (and why it should be used from command line at 
 all ?:)).

Scriptability.


On Apr 11, 2013, at 10:34 AM, Jim Pflugrath wrote:
 I think James gets to 'fight' like in the old game of rogue by pressing the 
 h, j, k, l keys on his keyboard (not a detachable one either).  While Eugene 
 gets to use a modern game controller or a Wii.
 
 Ooops, game is already over and James [Holton] has lost.


But he wrote the game.

James

[ccp4bb] CCP4 Update victim of own success

2013-04-10 Thread James Stroud

Hello All,

I downloaded a crispy new version of CCP4 and ran update until the update 
update script disappeared. Is the reason that CCP4 has reached its final update?

James

Re: [ccp4bb] CCP4 Update victim of own success

2013-04-10 Thread James Stroud

On Apr 10, 2013, at 9:30 PM, eugene.krissi...@stfc.ac.uk eugene.krissi...@stfc.ac.uk wrote:No, it got renamed to ccp4um :) That should have been written in update descriptions, was it not?There was only one mention of "ccp4um" that I could find in all update descriptions that I found (6.3.0-020). I only figured out what information was trying to be communicated because of your message (see attachment).JamesOn 11 Apr 2013, at 03:54, James Stroud wrote:Hello All,I downloaded a crispy new version of CCP4 and ran update until the update update script disappeared. Is the reason that CCP4 has reached its final update?James

Re: [ccp4bb] SHELX-2013 release and homepage

2013-02-27 Thread James Stroud

The underlying issue is that the implementation of OS X dynamic linking is 
dependent system architecture. As I understand it, OS X 10.5.8 has a 32 bit 
kernel, so its dynamic linking is not forward-compatible with 64 bit OS X 
versions, for which SHELX-2013 seems to built.

Also, to be fair, it's not as if Linux users have ever been able to migrate 
from from 32 bit to 64 bit without a complete system overhaul.

So while I believe that OS X may not be the optimal OS of the future, the 32/64 
bit incompatibility should not be blamed as the reason.

James


p.s. What I don't understand is why the dynamic loader was ever needed in the 
first place. As a zero dependency program, wouldn't it make sense just to 
statically compile all of the SHEL* programs at the cost of a little bigger 
downloads?



On Feb 27, 2013, at 3:10 AM, Tim Gruene wrote:

 -BEGIN PGP SIGNED MESSAGE-
 Hash: SHA1
 
 phew - and people claim, macs were user-friendly. On Linux you just
 add '-static' (at least for command line tools as the shelx programs)
 and make the sane assumption that the kernel is less than say 10 years
 old...
 
 Tim
 
 On 02/26/2013 04:50 PM, Guangyu Zhu wrote:
 I just downloaded it. I'm using Mac 10.5.8 and getting error
 messages:
 
 dyld: unknown required load command 0x8022 Trace/BPT trap
 
 Google search shows that:
 
 
 You should contact the developer of this application. Only the
 developer can fix this. The application was incorrectly built on a
 OS X 10.6 machine for a OS X 10.5 machine. The developer can fix
 this by considering three things:
 
 
 1. Using the correct compiler parameters: gcc-4.2
 -mmacosx-version-min=10.5 -isysroot /Developer/SDKs/MacOSX10.5.sdk 
 ...
 
 
 2. Using the correct linker settings (setting environment variable
 before link command). This is required, so that the OS X 10.6
 linker will not use the loader command 'LC_DYLD_INFO_ONLY'
 (=0x8022), because OS X 10.5 does not understand this command:
 
 export MACOSX_DEPLOYMENT_TARGET=10.5 (or   setenv
 MACOSX_DEPLOYMENT_TARGET=10.5)
 
 
 After this is fixed, one can check if the application was correctly
 built for OS X 10.5 by running 'otool':
 
 otool -l binary
 
 The correct binary should not contain any  'LC_DYLD_INFO_ONLY'
 load commands (only 'LC_DYLD_INFO' commands).
 
 
 (also see my blog article http://grauonline.de/wordpress/?p=71 )
 
 
 
 Is it possible to build one for Mac 10.5.8?
 
 Thanks!
 
 Guangyu Zhu
 
 On 2/26/13 3:32 AM, George Sheldrick
 gshe...@shelx.uni-ac.gwdg.de wrote:
 
 As some of you have already discovered, there is a major new
 release of the whole of SHELX (the first since 1997) complete
 with a new homepage that should make downloads easier. To obtain
 the programs, please point your browser to:
 
 http://shelx.uni-ac.gwdg.de/SHELX/
 
 and then 'register' (top of blue menu, upper left). You will
 then receive the password immediately by email and can then go to
 'downloads'. The procedure has been designed to make life
 difficult for spammers etc. A little program (in FORTRAN of
 course) turns the registrations into a sorted users' list, which
 I hand-edit where necessary before it appears on the homepage,
 this may take a few days. The homepage also provides access to
 extensive documentation, FAQs etc. Please let me know of any
 problems (even typos) with the homepage and programs, with so
 much new material there are sure to be some bugs. The new
 versions replace all previous versions including beta-tests and
 almost all the programs have been improved since their last
 beta-test version (see 'recent changes').
 
 In particular, the new shelxe_2013/2 corrects a couple of serious
 bugs in the autotracing from MR models and MRSAD present in the
 beta-test 2013/1.
 
 George
 
 
 -- Prof. George M. Sheldrick FRS Dept. Structural Chemistry, 
 University of Goettingen, Tammannstr. 4, D37077 Goettingen,
 Germany Tel. +49-551-39-3021 or -3068 Fax. +49-551-39-22582
 
 
 - -- 
 - --
 Dr Tim Gruene
 Institut fuer anorganische Chemie
 Tammannstr. 4
 D-37077 Goettingen
 
 GPG Key ID = A46BEE1A
 
 -BEGIN PGP SIGNATURE-
 Version: GnuPG v1.4.12 (GNU/Linux)
 Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/
 
 iD8DBQFRLduMUxlJ7aRr7hoRAkNUAJ99nANlYCXYmlOqTpvKnDJyg+A1bwCdF7GA
 Ag8n9o/IzBAkuX+HBfH8cMw=
 =WBV3
 -END PGP SIGNATURE-

Re: [ccp4bb] off topic - legacy hardware help needed

2013-01-24 Thread James Stroud

It seems like the problem is that the Indigo has an IP address that is not part 
of the subnet that it is on, so this makes network access impossible. Is this 
correct?

If so, you can just spoof the proper subnet in isolation. You can run dd-wrt on 
a compatible router and assign any subnet you want (ensuring to disconnect it 
from the WAN, lest a disgruntled IT guy shows up at your office). Then, just 
access the SGI via IP as you normally would from another machine on the 
isolated subnet and edit the config files to change the IP address. This 
doesn't require tracking down any legacy hardware.

James



On Jan 23, 2013, at 5:07 PM, Dave Roberts wrote:

 Hi all,
 
 By the way, thanks for all the suggestions on the linux versions.  I went 
 against my better judgement and just stuck with Fedora, mainly because I'm 
 familiar with it.  I have to admit, I kind of like it. I was able to get it 
 up and running, run nfs to mount local drives, and install all the necessary 
 crystallography software with no hitch - quick.  It's kind of nice.  And it 
 set up my wireless printer automatically - so all is great.
 
 Anyway, we have an old Indigo SGI that runs our NMR.  It's a console only 
 system, and we access it via the network from another old SGI (toaster model 
 - blue).  The console does not have a video card (nor space for one), so I 
 can't plug in to it and see what's happening.
 
 Anyway, our network was recently updated, and in doing so it has made access 
 to our console system unavailable.  We can't get there because the IP's that 
 used to be needed are no more.
 
 So, I can get the disk out, and I have a variety of unix/linux systems that I 
 could plug it in to.  But, alas, I have no motherboards or systems that take 
 SCSI (that I have a sled for or a way to put it in). I need to be able to 
 mount the drive on some sort of system, edit a few config files to fix the 
 network, then plug it all back in.  All without messing up boot tables and 
 such (not a big deal, just thought I'd throw that out there).
 
 Is there a cable that simply allows me to plug in the back of a SCSI drive 
 then connect to an IDE port on a newer motherboard (or better yet, an 
 external USB port)?  Just curious - that would be worth it to me.
 
 Any thoughts?
 
 Thanks
 
 Dave

Re: [ccp4bb] Mac mini advice

2013-01-23 Thread James Stroud

On Jan 22, 2013, at 11:20 PM, Nat Echols wrote:
 The real difficulty is integrating Macs into a
 Linux-centric environment, for example configuring NFS, NIS, etc.

That's because NFS and NIS are antiquities left over from the days of 
mainframes. Distributed file systems and user information databases are 
designed for an environment of many workers and few machines, when the typical 
graphics workstation cost $50,000. These days, we argue whether to spend an 
extra $200 on a $500 computer. We have moved to a new paradigm: many workers 
with many more machines, with each machine having essentially mainframe levels 
of storage and computing power. In other words, instead of NFS, you should run 
git.

James

Re: [ccp4bb] Mac mini advice

2013-01-22 Thread James Stroud

Get a quad-core. If you have iTunes going, some website running javascript 
without your knowing it, and you have a computational job running, then you've 
used up your dual core and things get sluggish. It happens to me all the time 
on my c. 1996 iMac, which is still (barely) good enough for me.

On Mac v. Linux where calculations come secondary to office-type calculations, 
you have to weigh your level of vendor lock-in. Do you run Libreoffice or 
Microsoft Office? Inkscape or Illustrator? Gimp or Photoshop? Etc. If you are 
locked-in to commercial products and haven't migrated to open source, then you 
may want to think twice about a Linux box. Macs are very seamless for an office 
environment, but I don't know if they are appropriate for heavy-duty 
calculations given that you'll trade horsepower for the Mac experience.

James



On Jan 22, 2013, at 10:59 AM, Cara Vaughan wrote:

 Dear CCP4BB
 
 I'm thinking about buying a Mac Mini and was looking for advice from people 
 who have used these for crystallography.
 
 We don't need the computer to do serious number-crunching as we have back-end 
 servers that can do this for us, so it is primarily for running coot for 
 model building, etc. and low intensity crystallography jobs.
 
 I've seen from the archive that some people do use the Mac Mini for 
 crystallography and I've got two questions:
 1. Do I need the Quad core or is a Dual core processor enough?
 2. Is the intergrated Intel HD graphics card OK for crystallography 
 requirements?
 
 All the best,
 Cara.
 
 
 Cara Vaughan
 Lecturer in Structural Biology
 Institute of Structural and Molecular Biology
 Birkbeck College and UCL
 London UK
 
 
 This message was sent using IMP, the Internet Messaging Program.

Re: [ccp4bb] Mac mini advice

2013-01-22 Thread James Stroud

I meant c.2006 iMac, of course.

James

On Jan 22, 2013, at 11:05 PM, James Stroud wrote:

 Get a quad-core. If you have iTunes going, some website running javascript 
 without your knowing it, and you have a computational job running, then 
 you've used up your dual core and things get sluggish. It happens to me all 
 the time on my c. 1996 iMac, which is still (barely) good enough for me.
 
 On Mac v. Linux where calculations come secondary to office-type 
 calculations, you have to weigh your level of vendor lock-in. Do you run 
 Libreoffice or Microsoft Office? Inkscape or Illustrator? Gimp or Photoshop? 
 Etc. If you are locked-in to commercial products and haven't migrated to open 
 source, then you may want to think twice about a Linux box. Macs are very 
 seamless for an office environment, but I don't know if they are appropriate 
 for heavy-duty calculations given that you'll trade horsepower for the Mac 
 experience.
 
 James
 
 
 
 On Jan 22, 2013, at 10:59 AM, Cara Vaughan wrote:
 
 Dear CCP4BB
 
 I'm thinking about buying a Mac Mini and was looking for advice from people 
 who have used these for crystallography.
 
 We don't need the computer to do serious number-crunching as we have 
 back-end servers that can do this for us, so it is primarily for running 
 coot for model building, etc. and low intensity crystallography jobs.
 
 I've seen from the archive that some people do use the Mac Mini for 
 crystallography and I've got two questions:
 1. Do I need the Quad core or is a Dual core processor enough?
 2. Is the intergrated Intel HD graphics card OK for crystallography 
 requirements?
 
 All the best,
 Cara.
 
 
 Cara Vaughan
 Lecturer in Structural Biology
 Institute of Structural and Molecular Biology
 Birkbeck College and UCL
 London UK
 
 
 This message was sent using IMP, the Internet Messaging Program.

Re: [ccp4bb] Symmetry operator

2013-01-10 Thread James Stroud

The transformation matrix describing the symmetry is sensitive to the 
coordinate system origin. You should center the entire tetramer on the origin 
(0, 0, 0), where the origin coincides with the point symmetry element.  If you 
have a tetramer with true point symmetry, then the center of the tetramer 
should be the center of mass. Then you can use lsq-expl from O or whatever the 
equivalent is in pymol, phenix, or coot to get the transformation matrix 
between any two monomers.

Repeated application of that transformation matrix to any monomer of the 
tetramer should generate the tetramer.

James

On Jan 10, 2013, at 5:48 PM, james09 pruza wrote:

 Hi,
 
 Which program outputs the symmetry operator (rotation and translation)? I 
 have a dimer in the asymmetric unit and need to know the symmetry operator to 
 get a tetramer, the active molecule. 
 James

Re: [ccp4bb] Symmetry operator

2013-01-10 Thread James Stroud

I need to amend that and make a couple of corrections, after thinking about it. 
First, the rotation-translations shouldn't be sensitive to the origin. Second, 
if it has C4 (square) symmetry, then you only need one generator 
(rotation-translation) to make the tetramer, and the two monomers should be 
adjacent.

If it is T4 (tetrahedral) symmetry, then you need two generators, with one 
generator made from a pair of monomers opposite and one generator made from a 
pair adjacent.

James


On Jan 10, 2013, at 6:05 PM, James Stroud wrote:

 The transformation matrix describing the symmetry is sensitive to the 
 coordinate system origin. You should center the entire tetramer on the origin 
 (0, 0, 0), where the origin coincides with the point symmetry element.  If 
 you have a tetramer with true point symmetry, then the center of the tetramer 
 should be the center of mass. Then you can use lsq-expl from O or whatever 
 the equivalent is in pymol, phenix, or coot to get the transformation matrix 
 between any two monomers.
 
 Repeated application of that transformation matrix to any monomer of the 
 tetramer should generate the tetramer.
 
 James
 
 On Jan 10, 2013, at 5:48 PM, james09 pruza wrote:
 
 Hi,
 
 Which program outputs the symmetry operator (rotation and translation)? I 
 have a dimer in the asymmetric unit and need to know the symmetry operator 
 to get a tetramer, the active molecule. 
 James

Re: [ccp4bb] Off topic: Selecting atoms within a given distance from a target atom

2012-11-18 Thread James Stroud

In O, the command is called symm_sphere. 
http://xray.bmc.uu.se/alwyn/A-Z_of_O/everything_s.html#anchor536668

James


On Nov 18, 2012, at 10:14 AM, Bosch, Juergen wrote:

 Hi Pavel,
 
 does this also work for symmetry related atoms ?
 
 Jürgen 
 
 ..
 Jürgen Bosch
 Johns Hopkins Bloomberg School of Public Health
 Department of Biochemistry  Molecular Biology
 Johns Hopkins Malaria Research Institute
 615 North Wolfe Street, W8708
 Baltimore, MD 21205
 Phone: +1-410-614-4742
 Lab:  +1-410-614-4894
 Fax:  +1-410-955-3655
 http://lupo.jhsph.edu
 
 On Nov 17, 2012, at 14:26, Pavel Afonine pafon...@gmail.com wrote:
 
 Hi Rex, 
 
 as easy as:
 
 phenix.pdb_atom_selection model.pdb within(3, chain L and resseq 9 and name 
 CA) --write-pdb-file=cut.pdb
 
 which in the above example selects all atoms within 3 A from CA atom in 
 chain A of residue number 9, and writes them into cut.pdb file.
 
 Pavel
 
 On Sat, Nov 17, 2012 at 12:04 PM, Rex Palmer rex.pal...@btinternet.com 
 wrote:
 I would like to specify a target atom in a pdb file and then isolate all 
 atoms within a given distance of the target. The selected atoms are then to 
 be placed in a new pdb file.
 Any suggestions please.
  
 Rex Palmer
 http://www.bbk.ac.uk/biology/our-staff/emeritus-staff
 http://rexpalmer2010.homestead.com

Re: [ccp4bb] vitrification vs freezing

2012-11-16 Thread James Stroud

On Nov 16, 2012, at 12:01 PM, Ed Pozharski wrote:

 On 11/16/2012 12:54 PM, Kendall Nettles wrote:
 
 I wouldn't go into the lab and say did you cryo-cool those crystals yet? 
 or  check out this nice crystal. Its ready for vitrification.
 
 
 If we speak the way scientific articles are written...
 
 By Bernard Dixon, published in New Scientist, 11 April 1968, p.73, an 
 imaginary conversation at breakfast:
 
 Daddy, I want cornflakes this morning. Must I have porridge?
 
 Yes. It has been suggested by mummy that, in view of the external coldness, 
 the eating of porridge by you will cause an increase in bodily temperature. 
 Furthermore, in regard to the already-mentioned temperature considerations, 
 your grandma-knitted gloves and wool-lining-hooded coat will have to be worn.


The imprecision of this language is staggering. For instance, the meaning of 
the word worn is completely ambiguous and should be clarified. Does the 
author mean that the coat (what kind?) should be draped over the child's head, 
tied around the child's waste, or should his arms be placed through the 
sleeves. If the latter, should the fasteners (what kind?) be dorsal or ventral?

James

Re: [ccp4bb] vitrification vs freezing

2012-11-15 Thread James Stroud

Isn't cryo-cooled redundant?

James

On Nov 15, 2012, at 11:34 AM, Phil Jeffrey wrote:

 Perhaps it's an artisan organic locavore fruit cake.
 
 Either way, your *crystal* is not vitrified.  The solvent in your crystal 
 might be glassy but your protein better still hold crystalline order (cf. 
 ice) or you've wasted your time.
 
 Ergo, cryo-cooled is the description to use.
 
 Phil Jeffrey
 Princeton
 
 On 11/15/12 1:14 PM, Nukri Sanishvili wrote:
 s: An alternative way to avoid the argument and discussion all together
 is to use cryo-cooled.
 Tim: You go to a restaurant, spend all that time and money and order a
 fruitcake?
 Cheers,
 N.

Re: [ccp4bb] Assemble Protein-DNA complex

2012-11-09 Thread James Stroud

This sounds like a job for ammonium acetate. Use it as your salt. Purify your 
complex in it and then set up drops where they wells have the amount of 
ammonium acetate needed to keep your protein stable and the wells have none, or 
a range of concentrations. The ammonium acetate will equilibrate by vapor 
diffusion, lowering the concentration in the drop and causing your complex to 
come out of solution.

James

On Nov 9, 2012, at 9:14 AM, Wei Huang wrote:

 Dear CCP4BBers,
 
 I have a problem in purifying protein-DNA complex for a protein that I am 
 interested in.
 
 The purification of protein only has been optimized and I've get enough yield 
 for what I need (10 mg/2.4 L growth). And I've measured DNA binding using 
 Fluorescence Anisotropy. The results show that my protein has the tightest 
 binding (Kd=8 nM) to the DNA at low salt condition (30 mM KCl) in 20 mM HEPES 
 (pH 7.5).
 
 However, I came across several problems when I assemble protein-DNA complex 
 in large scale. 
 
 First, my protein is unstable at low salt condition. When I dialyzes my 
 protein into low salt buffer (tried 30 mM and 100 mM KCl) for binding DNA, 
 the protein precipitates. What I don't quite understand is that the DNA 
 binding assay performed at low salt condition doesn't seem to be affected by 
 this instability of protein. I guess it may be due to the assay was performed 
 at very diluted protein concentration (in nM). 
 
 Second, I can not purify protein-DNA complex at high salt condition with gel 
 filtration column. Because of the first problem, I tried to assemble the 
 complex at high salt condition (150 mM KCl, 150 mM NaCl). However, the 
 elution profile shows no binding of DNA to my protein (no increase in the 
 observation of protein peak and a large peak around expected position for 
 DNA). This may be due to weaker binding at high salt as my DNA binding assay 
 shows that the Kd under this buffer condition is ~1100 nM.
 
 Third, a lot of protein is lost during dialysis of protein-DNA complex into 
 low salt condition. I tried add DNA directly into protein in high salt 
 buffer, then dialyze very slowly against low salt buffer. However, I still 
 lost quite a lot of protein due to precipitation. I was able to load some 
 sample onto the gel filtration column with low salt running buffer. And I saw 
 the shift of protein peak in the elution profile, also protein concentration 
 measured by Bradford assay shows that the protein concentration is much less 
 than that expected from uv trace, suggesting the contribution to the 
 absorbance from DNA. But the yield is very low, less than 0.2 mg of protein 
 is left and the complex seems to be unhappy when I concentrate it. So I can 
 not get protein sample concentrated enough for my study.
 
 My previous experience with another DNA binding protein is much better. I 
 purified it in high salt, dialyzed into low salt to binding DNA and finally 
 purify with gel filtration column. However, the one I am currently working on 
 seems to be very picky. If you have any suggestion regarding to my problems, 
 I will be thankful.
 
 Best regards,
 
 -- 
 Wei Huang, PhD
 Postdoctoral Associate
 Center for Proteomics and Bioinformatics
 Case Western Reserve University
 Cleveland, OH 44106

Re: [ccp4bb] Assemble Protein-DNA complex

2012-11-09 Thread James Stroud

I meant where the drops have the concentration of ammonium acetate needed.

James


On Nov 9, 2012, at 10:06 AM, James Stroud wrote:

 This sounds like a job for ammonium acetate. Use it as your salt. Purify your 
 complex in it and then set up drops where they wells have the amount of 
 ammonium acetate needed to keep your protein stable and the wells have none, 
 or a range of concentrations. The ammonium acetate will equilibrate by vapor 
 diffusion, lowering the concentration in the drop and causing your complex to 
 come out of solution.
 
 James
 
 On Nov 9, 2012, at 9:14 AM, Wei Huang wrote:
 
 Dear CCP4BBers,
 
 I have a problem in purifying protein-DNA complex for a protein that I am 
 interested in.
 
 The purification of protein only has been optimized and I've get enough 
 yield for what I need (10 mg/2.4 L growth). And I've measured DNA binding 
 using Fluorescence Anisotropy. The results show that my protein has the 
 tightest binding (Kd=8 nM) to the DNA at low salt condition (30 mM KCl) in 
 20 mM HEPES (pH 7.5).
 
 However, I came across several problems when I assemble protein-DNA complex 
 in large scale. 
 
 First, my protein is unstable at low salt condition. When I dialyzes my 
 protein into low salt buffer (tried 30 mM and 100 mM KCl) for binding DNA, 
 the protein precipitates. What I don't quite understand is that the DNA 
 binding assay performed at low salt condition doesn't seem to be affected by 
 this instability of protein. I guess it may be due to the assay was 
 performed at very diluted protein concentration (in nM). 
 
 Second, I can not purify protein-DNA complex at high salt condition with gel 
 filtration column. Because of the first problem, I tried to assemble the 
 complex at high salt condition (150 mM KCl, 150 mM NaCl). However, the 
 elution profile shows no binding of DNA to my protein (no increase in the 
 observation of protein peak and a large peak around expected position for 
 DNA). This may be due to weaker binding at high salt as my DNA binding assay 
 shows that the Kd under this buffer condition is ~1100 nM.
 
 Third, a lot of protein is lost during dialysis of protein-DNA complex into 
 low salt condition. I tried add DNA directly into protein in high salt 
 buffer, then dialyze very slowly against low salt buffer. However, I still 
 lost quite a lot of protein due to precipitation. I was able to load some 
 sample onto the gel filtration column with low salt running buffer. And I 
 saw the shift of protein peak in the elution profile, also protein 
 concentration measured by Bradford assay shows that the protein 
 concentration is much less than that expected from uv trace, suggesting the 
 contribution to the absorbance from DNA. But the yield is very low, less 
 than 0.2 mg of protein is left and the complex seems to be unhappy when I 
 concentrate it. So I can not get protein sample concentrated enough for my 
 study.
 
 My previous experience with another DNA binding protein is much better. I 
 purified it in high salt, dialyzed into low salt to binding DNA and finally 
 purify with gel filtration column. However, the one I am currently working 
 on seems to be very picky. If you have any suggestion regarding to my 
 problems, I will be thankful.
 
 Best regards,
 
 -- 
 Wei Huang, PhD
 Postdoctoral Associate
 Center for Proteomics and Bioinformatics
 Case Western Reserve University
 Cleveland, OH 44106

[ccp4bb] PyMOL/Python CGObjects interest

2012-10-27 Thread James Stroud

Related to PDB embedded objects, I'm wondering if there would be much interest 
in my releasing an implementation of a language designed to make fairly 
elaborate illustrations in PyMOL. The reason I ask is because it would take 
some time to document and distribute, but I would do it if the demand was 
there. The implementation is written and battle-tested in my own projects, but 
undocumented.

At this moment, I call the language CGObjects. Here are a couple of examples of 
what it can do:

http://www.jamesstroud.com/pictures/site-pictures/cgobjects-example/image
http://www.jamesstroud.com/site-multimedia/site-mp4-movies/example-mp4-movie

(The movie is just a bunch of still images created using the library.)

CGObjects is a hierarchical and purely declarative language that is based on 
yaml structure. For example, the (superfluous) helices in the picture in the 
first link are specified like this:

  - a_helix :
- show : True
- object : doublehelix
  args :
pitch : 5
phase : 30
handedness : left
p : [1, -15, 0]
p1 : [0, -15, 0]
p2 : [0, 15, 0]
rgb1 : [1, 1, 0]
rgb2 : [0, 1, 1]
step : 0.1
radius : 0.075
head_size : 0.225
nsec : 30
center_on : atoms cyl D 4:6 CA
align_to :
  my_moment : [0, 0, 1]
  frame: atoms cyl D 4:6 CA
  frame_moment : [1, 0, 0]

Although this example looks complicated, it is mostly styling information 
(rgb1, rgb2, radius, head_size, etc.). The doublehelix is created by winding 
point p around the axis specified by p1 and p2 with a pitch of 5.

Points can be specified as vectors (e.g. [1, -15, 0]) or by atom selections 
(e.g. atoms cyl D 4 CA).

The selection syntax allows for very terse and intuitive selections for single 
atoms or atom ranges. For example, the CA atoms of residues 4 through 6 of 
chain D of the pdb abbreviated as cyl are selected with the expression

 atoms cyl D 4:6 CA

Selections are not limited, however, in that they can be arbitrarily 
complicated using an S-expression (LISP-like) syntax that is easy to use once 
you get the hang of it. Here is an example:

  or (and (one_of segID A000 A002 B00B B00D C000 C002) ( resSeq 25))
 (and (one_of segID A00C A00E B001 B003 C00A C00C) ( resSeq 22))

This expression selects all residues numbered less than 25 from segments named 
A000, A002, etc., and also all residues greater than 22 from segments A00C, 
A00E, etc. Although S-expressions look funny or confusing at first, once you 
get the hang of them, you might agree with me that any other selection syntax 
seems barbaric by comparison. (Note that atoms cyl D 4:6 CA is actually an 
S-expression, where atoms is the operator and the rest of the expression 
contains the arguments to the operator.)

In the first link above, note that the helices are perfectly aligned with the 
so-called figure axes (green, blue, and yellow double-headed arrows). These 
figure axes align with the principle moments of the triangle made by atoms cyl 
D 4:6.  Similarly, the helix is made to align with these moments by first 
centering on the triangle formed by the atom centers of atoms cyl D 4:6 and 
then by aligning the last moment of the helix with the principal moment of the 
triangle. All of this high-level geometry is specified by only five simple 
lines:

center_on : atoms cyl D 4:6 CA
align_to :
  my_moment : [0, 0, 1]
  frame: atoms cyl D 4:6 CA
  frame_moment : [1, 0, 0]

The use of atom selections and moments frees the user from performing manual 
alignments, extracting coordinates, or doing math. This means that if your 
model coordinates change, then your illustrations change with them.

CGObjects has many different types of objects like surfaces, polytubes, and 
arrows. It also supports reusable, hierarchical styling and even variable 
declarations, making it possible to change the appearance of an entire set of 
illustrations in a centralized and uniform way.

James



On Oct 26, 2012, at 2:32 PM, Pete Meyer wrote:
 Francois Berenger wrote:
 Hello,
 For some new project, I'd like to be able to generate things
 and store them in PDB format.
 For example, a triangle, a line segment, a square,
 a cube, a sphere, an arrow, etc.
 Being able to change the color and line width would be nice.
 Is there some official recommended way of doing this?
 Is there some software able to read and display such
 graphical annotations of PDB files?
 I'll also need the format description in that case.
 I want to be able to process a PDB file and store the result
 of my processing in the same PDB file as some kind of annotation.
 My current way of doing this is to discretize my objects as H atoms
 in some other output PDB file, but that's just a temporary workaround.
 My current search got me this:
 http://www.cgl.ucsf.edu/eccc1/
 So, maybe there is some support for what I am looking for into
 Chimera.
 Thanks

Re: [ccp4bb] adding some user-defined graphical objects in a PDB file (and displaying them)

2012-10-25 Thread James Stroud

This sounds like something that a PDB file is not intended to do. I think 
everyone has universally agreed to the PDB specification at the RCSB, which 
makes no provisions for arbitrary objects, as cool as they would be.

But you could put your information into REMARK records, which are free-form. 
Maybe you could then find some people to honor your specification or build an 
implementation yourself...


21st century solution


The simplest way, would be to write a converter that embeds a pymol script 
into REMARK records and a preprocessor for pymol that extracts them and feeds 
them into the pymol stream.

If you write the implementation then you have a de facto standard. It would 
take a handful of lines of code. Write it in python and claim the mantle of 
coolness and the disdain of python detractors everywhere!

You could set up a home page using wikispaces or even the pymol wiki, pointing 
to your implementation that you keep on github.


James


On Oct 25, 2012, at 7:25 PM, Francois Berenger wrote:

 Hello,
 
 For some new project, I'd like to be able to generate things
 and store them in PDB format.
 
 For example, a triangle, a line segment, a square,
 a cube, a sphere, an arrow, etc.
 Being able to change the color and line width would be nice.
 
 Is there some official recommended way of doing this?
 
 Is there some software able to read and display such
 graphical annotations of PDB files?
 I'll also need the format description in that case.
 
 I want to be able to process a PDB file and store the result
 of my processing in the same PDB file as some kind of annotation.
 
 My current way of doing this is to discretize my objects as H atoms
 in some other output PDB file, but that's just a temporary workaround.
 
 My current search got me this:
 http://www.cgl.ucsf.edu/eccc1/
 So, maybe there is some support for what I am looking for into
 Chimera.
 
 Thanks a lot for your suggestions,
 Francois.

Re: [ccp4bb] off topic: a Python online course and others

2012-10-21 Thread James Stroud

The design of python was influenced by the ABC language which itself was meant 
to be a replacement for BASIC. So the similarities you observe are not by 
chance.

If you are looking for a free online experience that will stretch your computer 
literacy, try this:

http://mitpress.mit.edu/sicp/full-text/book/book.html

James


On Oct 21, 2012, at 7:38 AM, Edward Berry wrote:

 I took a look at the first four lessons at the first link, and I think there 
 must be some mistake- 
 this site is actually teaching BASIC. All these commands are valid syntax 
 under say microsoft 
 GWBasic or QuickBasic. But I think this Zed Shaw has been studying shell 
 programming also and got mixed up because he uses this octa-#-thorpe instead 
 of apostrophe or REM to denote comments.
 
 Remember when parents used to send their kids to computer boot camp to learn 
 BASIC
 for fear they would be computer-illiterate and couldn't function in the 
 modern age if 
 they couldn't program a computer? 
 
  Sean Seaver s...@p212121.com 10/20/12 1:43 PM 
 I'd also recommend:
 
 Learn Python The Hard Way By Zed A. Shaw
 http://learnpythonthehardway.org/
 
 Online Python Tutor
 http://pythontutor.com/
 
 Take Care,
 
 Sean Seaver, PhD
 
 P212121
 http://store.p212121.com/

Re: [ccp4bb] PNAS on fraud

2012-10-18 Thread James Stroud

The fit seems to be driven by the high number of points in the area of the 
graph where many points overlap. The points that catch your eye and establish 
the visible balance probably do not contribute much.

Maybe this one should have been plotted as log in the abscissa for appearances.


James


On Oct 18, 2012, at 11:52 AM, DUMAS Philippe (UDS) wrote:

 
 Le Jeudi 18 Octobre 2012 19:16 CEST, Bernhard Rupp (Hofkristallrat a.D.) 
 hofkristall...@gmail.com a écrit:
 
 I had a look to this PNAS paper by Fang et al.
 I am a bit surprised by their interpretation of their Fig. 3: they claim that 
 here exists a highly signficant correlation between Impact factor and number 
 of retractations. Personnaly,  I would have concluded to a complete lack of 
 correlation...
 Should I retract this judgment?
 Philippe Dumas
 
 Dear CCP4 followers,
 
 Maybe you are already aware of this interesting study in PNAS regarding the
 prevalence of fraud vs. 'real' error in paper retractions:
 
 Fang FC, Steen RG and Casadevall A (2012) Misconduct accounts for the
 majority of retracted scientific publications. Proc Natl Acad Sci U S A
 109(42): 17028-33.
 
 http://www.pnas.org/content/109/42/17028.abstract
 
 There were also a few comments on related stuff such as fake peer review in
 the Chronicle of Higher Education. As not all may
 have access to that journal, I have put the 3 relevant pdf links on my web
 
 http://www.ruppweb.org/CHE_Misconduct_PNAS_Stuft_Oct_2012.pdf
 http://www.ruppweb.org/CHE_DYI_reviews_Sept_30_2012.pdf
 http://www.ruppweb.org/CHE_The-Great-Pretender_Oct_8_2012.pdf
 
 
 Best regards, BR
 -
 Bernhard Rupp
 001 (925) 209-7429
 +43 (676) 571-0536
 b...@ruppweb.org
 hofkristall...@gmail.com
 http://www.ruppweb.org/
 -

Re: [ccp4bb] Plate crystals

2012-10-16 Thread James Stroud

You might get better help if you post links (not attachments) to diffraction 
images from several angles and an image of the crystal that gave the image, and 
maybe several images of typical crystals.

Also, you could post dimensions of the crystal, and if your lab is set up for 
it, a picture of the crystal in the loop.

4 Å diffraction for a plate crystal could be promising, but plate crystals 
require proper handling and optimization.

You should also try to work with your 4 Å data and see if  you can get a 
structure with MR (or even MIR), if  that is at all possible. You might find 
that you can trim some extra sequence from one or both ends that hinder good 
crystallization.

James


On Oct 15, 2012, at 4:01 PM, Jahan Alikhajeh wrote:

 
 Dear Friends,
 
 I am trying to crystalize a 70 kDa nasty protein but I got plate shape 
 crystals with high mosaicity and useless diffraction (up to 4A).
 I tried to improve/optimize crystallization but either I got the same or 
 nothing. I tried seeding but I had so many crystals without any improvement. 
 Does anyone have better idea than routine optimization method in the lab? 
 Thanks in advance.
 
 Jahan

Re: [ccp4bb] electrostatic potential and charged residues

2012-09-14 Thread James Stroud

You could do homology modeling of several homologs using 
http://protein.cribi.unipd.it/Homer/ . Then calculate the electrostatics of the 
homologs.

Better though would be actual structures. How much identity do the homologues 
have? If they are so close that homology modeling works, then you may be able 
to crystallize a few with the information available from the structure you 
already have.

Imagine how powerfully a range of experimental structures would be to support 
your argument that the electrostatics are a conserved feature of the protein.

James


On Sep 14, 2012, at 12:37 PM, Qiang Chen wrote:

 Hi all,
 
 I'm working on a protein structure which showed a special electrostatic
 potential on its surface: positive on one end and negative on the other
 end. I wonder to what extent I can say this pattern is determined by the
 charged residues? If the residues are conserved, could I make a conclusion
 that its homologues also have such pattern?
 
 Thanks!
 
 
 The information in this e-mail is intended only for the person to whom it is
 addressed. If you believe this e-mail was sent to you in error and the e-mail
 contains patient information, please contact the Partners Compliance HelpLine 
 at
 http://www.partners.org/complianceline . If the e-mail was sent to you in 
 error
 but does not contain patient information, please contact the sender and 
 properly
 dispose of the e-mail.

Re: [ccp4bb] Off-topic: Best Scripting Language

2012-09-13 Thread James Stroud


On Sep 13, 2012, at 3:24 AM, Tim Gruene wrote:
 I have the impression that
 python programmers spend a lot of effort in trying to convince others
 that python is a good choice. Why bother rather than let people make
 their own decision?


Someone asked.

Plus, python programmers put no more effort than any other programmer. It's 
just that python has more advocates (for good reason) so the apparent effort is 
amplified.

Don't hate us because our preferred programming language is beautiful.

James

--
James Stroud

http://www.jamesstroud.com

Re: [ccp4bb] Off-topic: Best Scripting Language

2012-09-13 Thread James Stroud

On Sep 13, 2012, at 11:02 AM, Patrick Shaw Stewart wrote:
 Like most computer users and many scientists I don't write scripts to 
 organize or analyse my data unless I get desperate.  I've used both Python 
 and Perl a few years ago, but it would take quite a lot of time and effort 
 and staring at on-line tutorials to get back into either of them right now.  
 So I end up using massive Excel files that kind of work, but are a pain.  
 I've noticed that quite a few structural biologists have the same problem.
 
 I've never understood why there can't be a simple programming language that 
 is completely self-explanatory bercause it uses English sentences.

Yeah. They tried that. It's called AppleScript and is a complete disaster for 
programmers simply because of its vague resemblance to natural language. There 
are essays on this issue [1, 2], but other than the message stay away from 
programming languages that try to be natural languages, these essays are 
mostly academic.

It turns out that the syntax and semantics of all reasonable programming 
languages are very similar, or fall into only a few classes (e.g. C-like, 
S-expressions, etc.), so once you are fluent in one from a class, it's easy 
to pick up the others. This can't be said of natural languages, which are full 
of idioms and grammatical exceptions, even in closely related dialects.

James


[1] 
http://www.codinghorror.com/blog/2006/08/computer-languages-arent-human-languages.html
[2] http://daringfireball.net/2005/09/englishlikeness_monster

Re: [ccp4bb] Off-topic: Best Scripting Language

2012-09-12 Thread James Stroud


On Sep 12, 2012, at 9:11 AM, Pete Meyer wrote:

 That said, I'd take a look at python, octave or R.  Python's relatively easy 
 to learn, and more flexible than octave/R; but it doesn't have the built-in 
 statistic functions that octave and R do.


import scipy


Now it does!

Re: [ccp4bb] Off-topic: Best Scripting Language

2012-09-12 Thread James Stroud

Python sorting 1 records of 1 floats for each record,  finding the max, 
min, and mean of entire 100,000,000 32 bit float array (400 MB) on a 6 year old 
white imac.

 *11.6 seconds.

*This doesn't include the time to generate the 400 MB of random (normal) data.

Try it on your own computer. Here's the copy-paste from mine:

py import timeit
py timeit.timeit('big_data.sort(axis=0), big_data.mean(); big_data.max(); 
big_data.min();',
 'import numpy; big_data=numpy.random.normal(10, 
size=1e8).reshape((1e4,1e4)); print random data made, starting...',
 number=1)
random data made, starting...
11.597978115081787

James




On Sep 12, 2012, at 8:32 AM, Jacob Keller wrote:

 Dear List,
 
 since this probably comes up a lot in manipulation of pdb/reflection files 
 and so on, I was curious what people thought would be the best language for 
 the following: I have some huge (100s MB) tables of tab-delimited data on 
 which I would like to do some math (averaging, sigmas, simple arithmetic, 
 etc) as well as some sorting and rejecting. It can be done in Excel, but this 
 is exceedingly slow even in 64-bit, so I am looking to do it through some 
 scripting. Just as an example, a sort which takes 10 min in Excel takes 
 ~10 sec max with the unix command sort (seems crazy, no?). Any suggestions?
 
 Thanks, and sorry for being off-topic,
 
 Jacob
 
 -- 
 ***
 Jacob Pearson Keller
 Northwestern University
 Medical Scientist Training Program
 email: j-kell...@northwestern.edu
 ***

Re: [ccp4bb] Off-topic: Best Scripting Language

2012-09-12 Thread James Stroud


On Sep 12, 2012, at 1:00 PM, George Sheldrick wrote:

 It is the lack of compatibility between different versions mentioned by Ethan 
 that really put me off learning PYTHON.


Python is backwards compatible. I have reams of code I wrote in python 2.3 that 
still works in 2.7 without modification.

Also, python (aka python 2) and python 3000 (aka python 3) are considered two 
different languages. It's not reasonable to consider them one language and then 
complain that they are incompatible. Python 3 was created as a new language 
(and should be treated as such) precisely because it breaks compatibility with 
python 2. That was the intent of the language authors.

You blame the authors for recognizing limitations of a language and inventing a 
new one to overcome those limitations.

If the FORTRAN authors would have done that about 30 years ago, we all might be 
programming in FORTRAN.

James

Re: [ccp4bb] Does anyone have used DelPhi to calculate the Electrostatic potentials of DNA ？

2012-07-11 Thread James Stroud

I think delphi doesn't need to know about connectivity, only the atom 
identities. So the distinction between DNA and protein should be irrelevant and 
thus the input files would be the same.

If you just want to visualize the delphi EP, try this page:

  http://structure.usc.edu/howto/delphi-surface-pymol.html

James



On Jul 10, 2012, at 8:36 PM, dengzq1987 wrote:

 Hi all,
 recently，I want to use DelPhi to calculate the Electrostatic potentials of 
 DNA.but in the manual,i can not find the method to create the inputfile 
 fort.11 、fort.12 and fort.13.Does anyone have experience on this?  Please 
 suggest
 
 Thank you in advance
 
 Sincerely
 dengzq

Re: [ccp4bb] One little clash

2012-07-11 Thread James Stroud

It looks like dityrosine, usually caused by radiation damage (I think UV is the usual culprit).http://www.nugowiki.org/images/thumb/c/ca/HMDB06045.png/220px-HMDB06045.pngJamesOn Jul 11, 2012, at 1:37 PM, Lukacs, Christine wrote:Hi all-I have a protein that crystallizes in I422, and diffracts well, between 1.3-1.7A. Beautiful density, slightly higher final R-factors than you might expect at this resolution (low to mid 20s). The density is all beautiful, except that I have this one little clash, between a few atoms from a tyrosine and its symmetry mate. In this picture I have it modeled as an Alanine and you can see the two tyrosine rings interlocking; and there is clearly no alternate conformation.image003.pngSince it is not near my site of interest, I have been pretty much ignoring it, going through refinement with it as an alanine, then changing it at the very end to a tyrosine and just minimizing B-s, no positional. Now that I plan to publish a bunch of these, I should probably figure out what is really going on. Any insights?ThanksChristineChristine LukacsRocheThis message is intended for the use of the named recipient(s) only and may contain confidential and/or proprietary information. If you are not the intended recipient, please contact the sender and delete this message. Any unauthorized use of the information contained in this message is prohibited.

Re: [ccp4bb] One little clash

2012-07-11 Thread James Stroud

The hydroxyls were on the wrong carbons in the previous picture I sent. These are correct.JamesOn Jul 11, 2012, at 1:37 PM, Lukacs, Christine wrote:Hi all-I have a protein that crystallizes in I422, and diffracts well, between 1.3-1.7A. Beautiful density, slightly higher final R-factors than you might expect at this resolution (low to mid 20s). The density is all beautiful, except that I have this one little clash, between a few atoms from a tyrosine and its symmetry mate. In this picture I have it modeled as an Alanine and you can see the two tyrosine rings interlocking; and there is clearly no alternate conformation.image003.pngSince it is not near my site of interest, I have been pretty much ignoring it, going through refinement with it as an alanine, then changing it at the very end to a tyrosine and just minimizing B-s, no positional. Now that I plan to publish a bunch of these, I should probably figure out what is really going on. Any insights?ThanksChristineChristine LukacsRocheThis message is intended for the use of the named recipient(s) only and may contain confidential and/or proprietary information. If you are not the intended recipient, please contact the sender and delete this message. Any unauthorized use of the information contained in this message is prohibited.

Re: [ccp4bb] SOMoRe

2012-07-09 Thread James Stroud


On Jul 9, 2012, at 11:02 AM, Tim Gruene wrote:

 Dear Fulvio,
 
 http://lmgtfy.com/?q=somore+molecular+replacement
 
 should help you out, especially the but last paragraph of the first
 link provided.


Not Found

The requested URL /~djamrog/somore.html was not found on this server.

Re: [ccp4bb] Help with curves+

2012-07-02 Thread James Stroud

The star is usually for sugar atoms, not for the bases. I don't remember what 
curves wants.

Does your PDB have apostrophes instead of stars? If so you should just do a 
global search and replace.

If you still have a problem, you should copy-paste the error message.

James


On Jul 2, 2012, at 12:44 PM, Nikolai Suslov wrote:

 Hello,
 
 I am trying to analyze an RNA helix with curves+. I keep receiving an
 error message that base atom C1*  is missing from the pdb. C1 atoms
 are certainly all there.
 
 Any advice on how to troubleshoot this will be much appreciated.
 
 Sincerely,
 Nikolai Suslov

[ccp4bb] Layer Groups: B2 recognized?

2012-06-29 Thread James Stroud

Hello Everyone and especially symmetry experts,

I found at http://img.chem.ucl.ac.uk/sgp/medium/003dy1.htm that B2 (SG #2) is a 
recognized space group that has a basis rotated from P2 (and as a result has 
two new positions).

I'm wondering whether a similar B2 for P2 (LG #3) is recognized for the layer 
groups? I checked wikipedia (http://en.wikipedia.org/wiki/Layer_group) and the 
Bilbao server (http://www.cryst.ehu.es/subperiodic/get_sub_gen.html) and could 
find no evidence of a B2 layer group being recognized, but I can also come up 
with no logical reason why it shouldn't be recognized.

Thank you for any input,

James

--
James Stroud

http://www.jamesstroud.com

[ccp4bb] Question on Symmetry Axis Notation Convention

2012-06-14 Thread James Stroud

Hello All,

I would like to discuss symmetry axes, but I'm not sure what the notation 
convention is. For example, I'd like to say something about a 2(1) along the 
x-axis, but the phrase the 2(1) symmetry axis along x is a bit cumbersome to 
repeat many times or to put in a table. So I'd like a shorthand, maybe 
something like x(2_1) (where the preceding _ means that the 1 is 
subscript. Another way I like is x_{2(1)} (where the curly braces mean that 
all of 2(1) is subscript).

Does anyone know what the convention is or if there is one?

Thanks in advance for any help.

James

Re: [ccp4bb] how to get phase of huge complex

2012-06-12 Thread James Stroud

If you want to use se-Met, you might want to start by labeling only one protein 
at a time. For example, if you have A,B,C,D, grow crystals like this:

se-A, B, C, D
A, se-B, C, D,

etc.

Then try combinations of 2, then 3, then if you haven't got the phases you 
need, try all 4.

And remember, if 2 different combinations diffract and they are isomorphous, 
then you can try MIR too.

Also, if your complex can stay intact on a gel shift, look at this paper: 
http://www.ncbi.nlm.nih.gov/pubmed/10903954

James


On Jun 12, 2012, at 8:46 PM, LISA wrote:

 Hi all,
  
 My work is to solve huge complex containing 4 different proteins and total 
 molecular weight is about 300 KD. I can purify the complex by co-expression 
 them in E.coli.  This complex contains 8 protein A, 2 protein B and 1 protein 
 C and D. protein B and protein C  have homology structures deposited in PDB 
 database. No homology structure available for protein A and D, which 
 contribute 60% of the whole molecular weight for the complex. 
  
   Now I am trying to find a way to solve the phase of this complex. I am 
 thinking of use sad or mad with se-Met.   There total 111 Met residues in 
 this complex. Is it possible to solve this complex by se-Met? Does someone 
 have experience to solve huge complex structure with se-met? It is also very 
 welcome for all the suggestion. Thank you.
  
 All the best,
  
 Lisa

Re: [ccp4bb] Who is using 64-bit Linux?

2012-04-13 Thread James Stroud


On Apr 13, 2012, at 1:24 PM, James Holton wrote:

 I tried downgrading the operating system to 32-bit, but that reduced the 
 number of CPUs available in the system from 24 to 8.  Still don't know why 
 that is 

I'm probably wrong, but I'll guess that a 32 bit operating system can only 
spare 3 of those bits to address CPUs ;-)

James

Re: [ccp4bb] Question about PyMol

2012-04-11 Thread James Stroud

Yes.

File - Save Molecule...

James


On Apr 11, 2012, at 1:17 PM, Harman, Christine wrote:

 Hi All,
 Sorry for the dumb question, but if there was a way to export coordinates 
 from PyMol.  Is yes, then how? 
  
 Thanks
  
 Christine

Re: [ccp4bb] very informative - Trends in Data Fabrication

2012-04-03 Thread James Stroud


On Apr 3, 2012, at 7:19 PM, Katherine Sippel wrote:

 I would also consider looking into adding an RSS feed to your site so that 
 those people interested in your articles can be informed without spamming the 
 boards.

Why continue to punish him? Adding an RSS feed means installing and configuring 
an RSS server. Aren't there rules against cruel and inhumane punishment?

There are many free newsfeed disseminators. Twitter is the most famous. There 
are others, maybe better, so I'm not being a twittervangelist here.

My point is this: free and easy is better than difficult.

James

Re: [ccp4bb] a small trick for protein and organic compound cocrystallization.

2012-03-30 Thread James Stroud

On Mar 30, 2012, at 1:04 PM, Bryan Lepore wrote:
 On Fri, Mar 30, 2012 at 4:02 AM, Kevin Jin kevin...@gmail.com wrote:
 Here is way I have used for [...]
 
 I hate to be a curmudgeon, but can a list member please explain why
 this is not specifically blogspam or spam - or whatever it is exactly?


It's not really spam, but there are better channels to communicate these type 
of updates. Twitter is probably the best known channel. People get to subscribe 
and authors can push their contributions to willful and interested subscribers. 
The author posts a summary of what's behind the link and then posts the link, 
just as Jin is doing with his ad-hoc feed here.

For example, a few days ago I created the @amyloids twitter feed 
(https://twitter.com/#!/amyloids) that features recent publications I have read 
and that I also think are of interest to amyloid researchers.

Instead of being to critical, let's just direct authors to use the proper 
channels. Social media has moved fast in recent years, and we should be 
understanding as the scientific community plays catch-up.

If Jin created a twitter feed for tips, it would probably get some subscribers 
from this board--as long as we don't begrudge the Jin an opportunity to send 
out an advertisement for the feed itself.

James

Re: [ccp4bb] DNA length for crystallization

2012-02-15 Thread James Stroud

Use 5' overhangs of two and make the DNA 10, 11, 15, 20, 21 25, 26, 30, or 31 
bases in length. Count the overhangs in the length.

If you don't know where to start, try 15, 25, and 26 first because they will 
make 2(1) screws, which are good for crystals.

James


On Feb 15, 2012, at 1:06 AM, LISA wrote:

 Hi all,
 
 I have a DNA binding protein. I can get crystals when I mix 8-28 nt dsDNA 
 with my protein. But neither of them has good diffraction. Some biochemical 
 data said the longer of DNA, the tigher of the binding betwwen DNA and my 
 protein. The binding is not sequence-specfic. Does anyone have suggestion of 
 the optimization? What is the good length of DNA for crystallization?
 Thank you.
 
 Lisa

Re: [ccp4bb] Crystal Structures as Snapshots--Summary

2012-02-15 Thread James Stroud

I feel compelled to throw a few references into the ring.

NFAT is a protein where you get a good sampling of snapshots:

1. Folded up as a monomer when interacting with partner proteins:

http://www.ncbi.nlm.nih.gov/pubmed/9510247
http://www.ncbi.nlm.nih.gov/pubmed/16873067

2. Extended as a dimer:

http://www.ncbi.nlm.nih.gov/pubmed/12949493

3. Folded up as a monomer when interacting with a partner protein which happens
to be itself as an extended dimer:

http://www.ncbi.nlm.nih.gov/pubmed/18462673

4. Wrapped around DNA as a monomer without partners:.

http://www.ncbi.nlm.nih.gov/pubmed/14643663

In this last reference you get a sample of extended, wrapped around, and folded
up all in the same unit cell!

James

On Feb 15, 2012, at 1:48 PM, Jacob Keller wrote:

Dear Crystallographers,

thanks for all of the responses and conversation. I have culled
together the various references which have been sent on the BB and
which I have come up with, and posted them below. Worthy of special
mention, I think, is the first one (Lange et al), in which 46 (!)
different crystal structures are pitted against a lot of RDC NMR data,
and the match seems to be excellent (although it seems you probably
have to know both methods fairly well to evaluate this properly.)
Anyway, for asserting that variances between crystal structures at
least in some cases represent differences between
physiologically-relevant states in solution, the Lange paper is really
on the mark.

Thanks again,

Jacob

Lange OF, Lakomek NA, Farès C, Schröder GF, Walter KF, Becker S,
Meiler J, Grubmüller H, Griesinger C, de Groot BL.
Recognition dynamics up to microseconds revealed from an RDC-derived
ubiquitin ensemble in solution.
Science. 2008 Jun 13;320(5882):1471-5. PubMed PMID: 18556554.

Kondrashov, D.A., Zhang, W., Aranda, R.t., Stec, B., and Phillips,
G.N., Jr. (2008). Sampling of the native conformational ensemble of
myoglobin via structures in different crystalline environments.
Proteins 70, 353-362.

Zhang, X. J., Wozniak, J. A., and Matthews, B. W. (1995) Protein
flexibility and adaptability seen in 25 crystal forms of T4 lysozyme,
Journal of molecular biology 250, 527-552.

Long, SB, Casey, P., Beese, LS (2002) The reaction path of protein
farnesyltransferase at atomic resolution. Nature Oct 10;
419(6907):645-50.
http://www.ncbi.nlm.nih.gov/pubmed?term=The%20reaction%20path%20of%20protein%20farnesyltransferase%20at%20atomic%20resolution

J. R. Kiefer, C. Mao, J. C. Braman and L. S. Beese (1998) “Visualizing
DNA replication in a catalytically active Bacillus DNA polymerase
crystal” Nature 6664:304-7.
http://www.ncbi.nlm.nih.gov/pubmed?term=Visualizing%20DNA%20replication%20in%20a%20catalytically%20active%20Bacillus%20DNA%20polymerase%20crystal

Mancini EJ, Kainov DE, Grimes JM, Tuma R, Bamford DH, Stuart DI (2004)
Atomic snapshots of an RNA packaging motor reveal conformational
changes linking ATP hydrolysis to RNA translocation.
Cell 118(6):743-55
http://www.cell.com/abstract/S0092-8674(04)00837-2

Nature. 2009 Dec 3;462(7273):669-73.
Hidden alternative structures of proline isomerase essential for catalysis.
Fraser JS, Clarkson MW, Degnan SC, Erion R, Kern D, Alber T.

***
Jacob Pearson Keller
Northwestern University
Medical Scientist Training Program
email: j-kell...@northwestern.edu
***

Re: [ccp4bb] Crystal Structures as Snapshots

2012-02-10 Thread James Stroud

How could they not be snapshots of conformations adopted in solution?

James


On Feb 10, 2012, at 1:25 PM, Jacob Keller wrote:

 Dear Crystallographers,
 
 I am looking for references which discuss the validity of the
 assertion that multiple crystal structures of the same or similar
 proteins can be considered freeze-frame snapshots of actual
 conformations assumed in solution. In a way, the assertion seems
 almost definitely true to me, but on the other hand, I could imagine
 some objections as well. Seems there should be some classic literature
 here...
 
 All the best,
 
 Jacob
 
 
 
 -- 
 ***
 Jacob Pearson Keller
 Northwestern University
 Medical Scientist Training Program
 email: j-kell...@northwestern.edu
 ***

Re: [ccp4bb] Crystal Structures as Snapshots

2012-02-10 Thread James Stroud

So the implication is that some of these treatments might allow the protein to 
overcome energetic barriers that are prohibitive in solution--after the protein 
is already in the solid state and not in solution any more?

Another view is that crystallization is a result of stabilizing conformations 
that are accessible in solution.

On the point of physiological relevance, it wasn't mentioned in the original 
question.

James


On Feb 10, 2012, at 1:34 PM, Nat Echols wrote:

 On Fri, Feb 10, 2012 at 12:29 PM, James Stroud xtald...@gmail.com wrote:
 How could they not be snapshots of conformations adopted in solution?
 
 Packing billions of copies of an irregularly-shaped protein into a
 compact lattice and freezing it to 100K isn't necessarily
 representative of solution, especially when your solution contains
 non-physiological amounts of salt and various organics (and possibly
 non-physiological pH too).
 
 -Nat

Re: [ccp4bb] Crystal Structures as Snapshots

2012-02-10 Thread James Stroud

The contrast seems to boil down to the semantics of the word snapshot.

In my definition, I assume that the uncertainty of a structure is an intrinsic 
quality of the structure and thus included in the meaning of snapshot. Part 
of that uncertainty comes from averaging.

James


On Feb 10, 2012, at 1:51 PM, Jacob Keller wrote:

 Interesting to juxtapose these two responses:
 
 James Stroud:
 How could they not be snapshots of conformations adopted in solution?
 
 David Schuller:
 How could that possibly be the case when any structure is an average of all
 the unit cells of the crystal over the timespan of the diffraction
 experiment?
 
 JPK

[ccp4bb] 3 Letter for 1'-deoxyribofurnaose-5'-phosphate?

2012-01-27 Thread James Stroud

Hello All,

Do any of you RNA structural biologists know what is the 3 letter code for 
1'-deoxyribofurnaose-5'-phosphate (abasic site in RNA)? I found that the 
1'2'-dideoxy is 3DR, but could not find a lead on the 1'-deoxy. I scoured 
HIC-UP to no avail, which I think is the most comprehensive hetero-compound 
library.

If I knew what the 3 and R stood for in 3DR I could probably guess. I'm 
pretty sure the D stands for the 2' deoxy.

Thank you for any help!

James

Re: [ccp4bb] 3 Letter for 1'-deoxyribofurnaose-5'-phosphate? - SOLVED!

2012-01-27 Thread James Stroud

Hello All,

Two people named Rob told me that the the 3 letter code is N.

   http://ligand-expo.rcsb.org/reports/N/N/index.html

Coincidentally, the old 3 letter name is ROB.

Thank you for your help!

James




 On Fri, Jan 27, 2012 at 2:38 PM, James Stroud xtald...@gmail.com wrote:
 Hello All,
 
 Do any of you RNA structural biologists know what is the 3 letter code for 
 1'-deoxyribofurnaose-5'-phosphate (abasic site in RNA)? I found that the 
 1'2'-dideoxy is 3DR, but could not find a lead on the 1'-deoxy. I scoured 
 HIC-UP to no avail, which I think is the most comprehensive hetero-compound 
 library.
 
 If I knew what the 3 and R stood for in 3DR I could probably guess. I'm 
 pretty sure the D stands for the 2' deoxy.
 
 Thank you for any help!
 
 James

Re: [ccp4bb] writing scripts-off topic

2012-01-24 Thread James Stroud

I should know better than to touch a flame post, but, but here goes:

flame
don't use anything but python. They are nightmare languages, sloppy, and force 
you to format the code by inserting semantically redundant brackets and 
semicolons in a specific way rather than dispensing these redundancies 
altogether
/flame

On Jan 24, 2012, at 1:59 AM, Tim Gruene wrote:
 [flame=;-)]
 P.S.: don't use python. It's a nightmare language, sloppy, it forces you
 to format the code in a specific way rather than your own way and ...
 [/flame]

Also, if you use python and you want to format your code in your own way, you 
should learn about python's I don't care if anyone, including me, can read my 
code parenthetical construct:


print \n.join((lambda Ru,Ro,Iu,Io,IM,Sx,Sy:reduce(lambda x,y:x+y,map(lambda y,
Iu=Iu,Io=Io,Ru=Ru,Ro=Ro,Sy=Sy,L=lambda yc,Iu=Iu,Io=Io,Ru=Ru,Ro=Ro,i=IM,
Sx=Sx,Sy=Sy:reduce(lambda x,y:x+y,map(lambda x,xc=Ru,yc=yc,Ru=Ru,Ro=Ro,
i=i,Sx=Sx,F=lambda xc,yc,x,y,k,f=lambda xc,yc,x,y,k,f:(k=0)or (x*x+y*y
=4.0) or 1+f(xc,yc,x*x-y*y+xc,2.0*x*y+yc,k-1,f):f(xc,yc,x,y,k,f):chr(
64+F(Ru+x*(Ro-Ru)/Sx,yc,0,0,i)),range(Sx))):L(Iu+y*(Io-Iu)/Sy),range(Sy
(-2.1, 0.7, -1.2, 1.2, 30, 60, 24)[i*60:(i+1)*60] for i in xrange(24))


(adapted from 
http://effbot.org/pyfaq/is-it-possible-to-write-obfuscated-one-liners-in-python.htm)

James

Re: [ccp4bb] writing scripts-off topic

2012-01-24 Thread James Stroud


On Jan 24, 2012, at 11:24 AM, Ian Tickle wrote:
 Maybe a Python expert will answer this
 but I've often wondered, what happens if as some editors do
 (particularly if as I do you have to use different editors at
 different times depending on where you are working, such as on Windows
 working remotely from home or Linux at work), you could have a mixture
 of space and tab characters in the file?


Just have a policy of no tabs. I make sure all of my editors interpret a tab as 
a set number of spaces (I like 2).

A tab was meant as a separator and not a formatting character anyway. Have you 
ever edited a document where someone used 8 tabs to center a line of text 
rather than simply applying centered formatting? If you have, you'll realize 
that tabs are not meant to be formatting characters.

When I first picked up a python book and it told me that indent had semantic 
relevance, I immediately thought of fortran, so almost gave it up. Two days 
later, python's use of whitespace seemed more natural than the alternative. 
Let's look at the Java hello world program again

class HelloWorldApp {
public static void main(String[] args) {
System.out.println(Hello World!);
}
}

Notice that this obviously veteran programmer has used whitespace formatting in 
a natural way to improve the readability of the code. Java did not enforce 
this. If a language harnesses this formatting for semantics, then all of the 
brackets and the semicolon become redundant.

You probably already format your code in a way consistent with python's 
whitespace rules if you have been programming for any length of time.

James

Re: [ccp4bb] secondary structure output

2012-01-24 Thread James Stroud

Try DSSP2PDB.

http://structure.usc.edu/dssp2pdb/


It's a perl script.

Crossover flame note: I spent a lot of time learning perl and I still don't 
recommend the language ;-)

James


On Jan 24, 2012, at 1:57 PM, Ed Pozharski wrote:

 I am looking for a program/server that would determine secondary
 structure from a pdb file and then output a new pdb file with
 HELIX/SHEET records.  I have a model for which pymol fails to produce
 correct secondary structure.  DSSP and STRIDE identify the secondary
 structure correctly but I'd need to convert their output myself to
 either pdb header or pymol script.  So I wonder maybe something like
 that already exists.
 
 Gracias,
 
 Ed
 
 
 
 
 -- 
 I'd jump in myself, if I weren't so good at whistling.
   Julian, King of Lemurs

Re: [ccp4bb] secondary structure output

2012-01-24 Thread James Stroud

This reminds me of stride2pdb in case you want to use more than pymol with your 
secondary structure assignments:

http://structure.usc.edu/stride2pdb/

James


On Jan 24, 2012, at 2:20 PM, Martin Hällberg wrote:

 You can try STRIDE2PyMOL:
 
 http://pldserver1.biochem.queensu.ca/~rlc/work/pymol/stride_ss.py
 
 /Martin
 
 On Jan 24, 2012, at 9:57 PM, Ed Pozharski wrote:
 
 I am looking for a program/server that would determine secondary
 structure from a pdb file and then output a new pdb file with
 HELIX/SHEET records.  I have a model for which pymol fails to produce
 correct secondary structure.  DSSP and STRIDE identify the secondary
 structure correctly but I'd need to convert their output myself to
 either pdb header or pymol script.  So I wonder maybe something like
 that already exists.
 
 Gracias,
 
 Ed
 
 
 
 
 -- 
 I'd jump in myself, if I weren't so good at whistling.
  Julian, King of Lemurs

Re: [ccp4bb] writing scripts-off topic

2012-01-23 Thread James Stroud


On Jan 23, 2012, at 9:46 PM, Yuri Pompeu wrote:

 Hello Everyone,
 I want to play around with some coding/programming. Just simple calculations 
 from an input PDB file, B factors averages, occupancies, molecular weight, so 
 forth...
 What should I use python,C++, visual basic?
 thanks

Python is the most practical. Here is a simple python program:

print Hello World

Feel the power.

Python can be that simple or can be arbitrarily complex, nuanced, or abstract. 
You can write entire applications in python or small utilities. If you practice 
good habits, you will begin building reusable libraries from day one, saving 
time over the long haul.

STAY AWAY from proprietary nonsense like visual basic and from languages that 
do not facilitate reusability, like perl or other 1980's era shell languages. 
You will find yourself porting or abandoning your code, which is not a good use 
of your time.

I also do not recommend overweight languages like java, which create programs 
that never seem to deploy correctly and take about 5 times more code to create 
than should be necessary. Here's the java Hello World:


class HelloWorldApp {
public static void main(String[] args) {
System.out.println(Hello World!); // Display the string.
}
}


Public static void main? Don't bother.

And python can be VERY fast for calculations if you use free and popular 
libraries like numpy and scipy. These librares are wrappers around optimized 
fortran and C libraries that you will never have to use directly.

I recommend staying away from very low level languages like C or fortran, too. 
It is good to know these languages, but not so good to use them. Your 
creativity should go towards implementing cool ideas and should not be 
squandered on plugging memory leaks. It's better to use high level languages 
that leverage your time most effectively.

James

Re: [ccp4bb] Off-topic: ELNs

2012-01-18 Thread James Stroud

On Jan 18, 2012, at 6:17 AM, Anastassis Perrakis wrote:
 Here at the NKI, we had formed a committee to look at ELN solution two years 
 ago.
 We had interviewed three vendors, and run two tests with twenty users.
 
 A brief description of the outcome:
 
 1. None of the twenty test-users was satisfied with any of the two solutions 
 - and each was annoyed for a different reason.

Would you mind elaborating on any of the reasons that resonated most with you?

Are these ELNs simply over-engineered? You mention the requirement to make 
experiment templates. This sounds cumbersome and a potential for duplicated 
effort.

Do ELNs generally require templates or other overhead in the form of end-user 
effort?

James

Re: [ccp4bb] artificial tetramer

2011-12-12 Thread James Stroud

This is not trivial. Assuming an arbitrary origin, the simplest 4-fold symmetry 
operation (4-fold rotation) has 5 free parameters (translation along the 
symmetry axis is irrelevant). The biggest problem is determining the values for 
these parameters. For example, once you apply the symmetry, your molecule may 
clash with its symmetry mates or not even contact them. And even if you solve 
this latter problem automatically (which is not trivial because of 
irregularity), that leaves a net of 3 parameters describing the orientation of 
the protomer.

James


 
On Dec 12, 2011, at 1:34 PM, Fred wrote:

 Hi Tim,
 Thanks for your message and sorry if I wasn't clear. I don't have neither the 
 axis orientation nor the rotation matrix. I would like to create them but 
 don't know how and which program to use. Theoretically a have to create a 
 axis (vector) at some distance of the molecule into the cell and give it the 
 4-fold propriety. Quite simple, but don't which program to use.
 Regards,
 Fred
 
 
 Em 12-12-2011 18:23, Tim Gruene escreveu:
 -BEGIN PGP SIGNED MESSAGE-
 Hash: SHA1
 
 Hello Fred,
 
 if you know the rotation matrix, you can use pdbset with its 'rotate'
 keyword.
 It is not clear to me whether or not you have the rotation matrix or how
 you define rotation.
 
 Cheers,
 Tim
 
 On 12/12/2011 08:49 PM, Fred wrote:
 Hi List,
 I would like to build an artificial tetramer from a monomer PBD file.
 All that I have is the coordinates it self with CRYST/CELL information
 cards. The artificial 4-fold axis has an arbitrary orientation into the
 cell. I mean, its not parallel to any crystallographic axis and have to
 be at a certain distance of the molecule. This sounds conceptually
 simple, but I would like to do that in batch mode for hundreds of PDB's.
 Could someone, please, tell me the easiest way/program to do that?
 Thanks in advance,
 Fred
 
 - -- - --
 Dr Tim Gruene
 Institut fuer anorganische Chemie
 Tammannstr. 4
 D-37077 Goettingen
 
 GPG Key ID = A46BEE1A
 
 -BEGIN PGP SIGNATURE-
 Version: GnuPG v1.4.10 (GNU/Linux)
 Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/
 
 iD8DBQFO5mKwUxlJ7aRr7hoRAsyuAKDfnH50sG/EuKiFz7tCzgTUtnZrdQCg3zSn
 cqs8GHOu5M3JQahA/CofR1k=
 =tDUj
 -END PGP SIGNATURE-

Re: [ccp4bb] artificial tetramer

2011-12-12 Thread James Stroud

Coot can do this using the Rubik's Cube principle: transform to some state 
where the operation can be performed, perform the operation, then transform 
back.

So, in coot, I would

(1) rotate the molecule to the appropriate orientation
(2) move to the appropriate place in the unit cell
(3) change the symmetry to P4
(3) apply P4 symmetry
(4) change the symmetry back to whatever
(5) move the tetramer back to the appropriate place in the unit cell
(6) rotate the molecule back

Then, your original molecule will be in the original place surrounded by it's 
symmetry mates.

James



On Dec 12, 2011, at 2:36 PM, Fred wrote:

 Hi James,
 In my first post arbitrary orientation into the cell only means not 
 parallel to any crystallographic axis, which would simplify things very much. 
 I want to apply the 4-fold axis to the protein coordinates. If I have a cell 
 and therefore an origin, I can take a point at any distance of the origin, 
 pass a vector/axis through it and take the 3 others molecules by symmetry. 
 That's trivial, given the point, the orientation and the property of the 
 rotation. Don't know which program to use.
 Regards,
 Fred
 
 
 
 Em 12-12-2011 19:18, James Stroud escreveu:
 This is not trivial. Assuming an arbitrary origin, the simplest 4-fold 
 symmetry operation (4-fold rotation) has 5 free parameters (translation 
 along the symmetry axis is irrelevant). The biggest problem is determining 
 the values for these parameters. For example, once you apply the symmetry, 
 your molecule may clash with its symmetry mates or not even contact them. 
 And even if you solve this latter problem automatically (which is not 
 trivial because of irregularity), that leaves a net of 3 parameters 
 describing the orientation of the protomer.
 
 James
 
 
 
 On Dec 12, 2011, at 1:34 PM, Fred wrote:
 
 Hi Tim,
 Thanks for your message and sorry if I wasn't clear. I don't have neither 
 the axis orientation nor the rotation matrix. I would like to create them 
 but don't know how and which program to use. Theoretically a have to create 
 a axis (vector) at some distance of the molecule into the cell and give it 
 the 4-fold propriety. Quite simple, but don't which program to use.
 Regards,
 Fred
 
 
 Em 12-12-2011 18:23, Tim Gruene escreveu:
 -BEGIN PGP SIGNED MESSAGE-
 Hash: SHA1
 
 Hello Fred,
 
 if you know the rotation matrix, you can use pdbset with its 'rotate'
 keyword.
 It is not clear to me whether or not you have the rotation matrix or how
 you define rotation.
 
 Cheers,
 Tim
 
 On 12/12/2011 08:49 PM, Fred wrote:
 Hi List,
 I would like to build an artificial tetramer from a monomer PBD file.
 All that I have is the coordinates it self with CRYST/CELL information
 cards. The artificial 4-fold axis has an arbitrary orientation into the
 cell. I mean, its not parallel to any crystallographic axis and have to
 be at a certain distance of the molecule. This sounds conceptually
 simple, but I would like to do that in batch mode for hundreds of PDB's.
 Could someone, please, tell me the easiest way/program to do that?
 Thanks in advance,
 Fred
 
 - -- - --
 Dr Tim Gruene
 Institut fuer anorganische Chemie
 Tammannstr. 4
 D-37077 Goettingen
 
 GPG Key ID = A46BEE1A
 
 -BEGIN PGP SIGNATURE-
 Version: GnuPG v1.4.10 (GNU/Linux)
 Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/
 
 iD8DBQFO5mKwUxlJ7aRr7hoRAsyuAKDfnH50sG/EuKiFz7tCzgTUtnZrdQCg3zSn
 cqs8GHOu5M3JQahA/CofR1k=
 =tDUj
 -END PGP SIGNATURE-

Re: [ccp4bb] better way to post your density snapshots

2011-12-08 Thread James Stroud

On Dec 8, 2011, at 8:39 AM, Ed Pozharski wrote:However, I'd see noharm in posting just a small cutout of the map in the region ofinterest. It's not a difficult task (fft/mapmask or perhaps some usfmagic), but is there some user-friendly approach to cutting out a smallmap volume?There's probably a slick program somewhere, but attached is a command line tool I made several years ago and still use all of the time. To use it, you need ccp4 and mapman from USF in your path. Running the script with "mapreg -h" yields the documentation below. It can cut a map using a box, extend or cut to the ASU, extend a map to a cell, or use a pdb file to make a box. It can also cull a map around a PDB.To use it, the attachment needs to be untarred (tar zxvf mapreg.tar).One of these days I may give it a web page.James* mapreg version 0.02*** output a region of a cns map*** copyright James C. Stroud, 2008*** distributed under the GNU Public License**Usage: mapreg [-h] [-c weight] [-b border] [-s symm] [-t type] [-x 'a b c alpha beta gamma'] mapfile [x1 y1 z1 x2 y2 z2 | ASU | CELL | pdbfile ]Flags: -h print this help -b border define a border if using a pdb file to define region default is 5 (5 Angstroms) -c weight culls according to a weighting factor between 0 and 1 -x cell the sides and angles *must* be in single quotes -s symm usually symmetry number is needed (ispcgr number) -t type type of input map - default is cns -o outfile name of output file (name generated if not supplied)Description: Mapreg takes a cns map as input and outputs a new region of the map as specified at the command line. The possible region specifiers are: x1 y1 z1 x2 y2 z2 : the region defined by the two grid unit or fractional coordinate points (x1,y1,z1) and (x2,y2,z2) 'ASU' : the CCP4 default asymmetric unit 'CELL' : the whole unit cell pdbfile : a pdb file defining the limits of the regionif border is defined, then this will be the borderin Angstroms around pdbfile to define the region If the region specifier is left out, then mapreg will output the whole unit cell. CULLING === If a culling factor is supplied and a pdb file is used for trimming, then culling will be attempted. Culling trims the map to the the atoms of the pdb file if supplied. Without the pdb file, the program terminates with an error if a culling factor is supplied. The culling weighting factor is, for all practical purposes, arbitrary. Play with it to get the desired results. Start with 0.1 and go up (tighter) or down (less tight). Make sure you are aware of the caveats of culling before you use this to make figures for publication.

mapreg.tar
Description: Unix tar archive

[ccp4bb] [ANNOUNCEMENT] mapreg has a home page

2011-12-08 Thread James Stroud

Hello all,

Since no one offered a good utility for Ed Pozharski's idea to easily cut a 
region of a density map, I made a permanent web location for my utility called 
mapreg:

  http://mapreg.bravais.net/

Enjoy.

James

Re: [ccp4bb] Efficient way of showing residue conservation

2011-12-07 Thread James Stroud

Usually you put a statistic like this in the tempFactor field (B) and then 
color by B-factor in pymol or similar.

I'm certain there is a facility for filling this entry somewhere. If not, then 
a fairly trivial server is waiting for someone to create it and claim the glory.

Google something like alignment b-factor or clustal b-factor. That's my 
best guess.

James


On Dec 7, 2011, at 10:26 PM, Yuri Pompeu wrote:

 I once saw a figure showing the protein as surface, but instead of having it 
 coloured by atom type
 or potential, it was shown by percent conservation in the family. Something 
 like red highly conserved, all the way to white, not conserved at all...
 Now, I assume the figure was done by uploading aligned sequnces of several 
 members of a family, and the colouring
 the generated surface accordingly. 
 Does anyone know a way to do this more elegantly than what I tried doing?
 ps. I quit colouring them manually after I remebered my protein was 407 aa 
 long...

Re: [ccp4bb] Withdrawal of subscription

2011-11-29 Thread James Stroud

With such a nice parting letter, we find it disheartening to let you go.

James

On Nov 29, 2011, at 7:44 PM, debanjan choudhuri wrote:

 To whom it may concern,
 I am writing this in request to the cancellation of my subscription to CCP4BB 
 mail.
 It was indeed a very rewarding and a knowledgeable experience to be a part of 
 this family. 
 With regards
 Debanjan Choudhuri

Re: [ccp4bb] Movements of domains

2011-11-21 Thread James Stroud


On Nov 21, 2011, at 3:04 PM, Filip Van Petegem wrote:

 So the question is: how you can state that a particular movement was 
 'significantly large' compared to the resolution limit?

I can think of a different but related question. How significant is a 
particular movement compared to a measured coordinate error? One way to measure 
the coordinate error in this example is to least-squares superpose the two 
instances of the domain in question and calculate the rmsd.

This makes the calculation of significance independent of the resolution of the 
data set.

James

Re: [ccp4bb] Movements of domains

2011-11-21 Thread James Stroud


On Nov 21, 2011, at 3:52 PM, Filip Van Petegem wrote:

 As mentioned for X-ray structures, a Luzzati analysis may give information 
 about the positional errors, but there should be an increased resolution when 
 comparing domain movements, because it's unlikely for all atoms to have an 
 error in the same direction.


Here's how I think about it:

If you use the empirical coordinate error that I described previously, you can 
use simple statistics to calculate how likely you are to get a coordinated 
movement (relative to a fixed landmark).

I can use a 1-d case as an example. In this 1-d case, let's pretend that we 
have a domain of N=25 atoms where atom 2 is about 1 away from atom 1 and atom 3 
is 2 away from atom 1 and one away from atom 2, etc, with a standard deviation 
of 1 for the position of the atoms. If atom 1 for domain A is at 1, this is just

A_j = j

Then you can have domain B that has moved +1 compared to domain A:

B_j = j+1

Since we have an alignment (B_j - A_j), then we can calculate the movement, X:

X = mean(B) - mean(A)

We can also calculate the error of the ensemble (aka the error of the mean):

sigmaE = std( (B - mean(B)) - (A - mean(B)) ) / sqrt(25)

Then, we can calculate how likely it is we observe the movement X by tail 
integration of the cumulative normal distribution. We will justify this for the 
3-d case because the least squares superposition (from which we estimate the 
coordinate error) assumes normality.

Here is a simulation of this scenario in python:

py import numpy
py from scipy.special import ndtr
py a = numpy.array([numpy.random.normal(j) for j in xrange(25)])
py b = numpy.array([numpy.random.normal(j+1) for j in xrange(25)])
py a
array([  1.38125295,  -0.27126096,   1.7597104 ,   1.36242299,
 3.88327659,   4.33063307,   5.00544708,   7.0258,
 7.83945228,   9.72101719,  10.36231633,  10.29176378,
11.78497375,  12.16082056,  14.31057296,  13.25941344,
17.93779336,  18.05626047,  18.62148347,  20.52756478,
19.73362283,  21.83953268,  22.28038617,  23.24545481,  22.96192518])
py b
array([  3.32750181,   2.42664791,   3.23309368,   4.32882699,
 6.59985764,   6.49597664,   5.27921723,   7.8573831 ,
 9.98722475,  10.65225383,  11.69970159,  11.67435798,
12.16191254,  13.69297801,  14.21845382,  17.21423427,
16.89347161,  17.68778305,  17.89371115,  18.7679351 ,
20.84842496,  20.69249899,  23.97436807,  23.54011453,  26.84986504])
py X = b.mean() - a.mean()
py sigma_ensemble = ((b - b.mean()) - (a - a.mean())).std() / math.sqrt(25)
py X_standardized = (X - 0) / sigma_ensemble
py 2 * ndtr(-abs(X_standardized))
0.00011596192653578624

This means, for the 1-d scenario I describe, (using the random arrays generated 
above), the movement is expected about once for every 10,000 experiments, 
providing a p-value, or estimate of significance. Note that the 2 comes from 
the fact that the cumulative distribution has 2 tails.

A 3-D calculation using the rmsd as the coordinate error would be similar 
except that you use Euclid's formula to calculate the distances in higher 
dimensions (instead of the absolute value of a simple subtraction as in 1-d).

James

Re: [ccp4bb] Movements of domains

2011-11-21 Thread James Stroud

On Nov 21, 2011, at 5:23 PM, James Stroud wrote:
 except that you use Euclid's formula to calculate the distances in higher 
 dimensions

I meant to say Euclidian distance. Euclid's formula has a specific meaning 
that is different.

Re: [ccp4bb] Movements of domains

2011-11-21 Thread James Stroud


On Nov 21, 2011, at 6:34 PM, Jacob Keller wrote:
 I am curious how all of this can be more than splitting hairs, i.e.,
 under what conditions can this 1Ang domain motion mean something
 biologically significant?

To engage in the discussion, I think we had to accept this:

On Nov 21, 2011, at 3:04 PM, Filip Van Petegem wrote:
 I'm not talking about the fact that this movement was artificially caused by 
 crystal packing or something similar.
 Just for whatever the reason (whether packing, pH, ligand binding, ...), you 
 simply observe the movement.   

So the point of the discussion, as I understand it, is to figure out whether 
the movement warrants further consideration in the first place, i.e. whether it 
is significant with respect to the error of the models.

I think it doesn't take too much energy to discount the attempt to quantify the 
statistical significance by claiming that one can't imagine how such a change 
might be biologically significant. I'm really not privy to the structures in 
question, so I am in no position to make this judgement.

James

Re: [ccp4bb] Archiving for fraud detection

2011-11-04 Thread James Stroud


On Nov 4, 2011, at 2:09 AM, Chris Morris wrote:

 One argument for archiving images has been that reprocessing could 
 demonstrate deliberately deceptive structures.
 
 In fact, what is needed for this is not necessarily the image. It is the last 
 data file that was produced by a trusted computer.


Although this is a good idea from the perspective of storage, it is difficult 
to implement. 

For this idea to work, you need a (1) certificate system, (2) certificate 
authority. The certification is necessary to verify that the data file was 
indeed generated by a trusted computer. The chosen file needs to be certified 
by the authority and the certification archived on a trusted system. None of 
these requirements are terribly problematic. The infrastructure for a 
certificate system is free in the form of openSSL. Almost any lab or 
institution could easily become a certificate authority. The storage 
requirements for the certificates are trivial. For example, if a certificate 
were 2 KB, then, for the 8,000 structures per year, the storage requirements 
would be 1.6 MB. After 1000 years, we would fill up my $14.95 2 GB thumb drive.

The difficulty is that certification should be done on the file before it is 
transferred from the trusted computer. This requires inserting the 
certification process somewhere in the transfer pipeline, which is difficult 
because it requires all the synchrotrons to actually implement it. Allowing the 
user to produce the certificate after transfer is as useful as having no 
certificate system at all.

Then there is the issue of data collection on a home source.

James

Re: [ccp4bb] raw data deposition

2011-10-27 Thread James Stroud

On Oct 27, 2011, at 5:22 PM, Francis E Reyes wrote:
 So I ask again, are there literature examples where reevaluation of the 
 crystallographic data has directly resulted in new biological insights into 
 the system being modeled?

This is a poor criterion on which to base any conclusions or decisions. We can 
blame the lack of examples on unavailability of the data.

Right now, I'd love to get my hands on the raw images for a particular cryoEM 
data set, but they are not available--only the maps. But the maps assume one 
symmetry and I have a hypothesis that the true symmetry is different. I could 
test my hypothesis by reprocessing the data were it available.

James

Re: [ccp4bb] IUCr committees, depositing images

2011-10-26 Thread James Stroud


On Oct 26, 2011, at 9:59 AM, Patrick Shaw Stewart wrote:

 The principle is that the movie is written to the same area of memory, 
 jumping back to the beginning when it is full (this part is not essential, 
 but it makes the principle clear).  Then, when the photographer takes his 
 finger off the trigger, the last x seconds is permanently stored.  So you 
 keep your wits about you, and press the metaphorical store button just 
 after you have got the movie in the can so to speak

This idea seems equivalent to only storing permanently those datasets that 
actually yield structures worthy of deposition.

James

Re: [ccp4bb] IUCr committees, depositing images

2011-10-25 Thread James Stroud


On Oct 24, 2011, at 3:56 PM, James Holton wrote:

 The PDB only gets about 8000 depositions per year

Just to put this into dollars. If each dataset is about 17 GB in size, then 
that's about 14 TB of storage that needs to come online every year to store the 
raw data for every structure. A two second search reveals that Newegg has a 3GB 
hitachi for $200. So that's about $1000 / year of storage for the raw data 
behind PDB deposits.

James

Re: [ccp4bb] Biological assembly

2011-10-20 Thread James Stroud

Since a crystallographic 3-fold generates the trimer (of dimers) then A1-B2 can be the ASU in this case.Just generate the symmetry mates with the A1-B1 dimer and then make a new PDB of A1-B2 then generate the symmetry mates of A1-B2 to see that the lattice is complete.I compulsively made and attached an illustration to show what I mean.James

H3.pdf
Description: Adobe PDF document
On Oct 20, 2011, at 3:40 AM, Kayashree M wrote:Thank you Sir for the suggestions. Hi,If, in your case, no possible asymmetric unit can contain A1-B2, then you deposit A1-B1 (or I suppose A2-B2...) and indicate to the PDB (like placing cards in the header cards) the operator to be used (and the subunit it applies to) in order to generate the most likely biological dimer. Normally the PDB can take care of that.The protein looks like the letter "C", So in one of the trimer it is arrangedas "C" while in the other trimer (stacked) it is arranged like "inverted C", SoThe dimer A1-B1 and A2-B2 are same while A1-B1 and A1-B2 are different.Thanking you With RegardsKavya-- 
This message has been scanned for viruses and
dangerous content by
MailScanner, and is
believed to be clean.

Re: [ccp4bb] Biological assembly

2011-10-19 Thread James Stroud

On Oct 19, 2011, at 4:36 AM, Kayashree M wrote:
 We have a structure which is a homodimer in the asymmetric unit.
 PISA predicts most probable assembly as a dimer but this
 dimeric assembly is different from what is solved (offcourse
 we can generate the symmetry equivalent molecule and get that).

This last sentence is a bit vague. Can you take the just dimer that PISA 
predicts, fit this dimer to the lattice (i.e. each monomer sitting correctly in 
density but retaining the dimeric relationship predicted by PISA), and then 
generate the complete lattice using just this fitted dimer and crystallographic 
symmetries?

If so, that means that the PISA dimer is equivalent to the ASU you can deposit 
the PISA dimer as the ASU.

James

Re: [ccp4bb] MR - small coiled coil, 1.65A = 1.000 solutions, all of them wrong

2011-10-17 Thread James Stroud

 The programs give me solutions with reasonable maps, but it is never 
 possible to refine to achieve Rvalues below 0.40.

This strongly suggests a space group issue.

If the systematic absences are compelling, you should try to drop it down to P2 
first. Look at your packing in the C222(1). If you change the direction of one 
of the coils relative to the other, will it break the orthorhombic symmetry and 
drop it to monoclinic? This may provide a clue to the space group you should 
try.

James


On Oct 17, 2011, at 11:09 AM, Napoleão Valadares wrote:

Hi there!
I got crystals from some synthetic peptides I bought, they are 30 residues 
 long and are supposed to form a coiled coil. I collected various data sets 
 (home source, Brookhaven and Diamond), including some at the resolution of 
 1.65 A, for which the space group appears to be C222 or C2221. The unit cell 
 is small, 22.67, 88.06, 26.13, and the Matheus Coefficient indicates that's 
 there's only one helix in the asymmetric unit and a 25% solvent content.
 
I have tried A LOT of Molecular Replacement using Phaser and Phenix 
 AutoMR. I'm using a 80% identity coiled coil helix as search model. The 
 programs give me solutions with reasonable maps, but it is never possible 
 to refine to achieve Rvalues below 0.40. Additionally, maps from different 
 solutions look reasonable, so I'm thinking these are all bias.
 
I have 5 other synthetic 30 residues peptides (that crystallize in 
 different space groups and diffract to lower resolutions), including a 
 SelenoMethionine (SM) derivative (but it does not have enough anomalous 
 signal, ASU is too big, it is possible that the SM are disordered). I'm stuck 
 on this since March.
 
Regarding the search model, I already tried trimming some or all side 
 chains and removing 2, 3 or 5 residues on each/both sides. I also tried other 
 search models. Maybe some magic combination of parameters on Phaser or 
 other programs can help me.
 
What is your advice regarding how to proceed with MR? Is there some 
 program, procedure, parameter, pray or human sacrifice that could help me?
Thank you.
Regards,
  Napo

Re: [ccp4bb] MR - small coiled coil, 1.65A = 1.000 solutions, all of them wrong

2011-10-17 Thread James Stroud

I should have said not compelling.

James


On Oct 17, 2011, at 12:40 PM, James Stroud wrote:

 The programs give me solutions with reasonable maps, but it is never 
 possible to refine to achieve Rvalues below 0.40.
 
 This strongly suggests a space group issue.
 
 If the systematic absences are compelling, you should try to drop it down to 
 P2 first. Look at your packing in the C222(1). If you change the direction of 
 one of the coils relative to the other, will it break the orthorhombic 
 symmetry and drop it to monoclinic? This may provide a clue to the space 
 group you should try.

Re: [ccp4bb] data processing problem with ice rings

2011-10-14 Thread James Stroud

First of all, are you sure those are ice rings? They do not look typical. I 
think you might have salt crystals from dehydration *before* freezing. 
Otherwise, I think your freezing went well. Maybe try a humidity controlled 
environment when you freeze.

Second, I'm not so sure the bad stats come from the contaminating rings. The 
lattice seems to have some sort of problem, like a split lattice. You might be 
able to tackle this problem by increasing your spot size or skewing it's shape 
to compensate for the split. You need to investigate several images throughout 
the run to see whether and how to manipulate your spot size. Sometimes, the 
split lengthens the spots in the direction of the phi axis and you get lucky. 
But I think the phi axis might be horizontal in this picture, which makes 
things a little trickier. From one image, it is difficult to tell the pathology 
of this crystal.

In principle, if you can accurately measure the most high-resolution spots 
visible (which appear to be about 1.9 Å, guessing from your log file) then you 
will have a pretty good data set, even with the contaminating rings.

Personally, I'd use Denzo for this data, but I don't know what is vogue with 
the community right now. I still use O, so my tastes might be somewhat 
antiquated.

James



On Oct 13, 2011, at 11:12 PM, ChenTiantian wrote:

 Hi there,
 I am processing a dataset which has bad ice rings (as you can see in the 
 attach png file).
 I tried both XDS and imosflm, and got similar results, it seems that adding  
 EXCLUDE_RESOLUTION_RANGE cannot get rid of the effects of the ice rings.
 the following is part of the CORRECT.LP which is the second attached file, 
 you can find more details there. 
 
   SUBSET OF INTENSITY DATA WITH SIGNAL/NOISE = -3.0 AS FUNCTION OF RESOLUTION
  RESOLUTION NUMBER OF REFLECTIONSCOMPLETENESS R-FACTOR  R-FACTOR 
 COMPARED I/SIGMA   R-meas  Rmrgd-F  Anomal  SigAno   Nano
LIMIT OBSERVED  UNIQUE  POSSIBLE OF DATA   observed  expected  
 Corr
 
  4.24   371525537  5545   99.9%  46.9% 52.7%
 371502.4850.8%19.4%   -28%   0.5135136
  3.01   553449002  9840   91.5%  62.7% 65.1%
 551161.7668.3%48.1%   -28%   0.5207760
  2.46   84636   12699 12703  100.0%  67.4% 84.7%
 846341.5573.0%54.2%   -19%   0.513   12104
  2.13   97910   14743 14987   98.4% 254.5%199.3%
 979080.16   276.2%  4899.9%   -23%   0.473   14037
  1.90  110260   16846 16940   99.4% 299.2%303.3%   
 1102450.06   325.0%   -99.9%   -17%   0.422   15995
  1.74  118354   18629 18744   99.4%1062.0%   1043.6%   
 118317   -0.20  1156.4%   -99.9%   -13%   0.380   17414
  1.61  122958   20193 20331   99.3% 967.5%   1571.1%   
 1228680.10  1059.7%   987.3%-2%   0.402   18348
  1.51  125075   21554 21794   98.9% 838.9%   1355.1%   
 1249330.08   922.6%  1116.9%-1%   0.402   18977
  1.42   72057   17042 23233   73.4% 640.8%775.3%
 703910.08   732.5%   826.7%-8%   0.425   10003
 total  823746  136245144117   94.5% 166.4%166.7%   
 8215620.40   181.1%   296.7%   -15%   0.435  119774
 
 Note that I/SIGMA of each resolution shell is 2.5, so how should I do to 
 process the dataset properly? Any suggestion about this super ice rings?
 Thanks!
 
 Tiantian
 
 -- 
 Shanghai Institute of Materia Medica, Chinese Academy of Sciences
 Address: Room 101, 646 Songtao Road, Zhangjiang Hi-Tech Park,
 Shanghai, 201203 
 csrc.pngCORRECT.LP

Re: [ccp4bb] should the final model be refined against full datset

2011-10-14 Thread James Stroud

Each R-free flag corresponds a particular HKL index. Redundancy refers to the 
number of times a reflection corresponding to a given HKL index is observed. 
The final structure factor of a given HKL can be thought of as an average of 
these redundant observations.

Related to your question, someone once mentioned that for each particular space 
group, there should be a preferred R-free assignment. As far as I know, nothing 
tangible ever came of that idea.

James



On Oct 14, 2011, at 5:34 PM, D Bonsor wrote:

 I may be missing something or someone could point out that I am wrong and why 
 as I am curious, but with a highly redundant dataset the difference between 
 refining the final model against the full dataset would be small based upon 
 the random selection of reflections for Rfree?

Re: [ccp4bb] Ice rings...

2011-10-11 Thread James Stroud

I've used a technique called annealing, which amounts to holding an index 
card between the cryo stream and the crystal for a few seconds then removing 
the card quickly.

In my experience, about 70% of the time the diffraction is worse and about 30% 
of the time the ice rings will be gone with slightly improved diffraction, 
allowing recovery of a significant range of data. Most of the time, though, I 
find another crystal that had a better initial freeze, so annealing has never 
been a life saver--but it could be under dire circumstances.

James




On Oct 11, 2011, at 9:30 AM, Dr. Thayumanasamy Somasundaram wrote:

 Francis,
 
 I would like to bring your attention to our paper in Acta Cryst D Volume 66 
 (6), 741-744 (2010) where we deal with spots under the ice-rings. We have 
 been very successful in eliminating the ice-rings and recover the data 
 underneath. If you are interested you can request the Python script from 
 Michael Chapman at OHSU.
 De-icing: recovery of diffraction intensities in the presence of ice rings, 
 Michael S. Chapman and Thayumanasamy Somasundaram
 
 
 If you need help please e-mail me outside the CCP4BB. 
 
 On 10/11/2011 11:16 AM, Francis E Reyes wrote:
 
 All,
 
 
 So I have two intense ice rings where there appear to be lattice spots in 
 between them. 
 
 I understand that any reflections that lie directly on the ice ring are 
 useless, however, how do software programs (HKL2000, d*Trek, mosflm, XDS) 
 deal with these intermediate spots? 
 
 It would seem to me that employing a 'resolution cut off' just before the 
 ice ring (on the low resolution side) would be improper, as there are spots 
 on the high resolution side of the ice. (see enclosed .tiff)
 
 
 In fact, how do these programs deal with spots lying on ice rings? Are they 
 rejected by some algorithm by those programs during integration, or is it up 
 to the scaling/merging (by SCALA for example) step to deal with them? 
 
 Thanks!
 
 F
 
 
 
 -
 Francis E. Reyes M.Sc.
 215 UCB
 University of Colorado at Boulder
 
 
 
 
 
 
 -- 
 
 Dr. Thayumanasamy Somasundaram [Soma]
 Director, X-Ray Crystallography Facility (XRF)
 Off. Ph: (850)644-6448  | Lab Ph: (850)645-1333   
 Fax:(850)644-7244   | E-mail: tsomasunda...@fsu.edu   
 
 URI: www.sb.fsu.edu/~soma | URI: www.sb.fsu.edu/~xray 
 Postal Address--
 91, Chieftan Way | KLB 414
 Institute of Molecular Biophysics 
 Florida State University  
 Tallahassee, FL 32306-4380, USA.

Re: [ccp4bb] detect dsDNA

2011-10-01 Thread James Stroud

If you can reproduce the crystals and have the material

1. Harvest several large crystals.
2. Make several transfers to fresh mother liquor to wash.
3. Dissolve in DNA loading dye without SDS
4. Run on a native gel (e.g. 6% polyacrylamide, 0.5XTBE, etc.).
5. Include positive control lanes for protein, DNA, and complex.
6. Stain with ETBr. Take a picture.
7. Wash out the ethidium in accordance with state, local, federal, UN, laws, 
filling out the proper documentation ad nauseum. Take a safety class just to be 
sure you didn't miss something. Hug a bureaucrat.
8. Stain with coomassie. Take a picture.

That should tell you more than you need to know.

James

On Sep 30, 2011, at 9:36 PM, zq deng wrote:

 Hi all,
 .
 recently,I got a crystal of protein-DNA crystal.i used silver stainto prove 
 that it is a protein crystal.Does anyone have method to detect if there is 
 DNA in the crystal.
 any suggestion will be appreciated.
  
 Regards, 
 deng

[ccp4bb] Space Group Table

2011-09-14 Thread James Stroud

Hello All,

Is Table 6 of http://cci.lbl.gov/sginfo/hall_symbols.html the authoritative 
mapping of space group numbers to Hermann-Mauguin symbols (i.e., can we count 
on major software packages to honor this mapping if present in a PDB file)? I 
notice that this web page was authored by a couple of prominent developers.

Thank you,

James

Re: [ccp4bb] structure based superposition

2011-08-18 Thread James Stroud

Check out Theseus: http://www.theseus3d.org/

You will need a sequence based alignment. This alignment can be provided 
automatically by muscle if you have it installed 
(http://www.drive5.com/muscle/)--this is different from pymol. Pymol seems to 
do an SVD to obtain the alignment. I'm not up to date on what all the other 
programs are using, but I think most use least-squares, so you need to provide 
a sequence based alignment for these programs

Theseus results are always as good or better (subjectively speaking) than 
straight least-squares superposition.

I don't know how the Theseus/muscle combo compares to pymol's use of SVD in 
terms of alignment quality because I haven't compared them directly.

James


On Aug 18, 2011, at 3:10 AM, Suda Ravindran wrote:

 Dear all,
 
 I would like to know the tools/servers/programs that can be used for 
 structure based superposition of two or more proteins. Please help me out..!!
 
 Thanks,
 
 -Suda

Re: [ccp4bb] Intensity-Weighted Reciprocal Lattice

2011-07-22 Thread James Stroud

One holistic way to view a reciprocal lattice (or subset thereof) without the 
requirement for 3-d or moving frames is with a pole figure.

 http://en.wikipedia.org/wiki/Pole_figure

See Palmer  Ladd for a better discussion.

I don't know what software can make a pole figure, though. 

James


On Jul 21, 2011, at 6:36 PM, Michael Thompson wrote:

 Hello ccp4  phenix BB members,
 
 I would like to view the intensity-weighted reciprocal lattice for several 
 data sets that I have collected. (The data have been indexed, integrated and 
 scaled with Denzo and Scalepack.) I was wondering if anyone could offer some 
 advice on what might be the best and/or most practical way to do this?
 
 I know that there are several programs out there that can generate sections 
 (i.e. 0,k,l) of the reciprocal lattice, such as LABELIT and xrayplot. Are 
 there any other options for doing this, perhaps within ccp4 and/or phenix? I 
 once saw someone give a presentation and they had a little video that showed 
 a three dimensional section of the reciprocal lattice rocking back and forth, 
 which was really cool. I liked this because I felt like it gave a much more 
 holistic representation as opposed to viewing a bunch of individual sections. 
 I don't know if there is an easy way to do this, or if this person somehow 
 managed to create this 3D depiction from a series of sections.
 
 Any tips or recommendations would be appreciated.
 
 Thanks,
 
 Mike
 
 
 
 
 -- 
 Michael C. Thompson
 
 Graduate Student
 
 Biochemistry  Molecular Biology Division
 
 Department of Chemistry  Biochemistry
 
 University of California, Los Angeles
 
 mi...@chem.ucla.edu

Re: [ccp4bb] Off Topic: How to delete loops from a protein

2011-07-19 Thread James Stroud

I've found that predator is one of the best services of this sort:

  Ref: PROTEINS: Structure, Function, and Genetics 27:329–335 (1997)

  Server: http://mobyle.pasteur.fr/cgi-bin/portal.py?#forms::predator

The server is slow but the service is good.

James



On Jul 18, 2011, at 10:47 PM, Francois Berenger wrote:

 Hi Obayed,
 
 If I understood your question well,
 you are looking for something called secondary structure prediction.
 
 I googled these keywords and found this server:
 http://bioinf.cs.ucl.ac.uk/psipred/
 
 You may find other interesting servers on the web and
 some literature comparing them.
 
 I think such methods need only the sequence of your
 protein to predict its secondary structures.
 
 Hope this helps,
 Francois.
 
 On 07/19/2011 02:14 PM, Eric Larson wrote:
 Hi Obayed,
 
 you could give in situ protolysis a try. This is where you add a bit of
 protease along with you target protein to the crystallization drop. It
 has been quite successful for the folks at the SGC. Here are the
 relevant references:
 
 Dong A, et al. In situ proteolysis for protein crystallization and
 structure determination. Nat Methods. 2007 Dec;4(12):1019-21.PMID:
 17982461. (http://www.ncbi.nlm.nih.gov/pubmed/17982461)
 
 Wernimont A, Edwards A. In situ proteolysis to generate crystals for
 structure determination: an update. PLoS One. 2009;4(4):e5094. PMID:
 19352432. (http://www.ncbi.nlm.nih.gov/pubmed/19352432)
 
 good luck,
 
 Eric
 
 
 Eric T. Larson, PhD
 Biomolecular Structure Center
 Department of Biochemistry
 Box 357742
 University of Washington
 Seattle, WA 98195
 
 email: larso...@u.washington.edu
 
 
 On Mon, 18 Jul 2011, Obayed Ullah wrote:
 
 
 Hi all
 
 I wrote last time but got only one feedback. I know some of you guys
 must have this experience that how to delete loops from the
 protein. Please help me with suggestions.
 
 I am working with a human protein which have around 20% sequence
 identity with the other proteins of  the same family. Structure
 of some of the proteins from this family have been solved. All the
 solved structures have around 20% identity with my protein. I
 am trying to crystallize  the protein but it looks like very hard to
 get crystal. I have tried different N and C terminally
 truncated constructs for crystallization but no crystal. My feeling is
 that probably there is some flexible loops with in the
 protein which limiting the crystallization.
 
 So I want to delete the loops with in the protein (not to truncate in
 the terminal, I already have done this). I am not asking
 suggestion about how to delete the loop rather how to decide where the
 loop is. I am not sure how much it will be helpful to get a
 homology model of such a protein having low sequence identity. Is
 there any strategy to decide where the loop could be? Does
 anybody know any established/ rational method to do that.
 
 Waiting for your suggestions
 
 Obayed Ullah

Re: [ccp4bb] Off Topic: How to delete loops from a protein/homology modeling

2011-07-19 Thread James Stroud

Homer is one of the best services for homology modeling:

  http://protein.cribi.unipd.it/Homer/

James


On Jul 19, 2011, at 2:06 PM, Paul Kraft wrote:

 Hi Obayed,
 even though there is 20% sequence identity you may be able to get a very good 
 homology model, especially if there is more than one protein structure in the 
 PDB with 20% homology. Then you can overlap the pdbs and find out what 
 structurally needs to be preserved as opposed to what is in total homology 
 preserved. Typically it is the position of turns residues G, D, S,  P, N etc. 
 You won't know until you thread your protein through both pdbs and compare 
 them all. Swiss Pro's Expasy has an easy program that will take an alignment 
 with a pdb and generate a homology model with loops spliced in and energy 
 minimized. There are many other more or less complicated programs, but it's a 
 good one to start with.
 Paul
 
 Dr. Paul Kraft
 Structural Biologist
 cell 586-596-2770
 email: haresea...@yahoo.com
 email: kraft_proteome_resea...@yahoo.com
 
 
 This communication and any attachments contain information which is 
 confidential and may also be privileged. It is for the exclusive use of the 
 intended recipient(s). If you are not the intended recipient(s) please note 
 that any form of disclosure, distribution, copying or use of this 
 communication or the information in it or in any attachments is strictly 
 prohibited and may be unlawful. If you have received this communication in 
 error, please notify the sender and delete the email and destroy any copies 
 of it.
 
 E-mail communications cannot be guaranteed to be secure or error free, as 
 information could be intercepted, corrupted, amended, lost, destroyed, arrive 
 late or incomplete, or contain viruses. We do not accept liability for any 
 such matters or their consequences. Anyone who communicates with us by e-mail 
 is taken to accept the risks in doing so.
 
 --- On Tue, 7/19/11, James Stroud xtald...@gmail.com wrote:
 
 From: James Stroud xtald...@gmail.com
 Subject: Re: [ccp4bb] Off Topic: How to delete loops from a protein
 To: CCP4BB@JISCMAIL.AC.UK
 Date: Tuesday, July 19, 2011, 2:37 AM
 
 I've found that predator is one of the best services of this sort:
 
   Ref: PROTEINS: Structure, Function, and Genetics 27:329–335 (1997)
 
   Server: http://mobyle.pasteur.fr/cgi-bin/portal.py?#forms::predator
 
 The server is slow but the service is good.
 
 James
 
 
 
 On Jul 18, 2011, at 10:47 PM, Francois Berenger wrote:
 
  Hi Obayed,
  
  If I understood your question well,
  you are looking for something called secondary structure prediction.
  
  I googled these keywords and found this server:
  http://bioinf.cs.ucl.ac.uk/psipred/
  
  You may find other interesting servers on the web and
  some literature comparing them.
  
  I think such methods need only the sequence of your
  protein to predict its secondary structures.
  
  Hope this helps,
  Francois.
  
  On 07/19/2011 02:14 PM, Eric Larson wrote:
  Hi Obayed,
  
  you could give in situ protolysis a try. This is where you add a bit of
  protease along with you target protein to the crystallization drop. It
  has been quite successful for the folks at the SGC. Here are the
  relevant references:
  
  Dong A, et al. In situ proteolysis for protein crystallization and
  structure determination. Nat Methods. 2007 Dec;4(12):1019-21.PMID:
  17982461. (http://www.ncbi.nlm.nih.gov/pubmed/17982461)
  
  Wernimont A, Edwards A. In situ proteolysis to generate crystals for
  structure determination: an update. PLoS One. 2009;4(4):e5094. PMID:
  19352432. (http://www.ncbi.nlm.nih.gov/pubmed/19352432)
  
  good luck,
  
  Eric
  
  
  Eric T. Larson, PhD
  Biomolecular Structure Center
  Department of Biochemistry
  Box 357742
  University of Washington
  Seattle, WA 98195
  
  email: larso...@u.washington.edu
  
  
  On Mon, 18 Jul 2011, Obayed Ullah wrote:
  
  
  Hi all
  
  I wrote last time but got only one feedback. I know some of you guys
  must have this experience that how to delete loops from the
  protein. Please help me with suggestions.
  
  I am working with a human protein which have around 20% sequence
  identity with the other proteins of  the same family. Structure
  of some of the proteins from this family have been solved. All the
  solved structures have around 20% identity with my protein. I
  am trying to crystallize  the protein but it looks like very hard to
  get crystal. I have tried different N and C terminally
  truncated constructs for crystallization but no crystal. My feeling is
  that probably there is some flexible loops with in the
  protein which limiting the crystallization.
  
  So I want to delete the loops with in the protein (not to truncate in
  the terminal, I already have done this). I am not asking
  suggestion about how to delete the loop rather how to decide where the
  loop is. I am not sure how much

Re: [ccp4bb] Off Topic: PDB validation server

2011-07-11 Thread James Stroud

On Jul 8, 2011, at 11:13 AM, Katherine Sippel wrote:

 I was shocked to discover that the file with only one questionable solvent 
 in April now has 173 of them. 

One word: Diffusion.

James

Re: [ccp4bb] [CONCEAL]

2011-06-16 Thread James Stroud

I don't think it worked because I can still see your email address.

James


On Jun 16, 2011, at 10:27 AM, Peter Burkhard wrote:

 
 SET CCP4BB CONCEAL
 
 
 Peter Burkhard, PhD, Assoc. Prof. Nanobiotechnology
 The Institute of Materials Science
 University of Connecticut
 97 North Eagleville Road
 Storrs, CT 06269-3136, USA
 Phone:  +1 860 486 3830 
 Fax:  +1 860 486 4745 
 E-mail: peter.burkh...@uconn.edu
 Web:http://www.ims.uconn.edu/~pburkhard/
 
 This communication, including any attachments, is intended solely for the use 
 of the addressee and may contain information which is privileged, 
 confidential, exempt from disclosure under applicable law or subject to 
 copyright. If you are not an intended recipient, any use, disclosure, 
 distribution, reproduction, review or copying is unauthorized and may be 
 unlawful. If you have received this transmission in error, please notify the 
 sender immediately. Thank you.

Re: [ccp4bb] Question about the statistical analysis-might be a bit off topic

2011-06-05 Thread James Stroud

The short answer can be found in item 2 in this link:

  http://science.widener.edu/svb/stats/error.html

The long answer is I highly recommend Error Analysis by John Taylor:

  http://science.widener.edu/svb/stats/error.html

If you can find the first edition (which can fit in your pocket) then consider 
yourself lucky. Later editions suffer book bloat.

James


On Jun 4, 2011, at 10:44 AM, capricy gao wrote:

 
 If means and standard deviations of A and B are known, how to estimate the 
 variance of A/B?
 
 Thanks.

[ccp4bb] INSERTION CODE

2011-05-03 Thread James Stroud

Just giving this thread a title.

James


On May 3, 2011, at 10:06 AM, Ian Tickle wrote:

 James, interesting that you chose residue number 32 for your example,
 because that is the number of one of the two active-site ASPs in the
 aspartic proteinase family (the other is ASP 215) that I (with Tom
 Blundell  others) worked on for many years.  So Ed, it's not just
 relevant to the WuKabat numbering for antibodies.  The idea that
 one would _not_ use consistent numbering (and therefore insertion
 codes) across species (viral, fungal, plant and animal so there is
 huge sequence variability with insertions  deletions everywhere),
 when working with these structures is frankly ludicrous.  I recall
 some programs (FRODO was one) actually required renumbering to the
 ordinals, i.e. 1, 2, 3 ... - that is until I fixed it!  This caused
 endless confusion, not least because there are often other ASPs in the
 vicinity of the active site which could easily get renumbered to 32.
 For me, it's important that when I refer to 'ASP 32' there's no
 possibility that I mean anything other than the active site ASP!
 
 Cheers
 
 -- Ian
 
 On Tue, May 3, 2011 at 5:17 PM, James Holton jmhol...@lbl.gov wrote:
 
 My understanding is that it was introduced for cases where an error in the
 sequence was discovered long after a large body of literature had
 accumulated for the wrong sequence.  That is, imagine some enzyme where an
 important catalytic active site residue was number 152, and lots of people
 had been talking about this residue for years.  Then, when you solve the 3D
 structure, you discover that there is actually a glycine between residues
 32 and 33, what do you do?  Do you change 152 to 153 and put up with all
 the angry letters from enzymologists, telling you that you mislabeled this
 important residue?  In case you don't want to do this, the PDB allows you to
 put in a residue 32A.  Deletions can happen too, but they are easier to
 deal with from a file format standpoint.
 
 -James Holton
 MAD Scientist
 
 On 5/3/2011 6:27 AM, Jahan Alikhajeh wrote:
 
 Dear Friends,
 I have noticed an issue in a pdb file, the term insertion code.
 Does anyone know anything about it? what is it used for?
 Thanks in Advance,
 
 
 Jahan Alikhajeh, Ph.D,
 
 Technical Supervisor,
 
 MAN Corporation LTD,
 
 Keshavarz Boulevard,
 
 Ghods Avenue No. 41,
 
 5th Floor, Tehran, Iran, 14177,
 
 Tel: +982166282841
 
 Fax: +982166282997

Re: [ccp4bb] Assigning secondary structure

2011-04-08 Thread James Stroud

Just use dssp and cite it. For cartoon figures, if you provide a disclaimer, it 
doesn't hurt to tweak it a bit to make it look better.

The idea is that readers are supposed to know that a protein doesn't really 
look like a cartoon--and those who don't understand the distinction probably 
won't have enough knowledge to have an opinion about your assignments anyway.

James



On Apr 8, 2011, at 9:19 AM, Cale Dakwar wrote:

 Hello all,
 
 Given a PDB file of a newly solved protein structure, what is the standard 
 procedure for assigning regions of secondary structure?  And by this I mean 
 to ask, how does one decide which residues form beta strands, which alpha 
 helices, and so on?  Is DSSP sufficient for this?  Are we supposed to 
 manually walk through the entire molecule and assign secondary structure as 
 we deem appropriate based on hydrogen bonding behaviour?  Some other 
 procedure?  And what of structures solved to ~2.7 A (or worse) where we can't 
 be sure of H-bonding.
 
 Cheers,
 Cale

Re: [ccp4bb] What happened to this innovative method by MV King?

2011-04-01 Thread James Stroud

This was not so much an advance but a remarkable observation. We have since 
learned that these clathrates are entirely impractical. The problem is not so 
much their dextrorotatory properties, which are more or less a nuisance, but 
that they are too dense and have absolutely no affinity for other compounds.

James




On Apr 1, 2011, at 3:39 AM, REX PALMER wrote:

 Dear Protein Crystallographers
 I would like to share with you something I came across today. Unfortunately I 
 was only able to copy the first 4 pages of the article by MV King as I need 
 to post the email before 12am and the quality of the copy is somewhat 
 lacking. I was wondering if anyone knows if anything came of the proposed 
 method of heavy atom substitution as a Google Scholar search has failed to 
 bring anything up.
 Best wishes
  
 Rex Palmer
 Birkbeck College, London
 King_1.jpgKing_2.jpgKing_3.jpgKIng_4.jpg

Re: [ccp4bb] Is there any program for specifically calculating Rvalue in CCP4

2011-03-02 Thread James Stroud


On Mar 2, 2011, at 11:13 PM, Ting-Wei Jiang wrote:
 I'm trying to calculate R-value (and free R) specifically  which is between 
 data and the modified structure(refined by myself without help from any 
 program)

You probably mean without help from any refinement program.

Why not fix all atoms, b-factors, etc., and do one round of refinement with 
refmac? This probably isn't the community-approved way to refine your 
structure, but it should do what you want.


James

Re: [ccp4bb] Harold Jeffreys

2011-03-01 Thread James Stroud

Bah...no Kindle Edition...I guess I'll have to keep reading The Grand Design.

James



On Mar 1, 2011, at 4:40 PM, Bernhard Rupp (Hofkristallrat a.D.) wrote:

 For those who concern themselves with such matters, the 1973 edition of
 Sir Harold Jeffreys' Scientific Inference has just been reissued as a
 Cambridge paperback.
 Together with AFW Edwards' Likelihood this is good material for the finer
 aspects of
 inference I did only cursory touch in BMC chapter 7.  
 
 Links and more on
 http://www.ruppweb.org/books/books_and_review_page.htm
 
 Best, BR
 
 1. Edwards AWF (1992) Likelihood - An Account of the Statistical Concept of
 Likelihood and Its Application to Scientific Inference. Baltimore, MD: The
 Johns Hopkins University Press.
 2. Jeffreys H (1973) Scientific Inference. Cambridge: Cambridge University
 Press.
 3. Sivia DS (1996) Data Analysis - A Bayesian Tutorial. Oxford, UK: Oxford
 University Press.
 -
 Bernhard Hieronimus Rupp, Hofkristallrat a.D.
 001 (925) 209-7429
 +43 (676) 571-0536
 b...@ruppweb.org
 hofkristall...@gmail.com
 http://www.ruppweb.org/
 --
 Knowledge: When you know a thing, to know that you know it,
 and when you do not know a thing, to recognize that you 
 do not know it. 
 Conficius.
 --

Re: [ccp4bb] Could someone can help me to explain why EDTA-2Na can formate salt crystals

2011-02-21 Thread James Stroud

The pH is too low and the EDTA is insoluble. You need to be above pH ~8.0.

Your best bet is to try another chelator.

James

On Feb 21, 2011, at 2:22 PM, Yibin Lin wrote:

 Dear all,
 
 I got a lot of salt crystals in reservior solution (well solution), which 
 contains 0.1 M phosphate/citrate ph 4.2, PEG200 47%, EDTA-2Na 0-22mM. 
 Reservior solution appears crystals from 12mM EDTA. Could someone help me to 
 explain why?
 
 Thank you very much!
 
 Yibin

1 2 3 >

1 - 100 of 234 matches

Mail list logo