Re: [ccp4bb] Compounded problems for molecular replacement

2012-10-27 Thread Eleanor Dodson
You don't say whether you have any indication of non cryst translation or 
likely dimer NC axes? The self rotation can help sometimes to select likely 
pairings - for instance if one (or both) domain(s) is forming a dimer. 

Eleanor




On 26 Oct 2012, at 13:43, Bosch, Juergen wrote:

 We had recently a similar case.
 Indexed in P2x2x2x but truly was P21 with variable amount of twin fraction 
 depending on the dataset  beamline between 9% and 40%.
 We were able to solve it using phenix.ensembler using all available known 
 structures plus our various homology models.
 We still have not gone back and done a postmortem analysis of why we failed 
 with a highly similar homolog to find a solution in the first place with 
 either Phaser, Molrep or Phenix using single models.
 It might help to process your data in XDS to take care of the overlaps a bit 
 better in case you have not done so already.
 
 Here's a link advertisement on Hain et al. 2012 
 http://www.ncbi.nlm.nih.gov/pubmed/22982544 /advertisement off.
 
 Jürgen
 ..
 Jürgen Bosch
 Johns Hopkins University
 Bloomberg School of Public Health
 Department of Biochemistry  Molecular Biology
 Johns Hopkins Malaria Research Institute
 615 North Wolfe Street, W8708
 Baltimore, MD 21205
 Office: +1-410-614-4742
 Lab:  +1-410-614-4894
 Fax:  +1-410-955-2926
 http://lupo.jhsph.edu
 
 On Oct 26, 2012, at 8:27 AM, Seijo, Jose A. Cuesta wrote:
 
 Hi all,
  
 I am dealing with a molecular replacement problem for a 60KDa protein 
 composed of 2 rigid domains joined by a flexible linker which can move 
 relative to each other. Sequence identity for my best model is 46% evenly 
 spread, so in principle this should be a tractable problem.
 Then the problems start to pile up:
 a)  The unit cell is 56.7Å, 288.5Å, 69.4Å, 90 93.5, 90. Spacegroup P21. 
 Rmerge 12% to 2.4Å. The data also merges relatively well (Rmerge 16%) in 
 P222 with the same a, c and b axes, now of course in that order. In the P21 
 case, that corresponds to 4 monomers in the asymmetric unit with a solvent 
 content of approx. 50%, giving me 8 domains to find if I separate them.
 b)  The 288 axis means that my data show some overlap in almost all 
 orientations (might be corrected in the future with new datasets), so that 
 my low resolution data are likely unreliable.
 c)   Intensity distributions suggest twinning in either point groups. 
 Actually, they are beyond the perfect twinning case, which I attribute to 
 the reflection overlaps making the strong reflections weaker (integration 
 box too small) and the small stronger (from tails of adjacent strong ones). 
 Of course the latest would mean that the twin fraction estimation is 
 unreliable, but all moments, etc show perfect twin statistics, so I am 
 assuming that there is indeed perfect twinning of some sort.
  
 So, the question is, what is the best strategy to deal with this many (4 or 
 8) body / noisy / twinned problem?
  
 I am trying EPMR with many bodies, but I suspect the twinning would throw it 
 out of the right track, and one domain seems to be too little of the 
 diffracting matter to show any sort of discriminations between solutions and 
 non-solutions if do the usual serial searches. I plan to let autotracing 
 programs be the judge of success, but I am not sure of how well those can 
 deal with twinning. Can Arp-Warp use twinned data?
  
 Thanks in advance for any tips.
  
 Jose.
  
 
 Jose Antonio Cuesta-Seijo, PhD
 Carlsberg Laboratory
 Gamle Carlsberg Vej 10
 DK-1799 Copenhagen V
 Denmark
 
 Tlf +45 3327 5332
 Email josea.cuesta.se...@carlsberglab.dk
 
 
 
 
 



[ccp4bb] Compounded problems for molecular replacement

2012-10-26 Thread Seijo, Jose A. Cuesta
Hi all,

 

I am dealing with a molecular replacement problem for a 60KDa protein composed 
of 2 rigid domains joined by a flexible linker which can move relative to each 
other. Sequence identity for my best model is 46% evenly spread, so in 
principle this should be a tractable problem.

Then the problems start to pile up:

a)  The unit cell is 56.7Å, 288.5Å, 69.4Å, 90 93.5, 90. Spacegroup P21. 
Rmerge 12% to 2.4Å. The data also merges relatively well (Rmerge 16%) in P222 
with the same a, c and b axes, now of course in that order. In the P21 case, 
that corresponds to 4 monomers in the asymmetric unit with a solvent content of 
approx. 50%, giving me 8 domains to find if I separate them.

b)  The 288 axis means that my data show some overlap in almost all 
orientations (might be corrected in the future with new datasets), so that my 
low resolution data are likely unreliable.

c)   Intensity distributions suggest twinning in either point groups. 
Actually, they are beyond the perfect twinning case, which I attribute to the 
reflection overlaps making the strong reflections weaker (integration box too 
small) and the small stronger (from tails of adjacent strong ones). Of course 
the latest would mean that the twin fraction estimation is unreliable, but all 
moments, etc show perfect twin statistics, so I am assuming that there is 
indeed perfect twinning of some sort.

 

So, the question is, what is the best strategy to deal with this many (4 or 8) 
body / noisy / twinned problem?

 

I am trying EPMR with many bodies, but I suspect the twinning would throw it 
out of the right track, and one domain seems to be too little of the 
diffracting matter to show any sort of discriminations between solutions and 
non-solutions if do the usual serial searches. I plan to let autotracing 
programs be the judge of success, but I am not sure of how well those can deal 
with twinning. Can Arp-Warp use twinned data?

 

Thanks in advance for any tips.

 

Jose.

 


Jose Antonio Cuesta-Seijo, PhD
Carlsberg Laboratory
Gamle Carlsberg Vej 10
DK-1799 Copenhagen V
Denmark

Tlf +45 3327 5332
Email josea.cuesta.se...@carlsberglab.dk
 



Re: [ccp4bb] Compounded problems for molecular replacement

2012-10-26 Thread Bosch, Juergen
We had recently a similar case.
Indexed in P2x2x2x but truly was P21 with variable amount of twin fraction 
depending on the dataset  beamline between 9% and 40%.
We were able to solve it using phenix.ensembler using all available known 
structures plus our various homology models.
We still have not gone back and done a postmortem analysis of why we failed 
with a highly similar homolog to find a solution in the first place with either 
Phaser, Molrep or Phenix using single models.
It might help to process your data in XDS to take care of the overlaps a bit 
better in case you have not done so already.

Here's a link advertisement on Hain et al. 2012 
http://www.ncbi.nlm.nih.gov/pubmed/22982544 /advertisement off.

Jürgen
..
Jürgen Bosch
Johns Hopkins University
Bloomberg School of Public Health
Department of Biochemistry  Molecular Biology
Johns Hopkins Malaria Research Institute
615 North Wolfe Street, W8708
Baltimore, MD 21205
Office: +1-410-614-4742
Lab:  +1-410-614-4894
Fax:  +1-410-955-2926
http://lupo.jhsph.edu

On Oct 26, 2012, at 8:27 AM, Seijo, Jose A. Cuesta wrote:

Hi all,

I am dealing with a molecular replacement problem for a 60KDa protein composed 
of 2 rigid domains joined by a flexible linker which can move relative to each 
other. Sequence identity for my best model is 46% evenly spread, so in 
principle this should be a tractable problem.
Then the problems start to pile up:
a)  The unit cell is 56.7Å, 288.5Å, 69.4Å, 90 93.5, 90. Spacegroup P21. 
Rmerge 12% to 2.4Å. The data also merges relatively well (Rmerge 16%) in P222 
with the same a, c and b axes, now of course in that order. In the P21 case, 
that corresponds to 4 monomers in the asymmetric unit with a solvent content of 
approx. 50%, giving me 8 domains to find if I separate them.
b)  The 288 axis means that my data show some overlap in almost all 
orientations (might be corrected in the future with new datasets), so that my 
low resolution data are likely unreliable.
c)   Intensity distributions suggest twinning in either point groups. 
Actually, they are beyond the perfect twinning case, which I attribute to the 
reflection overlaps making the strong reflections weaker (integration box too 
small) and the small stronger (from tails of adjacent strong ones). Of course 
the latest would mean that the twin fraction estimation is unreliable, but all 
moments, etc show perfect twin statistics, so I am assuming that there is 
indeed perfect twinning of some sort.

So, the question is, what is the best strategy to deal with this many (4 or 8) 
body / noisy / twinned problem?

I am trying EPMR with many bodies, but I suspect the twinning would throw it 
out of the right track, and one domain seems to be too little of the 
diffracting matter to show any sort of discriminations between solutions and 
non-solutions if do the usual serial searches. I plan to let autotracing 
programs be the judge of success, but I am not sure of how well those can deal 
with twinning. Can Arp-Warp use twinned data?

Thanks in advance for any tips.

Jose.


Jose Antonio Cuesta-Seijo, PhD
Carlsberg Laboratory
Gamle Carlsberg Vej 10
DK-1799 Copenhagen V
Denmark

Tlf +45 3327 5332
Email 
josea.cuesta.se...@carlsberglab.dkmailto:josea.cuesta.se...@carlsberglab.dk







Re: [ccp4bb] Compounded problems for molecular replacement

2012-10-26 Thread Roger Rowlett
I had a similar issue (8 chains in a large ASU, 2.4 A resolution, 29% 
homology, possibly--and ultimately confirmed--twinned data). Here is how 
I solved my structure (not pretty!):


1. Phaser was used to find an initial placement for the chains using
   (what turned out to be) the twin maps. This actually worked fine to
   get a grossly correct position for all chains in the ASU. But the
   low homology structures were pretty much useless as a starting point
   for refinement in Refmac, and the twin maps were, um, just bad, as
   you would expect. I used this structure as a starting point for
   phases for the next step.
2. Once I had that, is used Parrot and Buccaneer to do density
   modification with NCS and autotraced about 98% of the structure,
   with only a couple of gaps/insertions.
3. Running Phaser with the twin option enabled identified the twin
   fraction easily and resulted in excellent maps for further refinement.

If I had been quicker in recognizing the twinning of the data, I 
probably could have detwinned the data from the start, and Phaser, 
perhaps in combination with Parrot  Buccaneer with NCS, would have 
arrived at a solution quicker.


If you have not used Parrot with NCS pipelined with Buccaneer, give it a 
try. I was very pleasantly surprised (and impressed) at how efficient 
this was at getting me to a place where I could start refining the final 
structure.


Your data quality may hamper your ability to solve the structure, 
especially if you have a large number of overlaps in your 
high-resolution data. In that case, I wonder if you might fare better by 
using data cut off to a lower resolution, where the overlap issue is not 
as severe. With your unit cell, you are probably looking at a long 
camera distance and thin-slicing to get usable data. That's what we have 
to do with a 50 x 150 x 250 A unit cell (C2) which is favored by one of 
our favorite proteins and its variants.


Cheers,

___
Roger S. Rowlett
Gordon  Dorothy Kline Professor
Department of Chemistry
Colgate University
13 Oak Drive
Hamilton, NY 13346

tel: (315)-228-7245
ofc: (315)-228-7395
fax: (315)-228-7935
email: rrowl...@colgate.edu

On 10/26/2012 8:27 AM, Seijo, Jose A. Cuesta wrote:


Hi all,

I am dealing with a molecular replacement problem for a 60KDa protein 
composed of 2 rigid domains joined by a flexible linker which can move 
relative to each other. Sequence identity for my best model is 46% 
evenly spread, so in principle this should be a tractable problem.


Then the problems start to pile up:

a)The unit cell is 56.7Å, 288.5Å, 69.4Å, 90 93.5, 90. Spacegroup P21. 
Rmerge 12% to 2.4Å. The data also merges relatively well (Rmerge 16%) 
in P222 with the same a, c and b axes, now of course in that order. In 
the P21 case, that corresponds to 4 monomers in the asymmetric unit 
with a solvent content of approx. 50%, giving me 8 domains to find if 
I separate them.


b)The 288 axis means that my data show some overlap in almost all 
orientations (might be corrected in the future with new datasets), so 
that my low resolution data are likely unreliable.


c)Intensity distributions suggest twinning in either point groups. 
Actually, they are beyond the perfect twinning case, which I attribute 
to the reflection overlaps making the strong reflections weaker 
(integration box too small) and the small stronger (from tails of 
adjacent strong ones). Of course the latest would mean that the twin 
fraction estimation is unreliable, but all moments, etc show perfect 
twin statistics, so I am assuming that there is indeed perfect 
twinning of some sort.


So, the question is, what is the best strategy to deal with this many 
(4 or 8) body / noisy / twinned problem?


I am trying EPMR with many bodies, but I suspect the twinning would 
throw it out of the right track, and one domain seems to be too little 
of the diffracting matter to show any sort of discriminations between 
solutions and non-solutions if do the usual serial searches. I plan to 
let autotracing programs be the judge of success, but I am not sure of 
how well those can deal with twinning. Can Arp-Warp use twinned data?


Thanks in advance for any tips.

Jose.


Jose Antonio Cuesta-Seijo, PhD
Carlsberg Laboratory
Gamle Carlsberg Vej 10
DK-1799 Copenhagen V
Denmark

Tlf +45 3327 5332
Email josea.cuesta.se...@carlsberglab.dk






Re: [ccp4bb] Compounded problems for molecular replacement

2012-10-26 Thread Phil Jeffrey

Hello Jose,

Depending on what data integration program you used, trying XDS may help 
you out a little with spot overlap.


Example #3 in my rather out-of-date page:
http://xray0.princeton.edu/~phil/Facility/Guides/MolecularReplacement.html
illustrates how you could find 8 domains, especially if you pay 
attention to the rotation angle values for the candidate domain 
solutions.  This example did not have twinning but did have a little 
pseudo-centering.  This is a 15 year-old example from back when I was 
using AMORE, so I should clearly rewrite that page.


Additionally, if the inter-domain flexibility is restricted to rotation 
about a single axis, it would be a good idea to rotate your model so 
that this rotation axis is parallel to the Z axis.  This was a method 
that was exploited with Fab structures (whose elbow angle is a fairly 
restricted rotation).  If so oriented, rotation function peaks relating 
different domains in the same molecule should show very similar alpha, 
beta and differ in gamma.


Good luck,
Phil Jeffrey
Princeton


On 10/26/12 8:27 AM, Seijo, Jose A. Cuesta wrote:

Hi all,

I am dealing with a molecular replacement problem for a 60KDa protein
composed of 2 rigid domains joined by a flexible linker which can move
relative to each other. Sequence identity for my best model is 46%
evenly spread, so in principle this should be a tractable problem.

Then the problems start to pile up:

a)The unit cell is 56.7Å, 288.5Å, 69.4Å, 90 93.5, 90. Spacegroup P21.
Rmerge 12% to 2.4Å. The data also merges relatively well (Rmerge 16%) in
P222 with the same a, c and b axes, now of course in that order. In the
P21 case, that corresponds to 4 monomers in the asymmetric unit with a
solvent content of approx. 50%, giving me 8 domains to find if I
separate them.

b)The 288 axis means that my data show some overlap in almost all
orientations (might be corrected in the future with new datasets), so
that my low resolution data are likely unreliable.

c)Intensity distributions suggest twinning in either point groups.
Actually, they are beyond the perfect twinning case, which I attribute
to the reflection overlaps making the strong reflections weaker
(integration box too small) and the small stronger (from tails of
adjacent strong ones). Of course the latest would mean that the twin
fraction estimation is unreliable, but all moments, etc show perfect
twin statistics, so I am assuming that there is indeed perfect twinning
of some sort.

So, the question is, what is the best strategy to deal with this many (4
or 8) body / noisy / twinned problem?

I am trying EPMR with many bodies, but I suspect the twinning would
throw it out of the right track, and one domain seems to be too little
of the diffracting matter to show any sort of discriminations between
solutions and non-solutions if do the usual serial searches. I plan to
let autotracing programs be the judge of success, but I am not sure of
how well those can deal with twinning. Can Arp-Warp use twinned data?

Thanks in advance for any tips.

Jose.


Jose Antonio Cuesta-Seijo, PhD
Carlsberg Laboratory
Gamle Carlsberg Vej 10
DK-1799 Copenhagen V
Denmark

Tlf +45 3327 5332
Email josea.cuesta.se...@carlsberglab.dk