[EMAIL PROTECTED] wrote:
I don't have a problem, per se, but would like to discuss the problems that may, or may not, arise when mixing force fields.

It is clear to me why one would not want to calculate the free energy of binding for two proteins, one using the amber ff and the other using the opls ff; also it is clear that there would be problems simulating a box of water half of which is tip3p and half of which is spc. The common thing to these examples is that such simulations would apply dissimilar parameter sets for similar functional groups and therefore any results could be subject to significant biases, the source of which will not be obvious to the user.

However, If one was simulating the binding of a protein to DNA, or a protein embedded in a lipid bilayer, the functional groups are no longer shared by different types of macromolecules. Since I work on membrane proteins, let me take the case of an oplsaa protein in a Berger lipid bilayer. Not only are these ff's differently generated, but one is all-atom and one is united-atom. The important difference in this case is that there are few functional groups of the lipids that resemble those of the protein e.g. the NH3 of a lipid head-group choline and a lysine of the protein. Generally though, the functional groups are entirely different between these macromolecules. I believe that this is also the case for protein-DNA simulations. Therefore, what biases can possibly occur by the combination of different ff's in this case that could not also occur by combinations that exclusively use a single ff?

Force field parameters are the result of some kind of global optimization procedure. As such it is well-known that you should not expect a strong correlation between a bond stretching parameter and any real measure of bond strength. This is because that real interaction is being modeled a) approximately, and b) through model interactions not necessarily localised to the two bonded atoms.

One would not expect to reach the same near-global minimum after optimizing over protein parameters for two given sets of water parameters. Trivially, the water-protein Coulomb interactions will have to be different. Thus, the intra-protein Coulomb interactions will have to be different. This may directly affect some bonded interactions, depending on your exclusion treatment. Finally, then can be all manner of indirect effects that might depend on which local minimum your optimization ended up on. The same goes for any other sets of constrained and free variables you might use in a parameterization process, and IMO makes for a clear presumption of numerical suicide from mixing force fields, possibly except in some fortuitous and well-tested cases. Hopefully this oplsaa-Berger mix is such a case, but I don't know anything about it.

I take the extreme example and ask: what special relevance do the opls ion parameters have to the opls protein parameters? It seems to me that, although they "derive them in a manner consistent with how the rest of the force field was originally derived" (http://wiki.gromacs.org/index.php/Parameterization), in this extreme case I believe that this is an entirely abstract concept of no particular value. In other words, how can Na+ possibly be generated consistently/inconsistently with an amino acid that contains no Na?

In part, the general advice you cite is sound for cases where one is not going to do a fully rigorous test of the performance of the parameters - e.g. the antechamber or PRODRG approach. Using a similar methodology gives one some basis for optimism. Using a different one *and not testing* is random and asks for trouble. Using a different one *and testing* for performance on observables relevant to the study you wish to perform using those parameters seems quite reasonable to me. The only value in an extended MM force field is its ability to model a physical system featuring the elements of that extension. If you can demonstrate it does that well enough, then the method by which you extended it seems irrelevant.

Also, it could be true that achieving success in such a test has been experienced to be difficult unless one has followed a similar methodology.

To clearly state my current point of view in the absence of a shred of data, I suggest the following: "One should not combine parameters that are derived inconsistently of one another except in cases where such combination can be made without introducing multiple parametric definitions of a given functional group."

I would disagree strongly for the above kinds of reasons.

If you believe that, it would therefore be acceptable to combine the following in any way: i) protein, ii) water, iii) ion, iv) DNA, v) lipid, vi) carbohydrate. The seventh group: small molecules, is difficult to classify since one must take into consideration the specific functional groups. For example, I would suggest that ATP and a protein should be fine if different ff's are used, but that ATP and DNA should use a consistent ff when simulated in conjunction.

As we ramp up our simulations for ever-increasing cpu power and for gromacs 4, these questions are well beyond pedantic. It is one thing to develop parameters for a small molecule consistently with the the methodology used for the protein/DNA ff. However, simulations of more than one different type of macromolecule (e.g. protein-DNA simulations) would greatly benefit, it seems, from the ability to use the DNA parameters that lead to the most accurate sampling of DNA phase space and the protein parameters that lead to the most accurate sampling of protein phase space. It is my conjecture that such combinations would not only be appropriate, but that they would be optimal.

These phase spaces are not independent. A solute phase space is sampled differently in different solvent models. There is no reason to suppose that the combinations you suggest would even be close to effective, never mind optimal.

Disclaimer: If you are considering combining differently-generated force-fields, please do not take this post as encouragement. The standard logic never to combine force-fields is still recommended. I only wanted to have some discussion on this topic.

Thanks for all comments, especially those that are in disagreement with my proposition.

You're welcome :-)

Mark
_______________________________________________
gmx-users mailing list    [email protected]
http://www.gromacs.org/mailman/listinfo/gmx-users
Please search the archive at http://www.gromacs.org/search before posting!
Please don't post (un)subscribe requests to the list. Use the www interface or send it to [EMAIL PROTECTED]
Can't post? Read http://www.gromacs.org/mailing_lists/users.php

Reply via email to