Re: [gmx-users] combining differently-generated force-fields

Mark Abraham Thu, 01 May 2008 22:59:01 -0700

[EMAIL PROTECTED] wrote:

I don't have a problem, per se, but would like to discuss the problemsthat may, or may not, arise when mixing force fields.
It is clear to me why one would not want to calculate the free energy ofbinding for two proteins, one using the amber ff and the other using theopls ff; also it is clear that there would be problems simulating a boxof water half of which is tip3p and half of which is spc. The commonthing to these examples is that such simulations would apply dissimilarparameter sets for similar functional groups and therefore any resultscould be subject to significant biases, the source of which will not beobvious to the user.
However, If one was simulating the binding of a protein to DNA, or aprotein embedded in a lipid bilayer, the functional groups are no longershared by different types of macromolecules. Since I work on membraneproteins, let me take the case of an oplsaa protein in a Berger lipidbilayer. Not only are these ff's differently generated, but one isall-atom and one is united-atom. The important difference in this caseis that there are few functional groups of the lipids that resemblethose of the protein e.g. the NH3 of a lipid head-group choline and alysine of the protein. Generally though, the functional groups areentirely different between these macromolecules. I believe that this isalso the case for protein-DNA simulations. Therefore, what biases canpossibly occur by the combination of different ff's in this case thatcould not also occur by combinations that exclusively use a single ff?

Force field parameters are the result of some kind of globaloptimization procedure. As such it is well-known that you should notexpect a strong correlation between a bond stretching parameter and anyreal measure of bond strength. This is because that real interaction isbeing modeled a) approximately, and b) through model interactions notnecessarily localised to the two bonded atoms.

One would not expect to reach the same near-global minimum afteroptimizing over protein parameters for two given sets of waterparameters. Trivially, the water-protein Coulomb interactions will haveto be different. Thus, the intra-protein Coulomb interactions will haveto be different. This may directly affect some bonded interactions,depending on your exclusion treatment. Finally, then can be all mannerof indirect effects that might depend on which local minimum youroptimization ended up on. The same goes for any other sets ofconstrained and free variables you might use in a parameterizationprocess, and IMO makes for a clear presumption of numerical suicide frommixing force fields, possibly except in some fortuitous and well-testedcases. Hopefully this oplsaa-Berger mix is such a case, but I don't knowanything about it.

I take the extreme example and ask: what special relevance do the oplsion parameters have to the opls protein parameters? It seems to me that,although they "derive them in a manner consistent with how the rest ofthe force field was originally derived"(http://wiki.gromacs.org/index.php/Parameterization), in this extremecase I believe that this is an entirely abstract concept of noparticular value. In other words, how can Na+ possibly be generatedconsistently/inconsistently with an amino acid that contains no Na?

In part, the general advice you cite is sound for cases where one is notgoing to do a fully rigorous test of the performance of the parameters -e.g. the antechamber or PRODRG approach. Using a similar methodologygives one some basis for optimism. Using a different one *and nottesting* is random and asks for trouble. Using a different one *andtesting* for performance on observables relevant to the study you wishto perform using those parameters seems quite reasonable to me. The onlyvalue in an extended MM force field is its ability to model a physicalsystem featuring the elements of that extension. If you can demonstrateit does that well enough, then the method by which you extended it seemsirrelevant.

Also, it could be true that achieving success in such a test has beenexperienced to be difficult unless one has followed a similar methodology.

To clearly state my current point of view in the absence of a shred ofdata, I suggest the following: "One should not combine parameters thatare derived inconsistently of one another except in cases where suchcombination can be made without introducing multiple parametricdefinitions of a given functional group."


I would disagree strongly for the above kinds of reasons.

If you believe that, it wouldtherefore be acceptable to combine the following in any way: i) protein,ii) water, iii) ion, iv) DNA, v) lipid, vi) carbohydrate. The seventhgroup: small molecules, is difficult to classify since one must takeinto consideration the specific functional groups. For example, I wouldsuggest that ATP and a protein should be fine if different ff's areused, but that ATP and DNA should use a consistent ff when simulated inconjunction.
As we ramp up our simulations for ever-increasing cpu power and forgromacs 4, these questions are well beyond pedantic. It is one thing todevelop parameters for a small molecule consistently with the themethodology used for the protein/DNA ff. However, simulations of morethan one different type of macromolecule (e.g. protein-DNA simulations)would greatly benefit, it seems, from the ability to use the DNAparameters that lead to the most accurate sampling of DNA phase spaceand the protein parameters that lead to the most accurate sampling ofprotein phase space. It is my conjecture that such combinations wouldnot only be appropriate, but that they would be optimal.

These phase spaces are not independent. A solute phase space is sampleddifferently in different solvent models. There is no reason to supposethat the combinations you suggest would even be close to effective,never mind optimal.

Disclaimer: If you are considering combining differently-generatedforce-fields, please do not take this post as encouragement. Thestandard logic never to combine force-fields is still recommended. Ionly wanted to have some discussion on this topic.
Thanks for all comments, especially those that are in disagreement withmy proposition.


You're welcome :-)

Mark
_______________________________________________
gmx-users mailing list    [email protected]
http://www.gromacs.org/mailman/listinfo/gmx-users
Please search the archive at http://www.gromacs.org/search before posting!

Please don't post (un)subscribe requests to the list. Use thewww interface or send it to [EMAIL PROTECTED]

Can't post? Read http://www.gromacs.org/mailing_lists/users.php

Re: [gmx-users] combining differently-generated force-fields

Reply via email to