Hi Toby,
tonight I will check with the MMFF94 literature which is the current
atom typing for that hydrogen and I'll get back to you.
Thanks, cheers
Paolo
On 04/14/2014 03:42 PM, Toby Wright wrote:
Hi,
I've been using the MMFF94 forcefield and noticed an odd behaviour is
a couple of molecules.
> phenolish1 = Chem.MolFromSmiles('Oc1ncccn1')
> phenolish2 = Chem.MolFromSmiles('Oc1ncncc1')
> prop1 = AllChem.MMFFGetMoleculeProperties(Chem.AddHs(phenolish1))
> prop2 = AllChem.MMFFGetMoleculeProperties(Chem.AddHs(phenolish2))
> print prop1.GetMMFFAtomType(7) #Atom 7 is the H of the OH
21
> print prop2.GetMMFFAtomType(7) #Atom 7 is the H of the OH
29
Type 29 is a hydrogen attached to an oxygen in enols, phenols or
HO-C=N which is only sort of the case here (but perhaps pragmatically
we should consider phenol the closest option?)
Digging into the code in GraphMol/ForceFieldHelpers/MMFF/AtomTyper.cpp
we see (between lines 2092 and 2133) that we consider an atom to hit
the phenol case if we have the oxygen attached to a carbon attached
via an aromatic bond to another carbon. We have this in phenolish2 but
not in phenolish1 hence the different outputs. If we change the test
on line 2115 to:
if ((bond->getBondType() == Bond::AROMATIC) ||
((nbr3Atom->getAtomicNum() == 6) && (bond->getBondType() ==
Bond::DOUBLE))) {
then both cases above show the same behaviour, considering phenolish
things to be phenols for the sake of MMFF94 atom typing. Alternatively
we could consider phenolish things to be not phenols and implement
atom type 21 for the hydrogen in both cases. Any thoughts?
Yours,
Toby Wright
PS I'm aware that the tautomers above aren't ideal, these are
fragments snipped from more complex molecules to demonstrate the
behaviour.
--
InhibOx Ltd
------------------------------------------------------------------------------
Learn Graph Databases - Download FREE O'Reilly Book
"Graph Databases" is the definitive new guide to graph databases and their
applications. Written by three acclaimed leaders in the field,
this first edition is now available. Download your free book today!
http://p.sf.net/sfu/NeoTech
_______________________________________________
Rdkit-discuss mailing list
Rdkit-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/rdkit-discuss
--
==========================================================
Paolo Tosco, Ph.D.
Department of Drug Science and Technology
Via Pietro Giuria, 9 - 10125 Torino (Italy)
Tel: +39 011 670 7680 | Mob: +39 348 5537206
Fax: +39 011 670 7687 | E-mail: paolo.to...@unito.it
http://open3dqsar.org | http://open3dalign.org
==========================================================
------------------------------------------------------------------------------
Learn Graph Databases - Download FREE O'Reilly Book
"Graph Databases" is the definitive new guide to graph databases and their
applications. Written by three acclaimed leaders in the field,
this first edition is now available. Download your free book today!
http://p.sf.net/sfu/NeoTech
_______________________________________________
Rdkit-discuss mailing list
Rdkit-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/rdkit-discuss