Hi, I'm trying to understand how to properly use explicit hydrogens in substructure searches. Below is an example. I would like to find all molecules that contain my query with hydrogens at the nitrogens, and I thought I was on the right track ... Why does the first query with the explicit H not match m1?
Thanks Markus <code> from rdkit import Chem from rdkit.Chem.Draw import IPythonConsole from rdkit.Chem import rdDepictor rdDepictor.SetPreferCoordGen(True) IPythonConsole.ipython_useSVG = True m1 = Chem.MolFromSmiles('c1cn[nH]c1N') m2 = Chem.MolFromSmiles('CNc1ccn[nH]1') m3 = Chem.MolFromSmiles('Nc1ccnn(C)1') # do not remove explicit H params = Chem.SmilesParserParams() params.removeHs=False query = Chem.MolFromSmiles('c1cn[nH]c(N([H])([H]))1', params) # first should be True, but all are False m1.HasSubstructMatch(query) m2.HasSubstructMatch(query) m3.HasSubstructMatch(query) # rebuild query with explicit H removed, not what I want query = Chem.MolFromSmiles('c1cn[nH]c(N([H])([H]))1') m1.HasSubstructMatch(query) m2.HasSubstructMatch(query) m3.HasSubstructMatch(query) </code> -- Markus Heller, PhD Senior Scientist Direct: 604.827.1122 Main: 604.827.1147 [A027228F] 2405 Wesbrook Mall, 4th Floor, Vancouver, BC V6T 1Z3 This email and any attachments thereto may contain confidential material for the sole use of the intended recipient. Any review, copying, or distribution of this email (or any attachments thereto) by others is strictly prohibited. If you are not the intended recipient, please contact the sender immediately and permanently delete the original and any copies of this email and any attachments thereto.
_______________________________________________ Rdkit-discuss mailing list Rdkit-discuss@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/rdkit-discuss