Hi Benny, that is a pure InChI problem (not a RDKit one). Back then when the Standard InChI was defined, the 15T and the KET option for the InChI calculation weren't either available or still experimental (I don't remember :-)), so they didn't make it into the standard set of options for the Standard InChI calculation. Hence it isn't too surprising that this tautomer pair doesn't calculate the same Standard InChI (InChI isn't/wasn't particularly strong regarding tautomerism outside rings). You might use (non-standard) InChI and switch the 15T and KET options on, that should fix your particular case.
In general there are still ongoing efforts to make InChI stronger regarding tautomerism: https://pubmed.ncbi.nlm.nih.gov/32043883/ Markus On Tue, Jul 21, 2020 at 12:11 PM Da'Adoosh Binyamin < daado...@tauex.tau.ac.il> wrote: > Hi, > > > > I have a question about RDKit/tautomers. > > > > Let's say I have smiles input: > > > > C[CH]2CCC(=O)C1=C(O)[CH](O)C[CH](O)[CH]12 > > C[CH]2CCC(O)=C1C(=O)[CH](O)C[CH](O)[CH]12 > > > > Now, if I make this code for each input: > > > > m = Chem.MolFromSmiles(input) > > inchi = Chem.rdinchi.MolToInchi(m) > > > > I get different InChIs: > > > > > InChI=1S/C11H16O4/c1-5-2-3-6(12)10-9(5)7(13)4-8(14)11(10)15/h5,7-9,13-15H,2-4H2,1H3 > > > InChI=1S/C11H16O4/c1-5-2-3-6(12)10-9(5)7(13)4-8(14)11(10)15/h5,7-9,12-14H,2-4H2,1H3 > > > > My question is why is it happening. Usually if I enter two tautomers - > they have the same InChI (like it is supposed to be, according to the > literature ). What is the difference in this example? > > > > Thanks, > > Benny > > > _______________________________________________ > Rdkit-discuss mailing list > Rdkit-discuss@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/rdkit-discuss >
_______________________________________________ Rdkit-discuss mailing list Rdkit-discuss@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/rdkit-discuss