Hi Benny,

that is a pure InChI problem (not a RDKit one). Back then when the Standard
InChI was defined, the 15T and the KET option for the InChI calculation
weren't either available or still experimental (I don't remember :-)), so
they didn't make it into the standard set of options for the Standard InChI
calculation. Hence it isn't too surprising that this tautomer pair doesn't
calculate the same Standard InChI (InChI isn't/wasn't particularly strong
regarding tautomerism outside rings). You might use (non-standard) InChI
and switch the 15T and KET options on, that should fix your particular case.

In general there are still ongoing efforts to make InChI stronger regarding
tautomerism: https://pubmed.ncbi.nlm.nih.gov/32043883/

Markus


On Tue, Jul 21, 2020 at 12:11 PM Da'Adoosh Binyamin <
daado...@tauex.tau.ac.il> wrote:

> Hi,
>
>
>
> I have a question about RDKit/tautomers.
>
>
>
> Let's say I have smiles input:
>
>
>
> C[CH]2CCC(=O)C1=C(O)[CH](O)C[CH](O)[CH]12
>
> C[CH]2CCC(O)=C1C(=O)[CH](O)C[CH](O)[CH]12
>
>
>
> Now, if I make this code for each input:
>
>
>
> m = Chem.MolFromSmiles(input)
>
> inchi = Chem.rdinchi.MolToInchi(m)
>
>
>
> I get different InChIs:
>
>
>
>
> InChI=1S/C11H16O4/c1-5-2-3-6(12)10-9(5)7(13)4-8(14)11(10)15/h5,7-9,13-15H,2-4H2,1H3
>
>
> InChI=1S/C11H16O4/c1-5-2-3-6(12)10-9(5)7(13)4-8(14)11(10)15/h5,7-9,12-14H,2-4H2,1H3
>
>
>
> My question is why is it happening. Usually if I enter two tautomers -
> they have the same InChI (like it is supposed to be, according to the
> literature ). What is the difference in this example?
>
>
>
> Thanks,
>
> Benny
>
>
> _______________________________________________
> Rdkit-discuss mailing list
> Rdkit-discuss@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/rdkit-discuss
>
_______________________________________________
Rdkit-discuss mailing list
Rdkit-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/rdkit-discuss

Reply via email to