I am not a chemist, so it can be a silly question, but I am interested in
the logic behind it, also because other libraries (like OpenBabel) behave
differently.

Why sometimes RDKit writes hydrogens explicitly?

mol = rdkit.MolFromSmiles('CCS=O', sanitize=False)
rdkit.MolToSmiles(mol)
'CC[SH]=O'

The input SMILES is intended as a pattern, not a molecule. I make a mol out
of it only to get the canonical SMILES, that will be then used as SMARTS.
Logically, I don't understand how the number of H attached to the S can be
"guessed" by the library, still it cannot be left implicit.

Furthermore, I have seen this behaviour only with S and P. I was wondering
if it's a confined issue, or it can happen with any element.
Thank you
_______________________________________________
Rdkit-discuss mailing list
Rdkit-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/rdkit-discuss

Reply via email to