Hi,
I have a vendor fragmentation algorithm and I want to evaluate the presence of
the fragments/substructures in a list of molecules with the RDKit C++ API. In
order to avoid a slow SubstructMatch comparison of n fragments x m molecules I
first SmilesToMol the fragment, generate a fingerprint, calculate the Tversky
similarity with the molecule fingerprint, and only if the value is high a
SubstructMatch is run. This makes the process extremely fast.
The problem I observe is that for many SmilesToMol of the substructures I'm
getting exceptions like
[10:34:46] Can't kekulize mol
[10:34:46] non-ring atom 4 marked aromatic
Is there a way to "force" SmilesToMol to accept the fragment SMILES so that a
fingerprint of the fragment graph can be generated (btw, I don't understand why
a kekulization is performed) even when the fragment is not a complete molecule?
Thanks a lot,
Gonzalo
------------------------------------------------------------------------------
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and
threat landscape has changed and how IT managers can respond. Discussions
will include endpoint security, mobile security and the latest in malware
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
_______________________________________________
Rdkit-discuss mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/rdkit-discuss