Hi RDKit,
How should one set the number of bits per point (bitsPerPoint) in the ShapeTanimotoDist function?
The docs indicate the default is
rdkit.DataStructs.cDataStructs.DiscreteValueType.TWOBITVALUE,
but I tried some other values and got unexpected results.
Specifically: when increasing to higher bit values, the returned Tanimoto
distance gets quite small, whereas I assumed increasing the number of bits
would simply improve the precision of the calculation.

example:
from rdkit import Chem
from rdkit.Chem import AllChem
from rdkit.DataStructs import TWOBITVALUE, FOURBITVALUE, EIGHTBITVALUE
from rdkit.Chem.rdShapeHelpers import ShapeTanimotoDist

# Build butane with explicit hydrogens and embed two conformers (IDs 0 and 1)
testmol = Chem.MolFromSmiles('CCCC')
testmolH = Chem.AddHs(testmol)
AllChem.EmbedMultipleConfs(testmolH, 2)

# Shape Tanimoto distance between conformers 0 and 1 for each bitsPerPoint value
for bits in [TWOBITVALUE, FOURBITVALUE, EIGHTBITVALUE]:
    print(ShapeTanimotoDist(testmolH, testmolH, 0, 1, bitsPerPoint=bits))

output:

0.38089171974522296
0.14635701022642242
0.00452852989652238



Thanks!
Lewis