Hi Lewis,

This looks odd to me as well, but I don't have  a quick answer to
explain/account for it.
I'll try and take a look in the near future.

-greg


On Wed, Jul 21, 2021 at 11:57 PM Lewis Martin <lewis.marti...@gmail.com>
wrote:

> Hi RDKit,
> How does one input the number of bits to the ShapeTanimotoDist function?
> The docs indicate the default is 
> *rdkit.DataStructs.cDataStructs.DiscreteValueType.TWOBITVALUE,
>  *but I tried some other values and this gave unexpected results.
> Specifically: when increasing to higher bit values, the tanimoto similarity
> gets quite small, whereas I assumed increasing the sampled bits would
> simply improve the precision of the calculation.
>
> example:
> from rdkit import Chem
> from rdkit.DataStructs import TWOBITVALUE, FOURBITVALUE, EIGHTBITVALUE
> from rdkit.Chem.rdShapeHelpers import ShapeTanimotoDist
>
> testmol = Chem.MolFromSmiles('CCCC')
> testmolH = Chem.AddHs(testmol)
> AllChem.EmbedMultipleConfs(testmolH, 2)
> for bits in [TWOBITVALUE, FOURBITVALUE, EIGHTBITVALUE]:
>     print(ShapeTanimotoDist(testmolH, testmolH, 0,1,bitsPerPoint=bits))
>
> output:
>
> 0.38089171974522296
> 0.14635701022642242
> 0.00452852989652238
>
>
>
> Thanks!
> Lewis
> _______________________________________________
> Rdkit-discuss mailing list
> Rdkit-discuss@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/rdkit-discuss
>
_______________________________________________
Rdkit-discuss mailing list
Rdkit-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/rdkit-discuss

Reply via email to