Hi Leon,

There's not really a "right" answer to your question - it depends on what
you want to calculate.
I personally think it makes more sense to use the heavy atom RMSD (which is
what you get if you remove Hs before calculating the RMSD), particularly if
you are comparing to experiment.

Note that AllChem.GetConformerRMSMatrix() does not take symmetry into
account, so you may not get the correct results.
I just opened a ticket to fix this, but in the meantime if you have
molecules with symmetry-equivalent atoms you are probably better off
generating the conformer RMS matrix manually using GetBestRMS().

Best,
-greg

On Wed, Nov 13, 2019 at 5:17 PM topgunhaides . <sunzhi....@gmail.com> wrote:

> Hi guys,
>
> I am new to RDKit, and have a question about adding and removing Hs.
>
> As recommended in the documentation, hydrogen atoms should be added for
> generating conformers, optimization, etc.
>
> However, for clustering, should the Hs be removed first, before generating
> the conformer RMS matrix? For instance:
>
>
> from rdkit import Chem
> from rdkit.Chem import AllChem, TorsionFingerprints
>
> suppl = Chem.SDMolSupplier('molecule.sdf')
>
> for mol in suppl:
>     mh = Chem.AddHs(mol)
>     cids = AllChem.EmbedMultipleConfs(mh, numConfs=5, maxAttempts=1000,
>                                       pruneRmsThresh=0.5, numThreads=0,
> randomSeed=1)
>     m = Chem.RemoveHs(mh)
>     # RMS matrix
>     rmsmat = AllChem.GetConformerRMSMatrix(m, prealigned=False)
>     # TFD matrix
>     tfdmat = TorsionFingerprints.GetTFDMatrix(m)
>     print(rmsmat)
>     print(tfdmat)
>
>
> Note I remove the Hs before getting RMS and TFD matrices. Both resulting
> matrices are different if I do not remove Hs. The RMS without Hs, in
> general, tend to be smaller than the RMS with Hs. This will in turn affect
> the subsequent clustering result.
>
> Could you guys give me some suggestions?  Thank you!
>
> Best,
> Leon
> _______________________________________________
> Rdkit-discuss mailing list
> Rdkit-discuss@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/rdkit-discuss
>
_______________________________________________
Rdkit-discuss mailing list
Rdkit-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/rdkit-discuss

Reply via email to