Hi Wendong, The morgan fingerprint algorithm removes redundant atom environments (environments which contain exactly the same atoms/bonds). For example, when looking at valine: [image: image.png] The environments with radius 2 which are centered on atoms 5 and 6 are redundant with the environment of radius 1 which is centered on atom 4, so those environments are not reported in the output.
This is described in more detail in the ECFP paper: Rogers, D.; Hahn, M. “Extended-Connectivity Fingerprints.” *J. Chem. Inf. and Model.* **50**:742-54 (2010). Best, -greg On Fri, Mar 12, 2021 at 4:16 AM Wendong Wang <wendong.w...@sjtu.edu.cn> wrote: > Greetings, > I have a question about morgan fingerprints. The code is pasted at the end > of the email, and please see the attached images for the results. > > For valine molecule, the radius is set to be 2. The dictionary (atom > index, radius) shows all the substructures of all atoms with radius 0 as > fingerprints, and all the substructures of all the atoms with radius 1 as > fingerprints. But there are only a few substructures with radius 2 as > fingerprints. Why so few? > > Thanks. > > Best wishes, > Wendong > > PS. The code is below: > m1 = Chem.MolFromSmiles('CC(C)[C@@H](C(=O)O)N') > di1 = {} > fp1 = AllChem.GetMorganFingerprintAsBitVect(m1, radius = 2, nBits = 2048, > bitInfo = di1) > tu1 = [(m1, x, di1) for x in fp1.GetOnBits()] > Draw.DrawMorganBits(tu1, molsPerRow = 4, legends=[str(x) for x in > fp1.GetOnBits()]) > > _______________________________________________ > Rdkit-discuss mailing list > Rdkit-discuss@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/rdkit-discuss >
_______________________________________________ Rdkit-discuss mailing list Rdkit-discuss@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/rdkit-discuss