Hi

I am using rdkit to fetch atom pair information as AtomPairFingerprint. I
am  reading from a list of molecules as individual SDF. Calling
GetHashedAtomPairFingerprintAsBitVect on the object "m" (where m is read as
each molecule), object strings are returned.

trainResi=open(rootpath + 'trialResi.list').readlines()
trainResi = [x.strip() for x in trainResi]
print len(trainResi)

for name in trainResi:
    for ligname in glob.glob(os.path.join(modelDir+name+'.sdf'))[:]:
        m = Chem.MolFromMolFile(ligname, sanitize=False, removeHs=False)

        for atm in m.GetAtoms():
            aid = atm.GetIdx()
            m.UpdatePropertyCache(strict=False)      ##Required because I
want to include charged molecules too!
            fp =
AllChem.GetHashedAtomPairFingerprintAsBitVect(m,maxLength=APLength, \

fromAtoms=[aid])
           print fp

* "fp" are returned as (including only one example) :*
<rdkit.DataStructs.cDataStructs.ExplicitBitVect object at 0x11bfc4d70>

For my code to work further, I am converting my fp into a numpy array,
which returns [0 0 0 ..., 0 0 0] (zero values)
What am I doing wrong?

I am new to rdkit and machine learning, and I am using jupyter notebook.

Thanks
Payal
_______________________________________________
Rdkit-discuss mailing list
Rdkit-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/rdkit-discuss

Reply via email to