Hi,? ??Supposing I'd like to matching 100 substructures with 1000 compounds 
represented as smiles.What I did is:
suppl = AllChem.SmilesMolSupplier('allmoleculenew.smi',delimiter='\t')l = 
len(suppl)for j in range(ll): ?# I have to make substructures in the first 
loop.? ??for i in range(l):

? ??? ??suppl[i].GetSubstructMatches(s[j])?and found the performance is not 
good.
Then I did a comparison and found that it was because the conformation of the 
compounds where not initiated.If I use MolFromSmiles,the performance will 
improve a lot.start = time.clock()suppl = 
AllChem.SmilesMolSupplier('allmoleculenew.smi',delimiter='\t')

l=len(suppl)?print time.clock()-start ? # >>>?0.0373735355168 ?indicating that 
the molecules were not initiated.
for i in range(l):

? ??suppl[i].GetSubstructMatches(sa)

? ??suppl[i].GetSubstructMatches(sa2)

print time.clock()-start ? # >>>?11.1884715172
start = time.clock()

f = open('allmoleculenew.smi')

for i in range(l):

? ??mol = Chem.MolFromSmiles(f.next().split('\t')[0])

? ??mol.GetSubstructMatches(sa)

? ??mol.GetSubstructMatches(sa2)print time.clock()-start # >>>?5.44030582111
The second method was double faster than the first, indicating that the "init" 
is more time consuming compared to matching.I think?SmilesMolSupplier is a good 
API to load multiple compounds but it didnot parse the smiles immediately, 
which adds the?time complexity to the further application. So is it possible to 
manually initiate the compounds?


Hongbin Yang 杨弘宾

Research: Toxicophore and Chemoinformatics
Pharmaceutical Science, School of Pharmacy

East China University of Science and Technology?


------------------------------------------------------------------------------
Developer Access Program for Intel Xeon Phi Processors
Access to Intel Xeon Phi processor-based developer platforms.
With one year of Intel Parallel Studio XE.
Training and support from Colfax.
Order your platform today. http://sdm.link/xeonphi
_______________________________________________
Rdkit-discuss mailing list
Rdkit-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/rdkit-discuss

Reply via email to