RDKit-sters,
I'm working with a huge SD file that by all ways I measure it contains
~5,050,000 structures. (This is an eMolecules dataset.) In processing the
file, I've run into an odd error. Even with the following very simple
code, the file seems to be bottomless. I let it run overnight and I saw
number as high as 42,000,000.
Any ideas?
-Kirk
from rdkit import Chem
sdin = Chem.SDMolSupplier
for i,m in enumerate(sdin):
if ( i % 100000 == 0 ):
print 'Structure #' + str(i)
------------------------------------------------------------------------------
All the data continuously generated in your IT infrastructure
contains a definitive record of customers, application performance,
security threats, fraudulent activity, and more. Splunk takes this
data and makes sense of it. IT sense. And common sense.
http://p.sf.net/sfu/splunk-novd2d
_______________________________________________
Rdkit-discuss mailing list
Rdkit-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/rdkit-discuss