RDKit-sters,

I'm working with a huge SD file that by all ways I measure it contains
~5,050,000 structures.  (This is an eMolecules dataset.)  In processing the
file, I've run into an odd error.  Even with the following very simple
code, the file seems to be bottomless.  I let it run overnight and I saw
number as high as 42,000,000.

Any ideas?

-Kirk



from rdkit import Chem

sdin = Chem.SDMolSupplier

for i,m in enumerate(sdin):

   if ( i % 100000 == 0 ):
      print 'Structure #' + str(i)
------------------------------------------------------------------------------
All the data continuously generated in your IT infrastructure 
contains a definitive record of customers, application performance, 
security threats, fraudulent activity, and more. Splunk takes this 
data and makes sense of it. IT sense. And common sense.
http://p.sf.net/sfu/splunk-novd2d
_______________________________________________
Rdkit-discuss mailing list
Rdkit-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/rdkit-discuss

Reply via email to