Many thanks for your replies

Triggering gc doesn't seem to help (though the object count goes down) 
but reducing how much is processed at a time does. I actually didn't 
have to go down to a molecule-at-a-time level but made due with inputs 
half of the previous size. The RAM still fills up after a few files but 
then stays that way without overflowing the swap and crashing things.

Cheers,
Adam

On 27-Jun-15 16:54, Dmitri Maziuk wrote:
> On 6/27/2015 5:45 AM, Greg Landrum wrote:
> ...
>> That's assuming that the futures code (which I don't know) isn't doing
>> anything odd behind the scenes to hold onto references.
> Or every mol in supplier holds a pointer to c++ dll that python vm
> doesn't quite know how to garbage-collect, which keeps a
> still-referenced object inside the job, which means jobs=[] creates a
> new array without deleting the old jobs. Who knows.
>
> The first answer was the right one: process one molecule at a time. Even
> better, split the input file into one-per-molecule, then use ec2 or
> condor or osg to run your one-at-a-time script on all of them at once.
>
> Dima
>
>
> ------------------------------------------------------------------------------
> Monitor 25 network devices or servers for free with OpManager!
> OpManager is web-based network management software that monitors
> network devices and physical & virtual servers, alerts via email & sms
> for fault. Monitor 25 devices for free with no restriction. Download now
> http://ad.doubleclick.net/ddm/clk/292181274;119417398;o
> _______________________________________________
> Rdkit-discuss mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/rdkit-discuss


------------------------------------------------------------------------------
Monitor 25 network devices or servers for free with OpManager!
OpManager is web-based network management software that monitors 
network devices and physical & virtual servers, alerts via email & sms 
for fault. Monitor 25 devices for free with no restriction. Download now
http://ad.doubleclick.net/ddm/clk/292181274;119417398;o
_______________________________________________
Rdkit-discuss mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/rdkit-discuss

Reply via email to