I have been using DMTCP successfully for a long-running optim() task. This is a
single-core process running on a large Linux cluster with Slurm as the job
manager. This cluster places an 8-hour limit on individual jobs, and since my
cost function takes 11 minutes to compute, I need many such
Has anyone ever considered what it would take to implement checkpointing in R,
so that long-running processes could be interrupted and resumed later, from a
different process or even a different machine?
Thanks,
Andy
--
Andy Jacobson
andy.jacob...@noaa.gov
NOAA Global Monitoring Lab