Joseph Farran <[email protected]> writes: [Please don't post content-type: text/html.]
> Hi all. > > We have Grid Engine 8.1.4 running on a cluster with CentOS 6.4, using kernel > 2.6.32-358.18.1. We are just getting started on setting up job checkpoint. > > We got BLCR compiled and are currently testing it. Before we go much > further and setup Grid Engine with it, I like to know if others are using > DMTCP? > > Which one is the better choice, DMTCP or BLCR? Or are folks using both > depending on the need? If you're running MPI, and want checkpointing integrated with it, you probably don't have a choice, certainly not if you don't use TCP. For single node jobs, DMTCP presumably wins by virtue being in user space, but if you already have BLCR for parallel jobs, it's probably simplest to just use that. With BLCR you probably want DKMS packages to keep up with kernel updates. -- Community Grid Engine: http://arc.liv.ac.uk/SGE/ _______________________________________________ users mailing list [email protected] https://gridengine.org/mailman/listinfo/users
