We used Gluster as our database filesystem for years. It was slow, and
occasionally got into a split-brain state, but had very few serious
problems. I would make sure that Gluster isn't reporting any errors, and
also that you really have only one qmaster running (this was the cause of
our only really serious problem: Gluster was in split-brain, and someone
got confused and tried to start a qmaster when one was already running).
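
A minimal sketch of those two checks in Python, for illustration only. It
assumes the "gluster" CLI is on PATH, that "gluster volume heal VOLNAME info
split-brain" prints a "Number of entries in split-brain: N" line per brick,
and that passwordless ssh to the qmaster candidate hosts works. The function
names, the volume name and the host names are hypothetical.

```python
import re
import subprocess


def split_brain_entries(heal_output):
    """Sum the per-brick 'Number of entries in split-brain: N' lines."""
    counts = re.findall(r"Number of entries in split-brain:\s*(\d+)", heal_output)
    return sum(int(n) for n in counts)


def hosts_running_qmaster(procs_by_host):
    """Return the hosts whose process-name list includes sge_qmaster."""
    return sorted(h for h, procs in procs_by_host.items() if "sge_qmaster" in procs)


def heal_info(volume):
    """Fetch split-brain info from the Gluster CLI (assumes 'gluster' is on PATH)."""
    cmd = ["gluster", "volume", "heal", volume, "info", "split-brain"]
    return subprocess.run(cmd, capture_output=True, text=True, check=True).stdout


def process_names(host):
    """List process names on a host (assumes passwordless ssh is set up)."""
    cmd = ["ssh", host, "ps", "-e", "-o", "comm="]
    out = subprocess.run(cmd, capture_output=True, text=True, check=True).stdout
    return [line.strip() for line in out.splitlines() if line.strip()]


def check(volume, candidate_hosts):
    """Return a list of problems found; an empty list means both checks passed."""
    problems = []
    entries = split_brain_entries(heal_info(volume))
    if entries:
        problems.append(f"{entries} split-brain entries on volume {volume}")
    running = hosts_running_qmaster({h: process_names(h) for h in candidate_hosts})
    if len(running) != 1:
        problems.append(f"expected exactly one qmaster, found {running}")
    return problems
```

Something like check("sgevol", ["master1", "master2"]) could be run by hand
before starting a qmaster; the parsing is kept separate from the
command calls so it can be tried against saved output first.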

Now we're on GPFS and are much happier. It's overkill for GE, but we already
have it available for our research and home directory filesystems.

On Mon, Jul 09, 2018 at 03:48:20PM +0200, Paul Paul wrote:
> Hello,
> 
> In order to use the shadow master functionality, the SGE local configuration
> files have to be stored on NFS. This is usually done by using a single
> server, which implies a single point of failure.
> 
> If you're using SGE with a distributed file system, can you please indicate 
> which one?
> We tried GlusterFS (version 3.12) with SGE 8.1.9, but jobs were randomly
> killed (after a few days during which everything ran smoothly); going
> back to NFS (4.0) on a single server fixed this behavior.
> 
> Thanks for sharing,
> 
> Paul.
> _______________________________________________
> users mailing list
> users@gridengine.org
> https://gridengine.org/mailman/listinfo/users

-- 
-- Skylar Thompson (skyl...@u.washington.edu)
-- Genome Sciences Department, System Administrator
-- Foege Building S046, (206)-685-7354
-- University of Washington School of Medicine