Re: [Beowulf] Putting /home on Lusture of GPFS

Joe Landman Wed, 24 Dec 2014 08:00:07 -0800


On 12/24/2014 10:54 AM, Prentice Bisbal wrote:

Everyone,
Thanks for the feedback you've provided to my query below. I'm gladI'm not the only one who thought of this, and a lot of you raised verygood points I haven't thought about. While I've been followingparallel filesystems for years, I have very little experience actuallymanaging them up to this point. (My BG/P came with GPFS filesystem for/scratch, but everything was already setup before I got here, so I'veonly had to deal with it when something breaks).
You've all convinced me that this may not be an ideal solutionarrangement, but if I go this route, GPFS might be a better fit forthis than Lustre (mainly because Chris Samuels has proven it *is*possible with GPFS, and GPFS has snapshotting).
Joe Landman, as always, has provided a wealth of information, and therest of you have pointed out other potential pitfalls. with thisapproach.

My pleasure ... I do think asking James Cuff, Chris Dwan, and othersrunning/managing big kit (and the teams running the kit), what they aredoing and why would be quite instructive in a bigger picture sense.

Which to a degree suggests that mebbe a devops/best practices BoF ortalk series, or educational workshop at SC15 wouldn't be a bad thing... I'd be happy to submit a proposal for this for this year.


Let me know ...

Thanks again for the feedback, and please keep the conversation going.

Prentice

On 12/23/2014 12:12 PM, Prentice Bisbal wrote:
Beowulfers,
I have limited experience managing parallel filesytems like GPFS orLustre. I was discussing putting /home and /usr/local for my clusteron a GPFS or Lustre filesystem, in addition to using it just for/scratch. I've never done this before, but it doesn't seem like allthat bad an idea. My logic for this is the following:
1. Users often try to run programs from in /home, which leads toerrors, no matter how many times I tell them not to do that. Thiswould make the system more user-friendly. I could use quotas/policiesto encourage them to use 'steer' them to use other filesystems ifneeded.
2. Having one storage system to manage is much better than 3.

3. Profit?
Anyway, another person in the conversation felt that this would bebad, because if someone was running a job that would hammer thefileystem, it would make the filesystem unresponsive, and keep otherpeople from logging in and doing work. I'm not buying this concernfor the following reasons:
If a job can hammer your parallel filesystem so that the login nodesbecome unresponsive, you've got bigger problems, because that meansother jobs can't run on the cluster, and the job hitting thefilesystem hard has probably slowed down to a crawl, too.
I know there are some concerns with the stability of parallelfilesystems, so if someone wants to comment on the dangers of that,too, I'm all ears. I think that the relative instability of parallelfilesystems compared to NFS would be the biggest concern, notperformance.
_______________________________________________
Beowulf mailing list, [email protected] sponsored by Penguin Computing
To change your subscription (digest mode or unsubscribe) visithttp://www.beowulf.org/mailman/listinfo/beowulf


--
Joseph Landman, Ph.D
Founder and CEO
Scalable Informatics, Inc.
email: [email protected]
web  : http://scalableinformatics.com
twtr : @scalableinfo
phone: +1 734 786 8423 x121
cell : +1 734 612 4615

_______________________________________________
Beowulf mailing list, [email protected] sponsored by Penguin Computing
To change your subscription (digest mode or unsubscribe) visit 
http://www.beowulf.org/mailman/listinfo/beowulf

Re: [Beowulf] Putting /home on Lusture of GPFS

Reply via email to