Re: [gpfsug-discuss] metadata vdisks on fusionio.. doable?

Salvatore Di Nardo Sat, 11 Oct 2014 02:37:58 -0700

Thanks for your answer.

Yes, the idea is to have 3 servers in 3 different failure groups, each of them with a drive, and to set 3 metadata replicas as the default.

I had not considered that the vdisks could be off after a reboot or failure, so that's a good point. But after a failure or even a standard reboot, the server and the cluster have to be checked anyway, and I always check the vdisk status, so no big deal.

Your answer also made me consider another thing... Once I put them back online, will they be restriped automatically, or should I run 'mmrestripefs' every time to verify/correct the replicas?

I understand that using local disks sounds strange. In fact, our first idea was just to add some SSDs to the shared storage, but then we considered that the SAS cable could be a huge bottleneck. The cost difference is not huge, and the FusionIO locally on the server would make the metadata just fly.



On 10/10/14 17:02, Sanchez, Paul wrote:

Hi Salvatore,
We've done this before (non-shared metadata NSDs with GPFS 4.1) and noted these constraints:
* Filesystem descriptor quorum: since it will be easier to have a metadata disk go offline, it's even more important to have three failure groups with FusionIO metadata NSDs in two, and at least a desc_only NSD in the third one. You may even want to explore having three full metadata replicas on FusionIO. (Or perhaps if your workload can tolerate it the third one can be slower but in another GPFS "subnet" so that it isn't used for reads.)
* Make sure to set the correct default metadata replicas in your filesystem, corresponding to the number of metadata failure groups you set up. When a metadata server goes offline, it will take the metadata disks with it, and you want a replica of the metadata to be available.
* When a metadata server goes offline and comes back up (after a maintenance reboot, for example), the non-shared metadata disks will be stopped. Until those are brought back into a well-known replicated state, you are at risk of a cluster-wide filesystem unmount if there is a subsequent metadata disk failure. But GPFS will continue to work, by default, allowing reads and writes against the remaining metadata replica. You must detect that disks are stopped (e.g. mmlsdisk) and restart them (e.g. with mmchdisk <fs> start -a).
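As a rough sketch of the detection step in the last point, the Python below scans captured mmlsdisk output for disks whose availability is not "up" and builds the restart command mentioned above. The column layout, the sample text, the disk names and the filesystem name gpfs0 are all assumptions made for illustration, not taken from this thread; check them against the output of your own GPFS release before relying on it.

```python
# Illustrative sample only: the layout and the disk names are assumed,
# not captured from a real cluster.
SAMPLE = """\
disk         driver   sector     failure holds    holds                            storage
name         type       size       group metadata data  status        availability pool
------------ -------- ------ ----------- -------- ----- ------------- ------------ ------------
meta_fio1    nsd         512           1 yes      no    ready         up           system
meta_fio2    nsd         512           2 yes      no    ready         down         system
meta_fio3    nsd         512           3 yes      no    ready         up           system
"""


def stopped_disks(mmlsdisk_output):
    """Return the names of disks whose availability column is not 'up'.

    Assumes whitespace-separated columns with availability in the eighth
    position. A multi-word status value would shift the columns, so treat
    this as a starting point rather than a robust parser.
    """
    stopped = []
    for line in mmlsdisk_output.splitlines():
        fields = line.split()
        # Skip blank lines, the two header lines and the dashed separator.
        if len(fields) < 8:
            continue
        if fields[0] in ("disk", "name") or set(fields[0]) == {"-"}:
            continue
        name, availability = fields[0], fields[7]
        if availability != "up":
            stopped.append(name)
    return stopped


def restart_command(fs):
    """Build the argument list for 'mmchdisk <fs> start -a'."""
    return ["mmchdisk", fs, "start", "-a"]


if __name__ == "__main__":
    down = stopped_disks(SAMPLE)
    if down:
        print("stopped disks:", ", ".join(down))
        print("suggested command:", " ".join(restart_command("gpfs0")))
```

The sketch only prints the suggested command instead of running it, so an operator can still look at the cluster state before restarting disks.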
I haven't seen anyone "recommend" running non-shared disk like this, and I wouldn't do this for things which can't afford to go offline unexpectedly and require a little more operational attention. But it does appear to work.
Thx
Paul Sanchez
*From:* [email protected] [mailto:[email protected]] *On Behalf Of* Salvatore Di Nardo
*Sent:* Thursday, October 09, 2014 8:03 AM
*To:* gpfsug main discussion list
*Subject:* [gpfsug-discuss] metadata vdisks on fusionio.. doable?

Hello everyone,
Suppose we want to build a new GPFS storage system using SAN-attached storage, but instead of putting metadata on shared storage, we want to use FusionIO PCI cards locally on the servers to speed up metadata operations (http://www.fusionio.com/products/iodrive) and, for reliability, replicate the metadata across all the servers. Will this work in case of server failure?
To make it clearer: if a server fails, I will also lose a metadata vdisk. Is the replica mechanism reliable enough to avoid metadata corruption and loss of data?
Thanks in advance
Salvatore Di Nardo




_______________________________________________
gpfsug-discuss mailing list
gpfsug-discuss at gpfsug.org
http://gpfsug.org/mailman/listinfo/gpfsug-discuss
