Hi Mark, Thank you for your answer. You are right, I have the following:
Mon Mar 16 11:02:24 EDT 2015: mounting /dev/gpfs2 Mon Mar 16 11:02:24.296 2015: Command: mount gpfs2 Mon Mar 16 11:02:24.723 2015: Disk failure. Volume gpfs2. rc = 5. Physical volume gpfs2nsd04. Mon Mar 16 11:02:24.724 2015: Disk failure. Volume gpfs2. rc = 5. Physical volume gpfs2nsd12. Mon Mar 16 11:02:24.723 2015: File System gpfs2 unmounted by the system with return code 5 reason code 0 Mon Mar 16 11:02:24.724 2015: Input/output error Mon Mar 16 11:02:24.725 2015: Failed to open gpfs2. Mon Mar 16 11:02:24.724 2015: Input/output error Mon Mar 16 11:02:24.725 2015: Command: err 666: mount gpfs2 Mon Mar 16 11:02:24.724 2015: Input/output error mount: Stale NFS file handle Mon Mar 16 11:02:24 EDT 2015: finished mounting /dev/gpfs2 But it is not mounted. This is after issuing the "mmmount gpfs2" command Mon Mar 16 11:03:52.010 2015: Command: mount gpfs2 Mon Mar 16 11:03:52.234 2015: VERBS RDMA connecting to 172.25.101.2 (heca2-ib0) on mlx4_0 port 1 id 4 Mon Mar 16 11:03:52.235 2015: VERBS RDMA connected to 172.25.101.2 (heca2-ib0) on mlx4_0 port 1 sl 0 Mon Mar 16 11:03:52.381 2015: Command: err 0: mount gpfs2 Now, my next step it to figure out how the clear the failure since it is not failed anymore. The mmlsdisk reports all the nsd are up and ready. One command I'm also looking for is a command to list which nsd server is used for a particular nsd. I have a redundant config for which 2 servers can access the same nsd. But I think that for some of them, the default server is not used and the load of the servers is not distributed as wanted. Richard 2015-03-20 15:24 GMT-04:00 Marc A Kaplan <[email protected]>: > Look in /var/adm/ras/mmfs.log.* -- on a node that should be mounting the > filesystem -- find a file that covers the time when GPFS is starting up > and should be mounting the filesystem. > > Look for clues and/or error messages. If you are not familiar, you may > also want to look at what messages are issued when everything is working > just right. > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at gpfsug.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > -- Richard Lefebvre, Sys-admin, CQ, (514)343-6111 x5313 "Don't Panic" [email protected] -- THGTTG Calcul Quebec (calculquebec.ca) ------ Calcul Canada (computecanada.ca)
_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at gpfsug.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss
