We had an OSS lock up, and it had to be reset. Heartbeat failed to mount one of the OSTs and unmounted all of its local OSTs.
I'm trying to run mount on one of the OSTs (ost08), but it claims it's mounted when it is not. I have other OSTs mounted, so I can't remove the driver right now. Any ideas?

Red Hat 5.3

[r...@oss-0-0 ~]# uname -a
Linux oss-0-0.local 2.6.18-128.7.1.el5_lustre.1.8.1.1 #1 SMP Tue Oct 6 05:48:57 MDT 2009 x86_64 x86_64 x86_64 GNU/Linux

[r...@oss-0-0 ~]# mount | grep ost
/dev/dsk/ost12 on /mnt/scratch/ost12 type lustre (rw)
/dev/dsk/ost16 on /mnt/scratch/ost16 type lustre (rw)
/dev/dsk/ost20 on /mnt/scratch/ost20 type lustre (rw)
/dev/dsk/ost00 on /mnt/scratch/ost00 type lustre (rw)
/dev/dsk/ost04 on /mnt/scratch/ost04 type lustre (rw)
/dev/dsk/ost110 on /mnt/scratch/ost110 type lustre (rw)

[r...@oss-0-0 ~]# umount -f /mnt/scratch/ost08
umount2: Invalid argument
umount: /mnt/scratch/ost08: not mounted

[r...@oss-0-0 ~]# e2fsck -n /dev/dsk/ost08 | tee /state/partition1/e2fsck-n.ost08_`date '+%m.%d.%y-%H:%M:%S'`.log
e2fsck 1.41.6.sun1 (30-May-2009)
device /dev/sdj mounted by lustre per /proc/fs/lustre/obdfilter/scratch-OST0018/mntdev
Warning! /dev/dsk/ost08 is mounted.
Warning: skipping journal recovery because doing a read-only filesystem check.
scratch-OST0018 contains a file system with errors, check forced.
Pass 1: Checking inodes, blocks, and sizes
Pass 2: Checking directory structure
Pass 3: Checking directory connectivity
Pass 4: Checking reference counts
Pass 5: Checking group summary information
Free blocks count wrong (895594326, counted=895594327).
Fix? no

scratch-OST0018: 3363862/244015104 files (1.2% non-contiguous), 80436992/976031318 blocks
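
In case it helps anyone compare the two views, here is a rough sketch of how the devices Lustre itself still claims could be listed next to the mount output above. The path comes from the e2fsck message. The function name list_lustre_mntdevs is made up for illustration, and I'm assuming every target directory under obdfilter has a readable mntdev file like the one e2fsck reported.

```shell
#!/bin/sh
# Hypothetical helper (not a Lustre tool): print "<target> <device>" for
# every obdfilter target that exposes a mntdev file.
# $1 = obdfilter directory; the default is taken from the e2fsck message
# and is assumed to have one subdirectory per OST target.
list_lustre_mntdevs() {
    obd_dir=${1:-/proc/fs/lustre/obdfilter}
    for f in "$obd_dir"/*/mntdev; do
        # Skip the unexpanded glob or unreadable entries.
        [ -r "$f" ] || continue
        target=$(basename "$(dirname "$f")")
        printf '%s %s\n' "$target" "$(cat "$f")"
    done
}
```

Running list_lustre_mntdevs with no argument and checking the result against `mount | grep ost` should show any target, such as scratch-OST0018 here, that Lustre still holds even though it has no entry in the mount table.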
