Hi,

I have 2 osc out of 3 on an oss which start recovering after mounting them but 
the recovering does not stop. All other disks, the one on this oss and 3 
others on another start with more or less the same message, but after the 
timeout period the recovery is complete.
For this 2 disks nothing happens. 
What can I do to find out what happens and how can I get it to run?

 cat /proc/fs/lustre/obdfilter/testfs-OST0000/recovery_status shows
status: RECOVERING
recovery_start: 1184583617
time remaining: 0
connected_clients: 0/2
completed_clients: 0/2
replayed_requests: 0/??
queued_requests: 0
next_transno: 11634691

in /var/log/messages I found (but nothing in addition after the timout should 
appear)
Jul 16 12:17:40 node kernel: kjournald starting.  Commit interval 5 seconds
Jul 16 12:17:40 node kernel: LDISKFS FS on sda, internal journal
Jul 16 12:17:40 node kernel: LDISKFS-fs: mounted filesystem with ordered data 
mode.
Jul 16 12:17:41 node kernel: Lustre: 28498:0:
(filter.c:784:filter_init_server_data()) RECOVERY: service testfs-OST0000, 2 
recoverable clients, last_rcvd 11634690
Jul 16 12:17:41 node kernel: Lustre: OST testfs-OST0000 now serving dev 
(testfs-OST0000/3025f303-d12e-43be-8cca-0fb5bef69ead), but will be in 
recovery until 2 clients reconnect, or if no clients reconnect for 4:10; 
during that time new clients will not be allowed to connect. Recovery 
progress can be monitored by 
watching /proc/fs/lustre/obdfilter/testfs-OST0000/recovery_status.
Jul 16 12:17:41 node kernel: Lustre: Server testfs-OST0000 on device /dev/sda 
has started


_______________________________________________
Lustre-discuss mailing list
[email protected]
https://mail.clusterfs.com/mailman/listinfo/lustre-discuss

Reply via email to