Hello,
Brock Palen wrote:
I was able to catch a client and server in the act:
client dmesg:
eth0: no IPv6 routers present
Lustre: nobackup-MDT-mdc-01012bd39800: Connection to service
nobackup-MDT via nid [EMAIL PROTECTED] was lost; in progress
operations using this service
If client get eviction from the server, it might be triggered by
1) server did not get client pinger msg in a long time.
2) client is too busy to handle the server lock cancel req.
Clients show a load of 4.2 (4 cores total, 1 process per core).
3) client cancel the lock, but the network
Hi Brock,
On Monday 04 February 2008 07:11:11 am Brock Palen wrote:
on our cluster that has been running lustre for about 1 month. I have
1 MDT/MGS and 1 OSS with 2 OST's.
Our cluster uses all Gige and has about 608 nodes 1854 cores.
This seems to be a lot of clients for only one OSS (and