Matt Harrison wrote:
Hi all,

We've got a filer built on consumer hardware running SXCE snv_97, holding a small (1.4TB) raidz array. It's been going great for the last 6 months or so, but recently its started misbehaving.

We use in-kernel CIFS for most of our needs and it works perfectly when playing media or mounting backed-up CD images. The problem comes when we try to explicitly copy something from it.

When you actually try a direct copy via CIFS, HTTP, SSH or FTP, the transfer has about a 70% chance it will hang the machine. The larger the file, the more probable it is.

I have no idea how to start investigating this as the network is inaccessible, the console is frozen and there are no hints left behind in the logs.

My only way to recover the server is to shutdown (with the soft-off button on the case) and bootup. I can tell the machine isn't totally hung as it will apparently do a proper shutdown procedure.

We're really out of ideas and worried that there could be a problem with our raidz array, even though there are no errors logged concerning it. As far as we can tell, there is no problem with any data (yet), just the system itself.

Any ideas how to go about investigating this further?

I hate to pester but I'm surprised no-one has any ideas on this. We are constantly worried about what the side-effects of the server hanging might be, and of course it is decreasing the availability of our data.

Thanks

Matt
_______________________________________________
opensolaris-discuss mailing list
[email protected]

Reply via email to