Hi all,
We've got a filer built on consumer hardware running SXCE snv_97,
holding a small (1.4TB) raidz array. It's been going great for the last
6 months or so, but recently its started misbehaving.
We use in-kernel CIFS for most of our needs and it works perfectly when
playing media or mounting backed-up CD images. The problem comes when we
try to explicitly copy something from it.
When you actually try a direct copy via CIFS, HTTP, SSH or FTP, the
transfer has about a 70% chance it will hang the machine. The larger the
file, the more probable it is.
I have no idea how to start investigating this as the network is
inaccessible, the console is frozen and there are no hints left behind
in the logs.
My only way to recover the server is to shutdown (with the soft-off
button on the case) and bootup. I can tell the machine isn't totally
hung as it will apparently do a proper shutdown procedure.
We're really out of ideas and worried that there could be a problem with
our raidz array, even though there are no errors logged concerning it.
As far as we can tell, there is no problem with any data (yet), just the
system itself.
Any ideas how to go about investigating this further?
Many thanks
Matt
_______________________________________________
opensolaris-discuss mailing list
[email protected]