On Wed, 31 Dec 2008, Miles O'Neal wrote:

About a month ago, I said:

Our local vendor built us a Supermicro/Adaptec
system with 16x1TB SATA drives.  We have a 12TB
partition that they built as EXT2.  When I tried
to add journaling, it took forever, and then the
system locked up.  On reboot, the FS was still
EXT2, and takes hours (even empty) to fsck.  Based
on the messages flying by I am also not confident
fsck really understands a filesystem this large.

Is the XFS module stable on 5.1 and 5.2?  (The
vendor installed 5.1 because that's what they
have, but I ran "yum update", so it's effectively
5.2.)

I rebuilt the 12TB partition as XFS.  But after about 11GB
of data moved, the system locked up with "bus error".
After reboot, the system looks fine.  The vendor always
runs diags and burns the systems in, though it could
still be a hardware or driver issue.

Is it likely to be the OS/XFS with the large partition,
or would you just send it back for diagnostics again?

I'd send it back:

We have a (growing) number of SL5 systems with large XFS filesystems
(8.9 or 12 TiB), and we haven't seen a single lockup. One of those with
an 8.9 TiB XFS has been up for nine months, and moved 0.3 PB of data
during that time, sometimes under considerable load. It's still running
2.6.18-53.1.14.el5.

Supermicros have been very reliable for us, but between
the Adaptec and 1TB SATAs, and the large partition, I'm
not sure how reliable the current drivers are.

Maybe try an LSI-based controller instead.

I'd hoped to just have one mount point, but could make
2-3 smaller partitions if that seems to be the likely
issue.

Probably not.

Hope this helps,
        Stephan

Thanks,
Miles


--
Stephan Wiesand
  DESY - DV -
  Platanenallee 6
  15738 Zeuthen, Germany
