Re: [Lustre-discuss] Problems with failover

Joe Kraska Thu, 03 Jan 2008 19:59:27 -0800

> You currently need another mechanism (hardware or software RAID) to
> provide data redundancy in case of disk failure.  We are working to
> provide data replication at the Lustre level, but that is not yet
> available.



I should say. That technology has me pretty excited. Right now, unless I
bend over backwards
and do something like "vertical" RAID stripe/mirrors across multiple disk
trays in a storage cluster,
I can end up with a very bad situation if I lose an entire tray. This can
have a potentially devastating
impact on my entire storage tier.

A few companies here and there (XIV, Isilon) are starting to abandon
hardware raid and are doing
block replication across the entire storage cluster. With that, I can forget
worrying about specific
disks (except to replace them), and don't even have to worry about whole
trays (insofar as I have
spare capacity).

This is a pretty neat capability. If you add to it the ability to
"rebalance" your cluster on the fly as
new nodes are added, what you end up with is a self-healing storage cluster.
Pretty compelling
for those availability figures, and can help with the disk-service pattern
as well.

Joe Kraska
San Diego CA
USA

_______________________________________________
Lustre-discuss mailing list
[email protected]
https://mail.clusterfs.com/mailman/listinfo/lustre-discuss

Re: [Lustre-discuss] Problems with failover

Reply via email to