On 09/14/2011 02:02 PM, Per Steffensen wrote:
Hi
If my goal is to have multiple physical disks seem as one big disk with
redundancy built in, why would I use a HDFS cluster among machines with
one disk each, instead of using software RAID like md(adm) directly on
top of the disks? I am looking for pros and cons on the two solutions.
http://en.wikipedia.org/wiki/RAID#Software-based_RAID
http://en.wikipedia.org/wiki/Mdadm
Regards, Per Steffensen
HDFS was never intended to be a general-purpose file system. It is a
system optimized for a) running map/reduce, and b) holding large files.
It should not be considered as a replacement for RAID.
DR