Hello,
On 07/18/11 21:43, Uma Maheswara Rao G 72686 wrote:
Hi,
We have already thoughts about it.
No, I think we are talking about different problems. What I'm talking
about is how to reduce the number of replica while still achieving the
same data reliability. The replica of data can already
!
*
- Original Message -
From: Da Zheng
Date: Tuesday, July 19, 2011 9:23 am
Subject: Re: replicate data in HDFS with smarter encoding
To: common-u...@hadoop.apache.org
Cc: Joey Echeverria , "hdfs-user@hadoop.apache.org"
> So this kind of feature is desired by
So this kind of feature is desired by the community?
It seems this implementation can only reduce the data size on the disk
by the background daemon RaidNode, but it cannot reduce the disk
bandwidth and network bandwidth when the client writes data to HDFS. It
might be more interesting to redu
Facebook contributed some code to do something similar called HDFS RAID:
http://wiki.apache.org/hadoop/HDFS-RAID
-Joey
On Jul 18, 2011, at 3:41, Da Zheng wrote:
> Hello,
>
> It seems that data replication in HDFS is simply data copy among nodes. Has
> anyone considered to use a better encodi