Thanks all.

Lars, in your post you write: "Personally I would place the maximum disk space per machine that can be served exclusively with HBase around 6T". What is the reasoning behind the limit of 6 TB per RS? Does a 20 TB region server (10 disks of 2 TB each) cause any issues?
On Sat, Dec 13, 2014 at 8:58 PM, lars hofhansl <[email protected]> wrote:

> See also:
> http://hadoop-hbase.blogspot.com/2013/01/hbase-region-server-memory-sizing.html
>
> ----- Original Message -----
> From: Nick Dimiduk <[email protected]>
> To: "[email protected]" <[email protected]>
> Cc:
> Sent: Saturday, December 13, 2014 5:37 PM
> Subject: Re: RS disk capacity limits
>
> Correct, you want more spindles/TB. Furthermore, you'll need RAM to use
> that storage effectively. GB RAM/TB of disk will vary depending on your
> access patterns. 20 TB raw storage is about 6.6TB after replication. Even
> with compressed BlockCache and a compression ratio of 0.3, it'll be
> difficult to keep that much data hot.
>
> On Friday, December 12, 2014, 张铎 <[email protected]> wrote:
>
> > I think 10 disks of 2TB each is better.
> > A disk can only seek about 50 times per second, whether it is 10TB or
> > 2TB, so more disks means you can do more seeks and increase the random
> > read/write performance.
> >
> > 2014-12-13 9:04 GMT+08:00 Krishna <[email protected]>:
> >
> > > Hi,
> > >
> > > Is there any logical/practical limit on HBase RS storage size?
> > > Which works better for HBase - a region server with 10 disks that are
> > > each 2 TB or 2 disks that are each 10TB?
> > > I remember, one of the recommendations is to keep each disk on RS to be
> > > less than 6 TB - is that correct?
> > >
> > > Thanks
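
The numbers quoted in the thread can be checked with a few lines of Python. This is only a back-of-the-envelope sketch, not anything from HBase itself: the function names are made up for illustration, a replication factor of 3 is assumed (it matches the "20 TB raw is about 6.6TB" figure), the 50 seeks per second per disk is the figure given in the thread, and the 0.3 compression ratio is read here as compressed size divided by uncompressed size, applied to the post-replication data. That last reading is an assumption.

```python
# Rough sizing arithmetic for the two layouts discussed in the thread.
# All names below are hypothetical helpers, not HBase or HDFS APIs.

def usable_tb(raw_tb, replication=3):
    """Capacity left for unique data after replication (assumed factor 3)."""
    return raw_tb / replication

def aggregate_seeks_per_sec(num_disks, seeks_per_disk=50):
    """Total random seeks per second, assuming about 50 per spindle
    regardless of disk size (the figure quoted in the thread)."""
    return num_disks * seeks_per_disk

def cache_tb_to_hold(data_tb, compression_ratio=0.3):
    """RAM needed to keep data_tb hot in a compressed cache, assuming
    compression_ratio = compressed size / uncompressed size."""
    return data_tb * compression_ratio

if __name__ == "__main__":
    for disks, size_tb in [(10, 2), (2, 10)]:
        raw = disks * size_tb
        data = usable_tb(raw)
        print(f"{disks} x {size_tb} TB: raw={raw} TB, "
              f"after replication={data:.1f} TB, "
              f"seeks/s={aggregate_seeks_per_sec(disks)}, "
              f"cache to hold it all={cache_tb_to_hold(data):.1f} TB")
```

Both layouts hold the same amount of data, but under these assumptions the 10-disk machine offers five times the seek capacity of the 2-disk machine, and keeping the whole data set hot would need RAM on the order of terabytes either way.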
