Correct, you want more spindles/TB. Furthermore, you'll need RAM to use that storage effectively. The GB of RAM per TB of disk will vary depending on your access patterns. 20 TB of raw storage is about 6.6 TB after 3x replication. Even with a compressed BlockCache and a compression ratio of 0.3, it'll be difficult to keep that much data hot.
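As a rough back-of-the-envelope sketch of the numbers in this thread (not a sizing tool): the function names below are made up for illustration, the replication factor of 3 is inferred from the 20 TB to 6.6 TB figure, the 50 seeks per second per disk is the estimate from the quoted message, and I'm assuming "compression ratio of 0.3" means compressed size is 0.3 times the original.

```python
# Back-of-the-envelope sizing for one region server.
# All helper names are hypothetical; the inputs come from the thread.

def usable_tb(raw_tb, replication=3):
    """Logical data that fits in raw_tb of disk after replication."""
    return raw_tb / replication

def cache_tb_needed(data_tb, compression_ratio):
    """Cache needed to keep data_tb hot, assuming compressed size
    equals data_tb * compression_ratio."""
    return data_tb * compression_ratio

def aggregate_seeks_per_sec(disks, seeks_per_disk=50):
    """Total random seeks per second across all spindles."""
    return disks * seeks_per_disk

data_tb = usable_tb(20)                   # 20 TB raw, 3x replication
cache_tb = cache_tb_needed(data_tb, 0.3)  # compressed BlockCache estimate

print(f"usable data: {data_tb:.1f} TB")
print(f"cache to keep it all hot: {cache_tb:.1f} TB")
print(f"10 x 2 TB disks: {aggregate_seeks_per_sec(10)} seeks/s")
print(f"2 x 10 TB disks: {aggregate_seeks_per_sec(2)} seeks/s")
```

Same raw capacity either way, but the 10-disk layout gets five times the seek budget, and in both cases the cache needed to hold everything is far beyond what a single server's RAM normally offers.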

On Friday, December 12, 2014, 张铎 <[email protected]> wrote:

> I think 10 disks each 2TB is better.
> A disk can only seek about 50 times per second no matter it is 10TB or 2TB,
> so more disks means you can do more seeks and increase the random read
> write performance.
>
> 2014-12-13 9:04 GMT+08:00 Krishna <[email protected]>:
> >
> > Hi,
> >
> > Is there any logical/practical limit on HBase RS storage size?
> > Which works better for HBase - a region server with 10 disks that are each
> > 2 TB or 2 disks that are each 10TB?
> > I remember, one of the recommendations is to keep each disk on RS to be
> > less than 6 TB - is that correct?
> >
> > Thanks
