Have you read http://www.slideshare.net/larsgeorge/hbase-sizing-notes ?
Cheers

On Feb 6, 2014, at 8:47 PM, suresh babu <[email protected]> wrote:

> Hi Stana,
>
> We are trying to find out how many data nodes (including hardware
> configuration detail)should be configured or setup for this requirement
>
> -suresh
>
> On Friday, February 7, 2014, stana <[email protected]> wrote:
>
>> HI suresh babu :
>>
>> how many data nodes do you have?
>>
>>
>> 2014-02-07 suresh babu <[email protected]>:
>>
>>> refreshing the thread,
>>>
>>> Can you please suggest any inputs for the hardware configuration(for the
>>> below mentioned use case).
>>>
>>>
>>> On Wed, Feb 5, 2014 at 10:31 AM, suresh babu <[email protected]>
>>> wrote:
>>>
>>>> Please find the data requirements for our use case below :
>>>>
>>>> Raw data processing
>>>> ----------------------------------
>>>> 1. Data is populated into hdfs , after etl around 3 billion puts per day
>>>> in to hbase
>>>>
>>>> 2. Oldest data after X days to be deleted from hbase
>>>>
>>>> Aggregates processing
>>>> ----------------------------------
>>>> 3 billion reads per day ... Large scan or reads
>>>>
>>>> KV size around 1 KB
>>>> Daily Processing, raw and aggregates, via M/R jobs
>>>> Hive queries in future, but not of immediate focus
>>>> On Feb 5, 2014 12:48 AM, "Vladimir Rodionov" <[email protected]>
>>>> wrote:
>>>>
>>>>> Yes,
>>>>>
>>>>> 1. What is the expected avg and peak load in writes/updates/deletes/reads?
>>>>> 2. What is the average size of a KV?
>>>>> 3. Reads/small scans/medium/large scan %%
>>>>> 4. Do you plan M/R jobs, Hive query?
>>>>>
>>>>>
>>>>> Best regards,
>>>>> Vladimir Rodionov
>>>>> Principal Platform Engineer
>>>>> Carrier IQ, www.carrieriq.com
>>>>> e-mail: [email protected]
>>>>>
>>>>> ________________________________________
>>>>> From: Nick Xie [[email protected]]
>>>>> Sent: Tuesday, February 04, 2014 10:02 AM
>>>>> To: [email protected]
>>>>> Subject: Re: Regarding Hardware configuration for HBase cluster
>>>>>
>>>>> I guess you'd better describe a little bit more about your applications.
>>>>> Does the data increase over the time at all?
>>>>>
>>>>> Nick
>>>>>
>>>>>
>>>>> On Tue, Feb 4, 2014 at 5:22 AM, suresh babu <[email protected]>
>>>>> wrote:
>>>>>
>>>>>> Hi folks,
>>>>>>
>>>>>> We are trying to setup HBase cluster for the following requirement:
>>>>>>
>>>>>> We have to maintain data of size around 800TB,
>>>>>>
>>>>>> For the above requirement,please suggest me the best hardware
>>>>>> configuration details like
>>>>>>
>>>>>> 1)how many disks to consider for machine and the capacity of disks ,for
>>>>>> example, 16/24 disks per node with 1/2TB capacity per each disk
>>>>>>
>>>>>> 2) which compression method is suited for production environment ,
>>>>>> space is not a major limitation , but speed is of prime concern for my
>>>>>> use case
>>>>>>
>>>>>> 3) how many CPU Cores should be configured for each node/machine ? Or
>>>>>> ideal ratio of number of cores to the number of disks,for example
>>>>>> 1core/1disk ?
>>>>>>
>>>>>> Regards,
>>>>>> Kaushik
>>>>>
>>>>> Confidentiality Notice: The information contained in this message,
>>>>> including any attachments hereto, may be confidential and is intended to be
>>>>> read only by the individual or entity to whom this message is addressed. If
>>>>> the reader of this message is not the intended recipient or an agent or
>>>>> designee of the intended recipient, please note that any review, use,
>>>>> disclosure or distribution of this message or its attachments, in any form,
>>>>> is strictly prohibited. If you have received this message in error, please
>>>>> immediat--
>> Best Regards
>>
>> 亦思科技 is-land Systems Inc.
>> Tel:03-5630345 Ext.14
>> Fax:03-5631345
>> e-MAIL:[email protected]
>>
>> 何永安 Yung An He
>>
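
For a rough first pass, the figures quoted in this thread (about 3 billion puts per day, KV size around 1 KB, roughly 800 TB to maintain, 24 disks of 2 TB per node as one of the examples asked about) can be turned into back-of-envelope numbers. This is only a sketch under stated assumptions, not a sizing recommendation: the replication factor of 3 (the HDFS default), the 70% usable-disk fraction, and the reading of 800 TB as data before replication and compression are all assumptions that do not come from the thread, and the estimate ignores per-KV key overhead, peak load, and read traffic.

```python
import math

# Figures taken from the thread.
PUTS_PER_DAY = 3_000_000_000   # ~3 billion puts per day
KV_SIZE_BYTES = 1024           # KV size around 1 KB
TOTAL_DATA_TB = 800            # ~800 TB of data to maintain
DISKS_PER_NODE = 24            # one of the examples asked about (16/24 disks)
DISK_TB = 2                    # one of the examples asked about (1/2 TB disks)

# ASSUMPTIONS, not from the thread.
REPLICATION = 3                # HDFS default replication factor
USABLE_FRACTION = 0.7          # leave headroom for compactions and temp data

SECONDS_PER_DAY = 86_400
BYTES_PER_TB = 1024 ** 4


def estimate():
    """Return rough averages and a minimum node count for storage only."""
    avg_puts_per_sec = PUTS_PER_DAY / SECONDS_PER_DAY
    daily_ingest_tb = PUTS_PER_DAY * KV_SIZE_BYTES / BYTES_PER_TB
    # How many days of ingest fit in the stated total (the "X days" retention).
    retention_days = TOTAL_DATA_TB / daily_ingest_tb
    raw_disk_tb = TOTAL_DATA_TB * REPLICATION
    usable_per_node_tb = DISKS_PER_NODE * DISK_TB * USABLE_FRACTION
    nodes = math.ceil(raw_disk_tb / usable_per_node_tb)
    return {
        "avg_puts_per_sec": avg_puts_per_sec,
        "daily_ingest_tb": daily_ingest_tb,
        "retention_days": retention_days,
        "raw_disk_tb": raw_disk_tb,
        "min_data_nodes": nodes,
    }


if __name__ == "__main__":
    for name, value in estimate().items():
        print(f"{name}: {value:,.1f}")
```

Under these assumptions the storage requirement alone sets a floor on the number of data nodes; the write rate here is an average, so the peak load Vladimir asks about would still need to be checked separately.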
