[ 
https://issues.apache.org/jira/browse/HBASE-11165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14115524#comment-14115524
 ] 

stack commented on HBASE-11165:
-------------------------------

bq. ..by the way, could somebody estimate the distribution or read vs writes to 
meta table, in terms of requests per second/networking traffic/disk access?

[~toffer] or [~virag] Do you have any numbers from your experiments?

bq. I believe there are 2 dimensions here, right?

There is a more immediate 3rd dimension/option and that is what we have now 
where we have a single master and if it fails, backup assumes its role.

For your dimension 1., (also known as  "HBASE-10296 Replace ZK with a consensus 
lib(paxos,zab or raft running within master processes to provide better master 
failover performance and state consistency"), it would be a nice-to-have but if 
we can, lets try and avoid our having to have a HA master. As long as the 
master comes back inside some reasonable window and the cluster can keep on 
chugging while its figuring out its recovery, this would be a simpler deploy 
than one that needs a HA master mini-cluster.  Multi-master would help scale 
reads but as you note, doesn't help if one massive meta region and we want to 
cache it.

We'd go to partitioned masters, as I see it, if we can't make one master do.

bq. The viable combined solution ...

I suggest we look at single master with split meta served by regionservers 
first?

> Scaling so cluster can host 1M regions and beyond (50M regions?)
> ----------------------------------------------------------------
>
>                 Key: HBASE-11165
>                 URL: https://issues.apache.org/jira/browse/HBASE-11165
>             Project: HBase
>          Issue Type: Brainstorming
>            Reporter: stack
>         Attachments: HBASE-11165.zip, Region Scalability test.pdf, 
> zk_less_assignment_comparison_2.pdf
>
>
> This discussion issue comes out of "Co-locate Meta And Master HBASE-10569" 
> and comments on the doc posted there.
> A user -- our Francis Liu -- needs to be able to scale a cluster to do 1M 
> regions maybe even 50M later.  This issue is about discussing how we will do 
> that (or if not 50M on a cluster, how otherwise we can attain same end).
> More detail to follow.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to