[jira] [Commented] (HBASE-11165) Scaling so cluster can host 1M regions and beyond (50M regions?)

stack (JIRA) Mon, 18 Aug 2014 21:27:41 -0700

    [ 
https://issues.apache.org/jira/browse/HBASE-11165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14101827#comment-14101827
 ]


stack commented on HBASE-11165:
-------------------------------

On the nice attached doc:

{code}
Thoughts:
Advantage       of      zk      less    assignment
1)      Better  API's   (ls     on      znode   vs      scan    table)
2)      Scalability     (If     META    is      split)
                        a)      Read/Write      throughput
                        b)      Storage (1M     child   znodes  vs      1M      
rows    in      META)
{code}

Can I have some pointers on how to read the above.  Zk-less AM is better 
because you scan a table -- you don't have to ls znodes?  What is the 1M znodes 
vs 1M rows about in above?

[~toffer] Is the above the basis for your "...As our experiments shows 
splitting is a must for scaling."?  If split meta, then more read/write 
throughput?  Because the meta table could be served by many machines so field 
more reads/writes?   The reads/writes are needed at starttime or during cluster 
lifetime in your judgement?  Thanks.

[~virag] Is the shrink in time from 12 mins to 5mins on zk-less assign or 
forceSync=no? (or for both)?

[~enis]
bq. Do we need to bring root back? Can't we simply host all of root in 
zookeeper? We expect the number of meta regions in the tens / hundreds case 
right?
 
Root access would be different to the access of any other table if we went this 
route. Everyone -- all clients, etc. -- would need to know how to do this new 
access type.  How would you do it in zk anyways?  Would be a single znode with 
all of root in it? Root would have zk enforced limits. Might be ok for a small 
table that changes infrequently. Not sure about znode with 1k rows in it 
(history, replicas? etc.). We'd have to test.

[~lhofhansl]
bq. We need a bootstrapping mechanism to find root anyway...

Yeah. There is zk now. Elsewhere, a quorum of masters has been proposed; you'd 
go to the master to figure where everything is.  That'd be a big change. 2.0.x.

> Scaling so cluster can host 1M regions and beyond (50M regions?)
> ----------------------------------------------------------------
>
>                 Key: HBASE-11165
>                 URL: https://issues.apache.org/jira/browse/HBASE-11165
>             Project: HBase
>          Issue Type: Brainstorming
>            Reporter: stack
>         Attachments: HBASE-11165.zip, Region Scalability test.pdf, 
> zk_less_assignment_comparison_2.pdf
>
>
> This discussion issue comes out of "Co-locate Meta And Master HBASE-10569" 
> and comments on the doc posted there.
> A user -- our Francis Liu -- needs to be able to scale a cluster to do 1M 
> regions maybe even 50M later.  This issue is about discussing how we will do 
> that (or if not 50M on a cluster, how otherwise we can attain same end).
> More detail to follow.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Commented] (HBASE-11165) Scaling so cluster can host 1M regions and beyond (50M regions?)

Reply via email to