[jira] [Comment Edited] (HBASE-12233) Bring back root table

2018-08-23 Thread Francis Liu (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-12233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16590617#comment-16590617
 ] 

Francis Liu edited comment on HBASE-12233 at 8/23/18 6:06 PM:
--

Thanks for the interest in this [~stack]. We have been running with root+split 
meta since 0.98 and now 1.3. We've done all the work you mentioned except for 
the facade you mentioned (and prolly more). We went with the approach 
[~apurtell] suggested (a fork in the code to detect wether root or meta is the 
starting table something you and I also talked about many moons ago. If you are 
interested we can push up the 1.3 code into a feature branch. I've have a 
partially working master branch port but it's I got distractraced getting 1.3 
stable internally. Now I believe I have time, would you guys have time to look 
at a master patch?


was (Author: toffer):
Thanks for the interest in this [~stack]. We have been running with root+split 
meta since 0.98 and now 1.3. We've done all the work you mentioned except for 
the facade you mentioned (and prolly more). We went with the approach 
[~apurtell] suggested (a fork in the code to detect wether root or meta is the 
starting table something we also talked about many moons ago. If you are 
interested we can push up the 1.3 code into a feature branch. I've have a 
partially working master branch port but it's I got distractraced getting 1.3 
stable internally. Now I believe I have time, would you guys have time to look 
at a master patch?

> Bring back root table
> -
>
> Key: HBASE-12233
> URL: https://issues.apache.org/jira/browse/HBASE-12233
> Project: HBase
>  Issue Type: Sub-task
>Reporter: Virag Kothari
>Assignee: Virag Kothari
>Priority: Major
> Attachments: HBASE-12233.patch
>
>
> First step towards splitting meta.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Comment Edited] (HBASE-12233) Bring back root table

2014-10-10 Thread Elliott Clark (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-12233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14167927#comment-14167927
 ] 

Elliott Clark edited comment on HBASE-12233 at 10/11/14 1:45 AM:
-

So right now I don't see the need for split meta. We found a 5x win on meta 
versions. This alone means that we can handle lots more regions per cluster 
with out breaking a sweat. And there's lots more that can be done without 
hurting the default case.

* We haven't tried meta region replicas.
** This is where we will see the biggest wins. Right now meta gets 2x more 
reads than writes on real production clusters.
** We already have the region replica stuff being worked on
** All reads to meta can already be stale so timeline consistent is good enough 
for 99% of reads that are currently going to meta.
* We haven't tried allowing a shared cache infront of meta.
** It would be easy to put memcache (or redis if you already have it) in the 
MetaTableAccessor and that would allow a very fast shared cache that gets 
warmed by all clients.
* We haven't tried a more compact meta table representation
* We haven't tried picking smaller split keys.
** Right now we just pick a mid point. It would be possible to do something 
like our index key generation to drastically reduce the size of region names.
* We haven't tried stripe compactions on meta.
* We haven't gotten block encoding working for meta.

Each of those things are largish wins that haven't been tried, and each of 
those things helps every user. While split meta helps maybe 0.1% of HBase's 
users.

Until the less invasive more positive options  are tried and shown to not be 
enough, it just feels like everyone is trying to split meta without a concrete 
need.

Splitting meta doesn't make us able to scale to the sky. The NN is still a 
bottle neck on cluster size. NameNodes fall over way before we will need a 
split meta.  Even if someone gets NN to scale to infinity, the network becomes 
a bottleneck. HDFS clusters can only get so large before they need to be 
federated (and HBase doesn't tolerate federation). So making every user and 
every cluster feel the pain of increased complexity and mttr so that we the 
HBase developers can perform a thought experiment is just not something I see 
the benefit in right now.

I am for sure against bringing root back in for branch-1 ( -1 ). It's just way 
too late in the stabilization period to not set us way back.


was (Author: eclark):
So right now I don't see the need for split meta. We found a 5x win on meta 
versions. This alone means that we can handle lots more regions per cluster 
with out breaking a sweat. And there's lots more that can be done without 
hurting the default case.

* We haven't tried meta region replicas.
** This is where we will see the biggest wins. Right now meta gets 2000x more 
reads than writes on real production clusters.
** We already have the region replica stuff being worked on
** All reads to meta can already be stale so timeline consistent is good enough 
for 99% of reads that are currently going to meta.
* We haven't tried allowing a shared cache infront of meta.
** It would be easy to put memcache (or redis if you already have it) in the 
MetaTableAccessor and that would allow a very fast shared cache that gets 
warmed by all clients.
* We haven't tried a more compact meta table representation
* We haven't tried picking smaller split keys.
** Right now we just pick a mid point. It would be possible to do something 
like our index key generation to drastically reduce the size of region names.
* We haven't tried stripe compactions on meta.
* We haven't gotten block encoding working for meta.

Each of those things are largish wins that haven't been tried, and each of 
those things helps every user. While split meta helps maybe 0.1% of HBase's 
users.

Until the less invasive more positive options  are tried and shown to not be 
enough, it just feels like everyone is trying to split meta without a concrete 
need.

Splitting meta doesn't make us able to scale to the sky. The NN is still a 
bottle neck on cluster size. NameNodes fall over way before we will need a 
split meta.  Even if someone gets NN to scale to infinity, the network becomes 
a bottleneck. HDFS clusters can only get so large before they need to be 
federated (and HBase doesn't tolerate federation). So making every user and 
every cluster feel the pain of increased complexity and mttr so that we the 
HBase developers can perform a thought experiment is just not something I see 
the benefit in right now.

I am for sure against bringing root back in for branch-1 ( -1 ). It's just way 
too late in the stabilization period to not set us way back.

 Bring back root table
 -

 Key: HBASE-12233
 URL: https://issues.apache.org/jira/browse/HBASE-12233