[
https://issues.apache.org/jira/browse/HBASE-14090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14628600#comment-14628600
]
Matteo Bertozzi commented on HBASE-14090:
-----------------------------------------
[~eclark] it does not need to be "meta" as "meta" used by the client. logically
we have 3 things to place somewhere
Assignment - RegionId: [server=host.xyz, skey=xyz, table=xyz, …]
Region Manifests - RegionId: [family:[storefile, …], …]
HFile Refs - HFileId: Ref-Count
all of them can be splitted without problems because aside the master startup
that does a "scan assignment" you only do point put and get to update the
states. if they end up in the same table (e.g. meta) or separated that's a
different discussion.
> Redo FS layout; let go of tables/regions/stores directory hierarchy in DFS
> --------------------------------------------------------------------------
>
> Key: HBASE-14090
> URL: https://issues.apache.org/jira/browse/HBASE-14090
> Project: HBase
> Issue Type: Sub-task
> Reporter: stack
>
> Our layout as is won't work if 1M regions; e.g. HDFS will fall over if
> directories of hundreds of thousands of files. HBASE-13991 (Humongous Tables)
> would address this specific directory problem only by adding subdirs under
> table dir but there are other issues with our current layout:
> * Our table/regions/column family 'facade' has to be maintained in two
> locations -- in master memory and in the hdfs directory layout -- and the
> farce needs to be kept synced or worse, the model management is split between
> master memory and DFS layout. 'Syncing' in HDFS has us dropping constructs
> such as 'Reference' and 'HalfHFiles' on split, 'HFileLinks' when archiving,
> and so on. This 'tie' makes it hard to make changes.
> * While HDFS has atomic rename, useful for fencing and for having files
> added atomically, if the model were solely owned by hbase, there are hbase
> primitives we could make use of -- changes in a row are atomic and
> coprocessors -- to simplify table transactions and provide more consistent
> views of our model to clients; file 'moves' could be a memory operation only
> rather than an HDFS call; sharing files between tables/snapshots and when it
> is safe to remove them would be simplified if one owner only; and so on.
> This is an umbrella blue-sky issue to discuss what a new layout would look
> like and how we might get there. I'll follow up with some sketches of what
> new layout could look like that come of some chats a few of us have been
> having. We are also under the 'delusion' that move to a new layout could be
> done as part of a rolling upgrade and that the amount of work involved is not
> gargantuan.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)