[
https://issues.apache.org/jira/browse/HBASE-14623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15206757#comment-15206757
]
Stephen Yuan Jiang commented on HBASE-14623:
--------------------------------------------
[~stack], you are right that we need to add tests (and run chaos monkey tests)
to see the benefit of this change before committing the change.
Since you have some concern on this JIRA, let me give you some background and
explanation (based on customer issue and internal chaos monkey tests):
Today, we have a dedicated WAL for META system table because we don't want to
other regions to wait for META to recovery and want to make sure that META
region recovers first. That change is about 3 years ago. Since then, we saw
usage of HBASE exploded and we have more system table that are also very
important. From time to time (in product env), we saw log split takes long
time or even fails due to resource limitation. If a system table region is in
the part of log in the split (or in the log that are waiting to be split), the
functionality of the cluster would be affected. Therefore, we thought of
dedicated WAL for system table approach to speed up system table recovery.
In terms of benefit to region assignment, we have already had special case for
META region assignment in the Assignment manager code (for user regions, if
assignment fails after some retry, it would give up and queue it and come back
to try again later; for meta region, the retry is forever until assignment
succeeds). From internally chaos money testing, we often see that a system
table region fails after maximum retries (the number is configurable, 10 is the
default max retry attempts), without system table regions online, we cannot do
a lot of thing and sometime test fails/times out. At that time, there is some
discussion on whether to give system table the infinite retry treatment as META
region. When I worked on designing new proc-v2 based assignment, I exchanged
this idea with [~mbertozzi], he had similar thoughts on this. We discussed
that the recovery for region assignment priority should be META, then system
table, and then user table regions. When [~tedyu] tried to fix a customer
issue in HBASE-14190 and he had this idea on dedicate system table WAL. After
discussion, we decided to have this JIRA to separate the work of HBASE-14190 to
make it cleaner. Having dedicated WAL(s) for system tables would help assign
system table region in higher priority than user table region, and hence speed
up system table region assignment.
Once we agree that there is a need to have dedicated WAL(s) for system tables,
then we can discuss the approach to implement this JIRA. I think [~tedyu]'s
approach is reasonable - to be safe and minimize the impact on this change, not
touch META WAL handling and use a separate logroller for system table.
However, I am open to more discussion on the approach. (By the way, my
personal opinion is that we should have sys:meta / sys:namespace / etc. instead
of hbase.meta /hbase:namespace / etc., but too late to make this change :-);
the same is that we have to keep .meta for META table WAL even it is a system
table)
> Implement dedicated WAL for system tables
> -----------------------------------------
>
> Key: HBASE-14623
> URL: https://issues.apache.org/jira/browse/HBASE-14623
> Project: HBase
> Issue Type: Sub-task
> Components: wal
> Reporter: Ted Yu
> Assignee: Ted Yu
> Labels: wal
> Fix For: 2.0.0
>
> Attachments: 14623-v1.txt, 14623-v2.txt, 14623-v2.txt, 14623-v2.txt,
> 14623-v2.txt, 14623-v3.txt, 14623-v4.txt
>
>
> As Stephen suggested in parent JIRA, dedicating separate WAL for system
> tables (other than hbase:meta) should be done in new JIRA.
> This task is to fulfill the system WAL separation.
> Below is summary of discussion:
> For system table to have its own WAL, we would recover system table faster
> (fast log split, fast log replay). It would probably benefit
> AssignmentManager on system table region assignment. At this time, the new
> AssignmentManager is not planned to change WAL. So the existence of this JIRA
> is good for overall system, not specific to AssignmentManager.
> There are 3 strategies for implementing system table WAL:
> 1. one WAL for all non-meta system tables
> 2. one WAL for each non-meta system table
> 3. one WAL for each region of non-meta system table
> Currently most system tables are one region table (only ACL table may become
> big). Choices 2 and 3 basically are the same.
> From implementation point of view, choices 2 and 3 are cleaner than choice 1
> (as we have already had 1 WAL for META table and we can reuse the logic).
> With choice 2 or 3, assignment manager performance should not be impacted and
> it would be easier for assignment manager to assign system table region (eg.
> without waiting for user table log split to complete for assigning system
> table region).
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)