[
https://issues.apache.org/jira/browse/HBASE-16488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16814932#comment-16814932
]
Andrew Purtell edited comment on HBASE-16488 at 4/10/19 11:18 PM:
------------------------------------------------------------------
Tests looked good running locally.
[~xucang] if you have some time and the interest, it would be good if we could
have a unit test for this change where:
* Case 1: hbase.master.start.wait.for.namespacemanager=true. Set the
namespace init timeout low. Inject a stall to keep the master from starting up
in time. Expect failure (master shutdown).
* Case 2: hbase.master.start.wait.for.namespacemanager=false. Same timeout
and injected delay, but the master should initialize successfully.
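A minimal sketch of the two cases above, written against a toy stand-in rather than the real master or the HBase test utilities. Everything except the config key is hypothetical: the class NamespaceStartupSketch, the startMaster method, the Outcome enum, and the latch-based stall are illustrative assumptions, not the patch's actual API.

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

/** Toy stand-in for master startup; none of these names are HBase classes. */
class NamespaceStartupSketch {

  /** Hypothetical outcome of a simulated master startup. */
  enum Outcome { INITIALIZED, ABORTED }

  /**
   * Simulates master startup with a stalled namespace initialization.
   *
   * @param waitForNamespaceManager stands in for
   *        hbase.master.start.wait.for.namespacemanager
   * @param initTimeoutMs the (low) namespace init timeout
   * @param stall injected stall: namespace init blocks until this latch is released
   */
  static Outcome startMaster(boolean waitForNamespaceManager, long initTimeoutMs,
      CountDownLatch stall) {
    // Namespace initialization always starts in the background.
    CompletableFuture<Void> namespaceInit = CompletableFuture.runAsync(() -> {
      try {
        stall.await(); // the injected stall
      } catch (InterruptedException e) {
        Thread.currentThread().interrupt();
      }
    });

    if (!waitForNamespaceManager) {
      // Case 2: startup does not block on the namespace manager.
      return Outcome.INITIALIZED;
    }
    try {
      // Case 1: startup blocks until the namespace manager is ready or times out.
      namespaceInit.get(initTimeoutMs, TimeUnit.MILLISECONDS);
      return Outcome.INITIALIZED;
    } catch (TimeoutException | ExecutionException e) {
      return Outcome.ABORTED; // stands in for master shutdown
    } catch (InterruptedException e) {
      Thread.currentThread().interrupt();
      return Outcome.ABORTED;
    }
  }

  public static void main(String[] args) {
    CountDownLatch stall = new CountDownLatch(1); // never released during the checks
    try {
      System.out.println("wait=true  -> " + startMaster(true, 100, stall));
      System.out.println("wait=false -> " + startMaster(false, 100, stall));
    } finally {
      stall.countDown(); // release the stalled background tasks
    }
  }
}
```

With the stall held for the whole run, the first call should time out and report ABORTED, and the second should report INITIALIZED. A real test would presumably set the flag and timeout on the master's Configuration and inject the delay into namespace table assignment instead of using a latch.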
was (Author: apurtell):
Tests looked good running locally.
[~xucang] if you have some time and the interest, it would be good if we could
have a unit test for this change where:
* Case 1: hbase.master.start.wait.for.namespacemanager=true . Set the
namespace init timeout low. Inject a stall to keep the master from starting up
in time. Expect failure (master shutdown)
* Case 2: hbase.master.start.wait.for.namespacemanager=true . Same timeout and
injected delay but master should successfully initialize.
> Starting namespace and quota services in master startup asynchronizely
> ----------------------------------------------------------------------
>
> Key: HBASE-16488
> URL: https://issues.apache.org/jira/browse/HBASE-16488
> Project: HBase
> Issue Type: Improvement
> Components: master
> Affects Versions: 1.3.0, 1.0.3, 1.4.0, 1.1.5, 1.2.2, 2.0.0
> Reporter: Stephen Yuan Jiang
> Assignee: Xu Cang
> Priority: Major
> Attachments: HBASE-16488.branch-1.012.patch,
> HBASE-16488.branch-1.012.patch, HBASE-16488.revisit.v11-branch-1.patch,
> HBASE-16488.v1-branch-1.patch, HBASE-16488.v1-master.patch,
> HBASE-16488.v10-branch-1.patch, HBASE-16488.v2-branch-1.patch,
> HBASE-16488.v2-branch-1.patch, HBASE-16488.v3-branch-1.patch,
> HBASE-16488.v3-branch-1.patch, HBASE-16488.v4-branch-1.patch,
> HBASE-16488.v5-branch-1.patch, HBASE-16488.v6-branch-1.patch,
> HBASE-16488.v7-branch-1.patch, HBASE-16488.v8-branch-1.patch,
> HBASE-16488.v9-branch-1.patch
>
>
> During internal integration tests (IT) and in customer deployments, we often
> see master initialization fail because the namespace table region takes a long
> time to assign (e.g. log splitting is slow or hangs, a RegionServer is
> temporarily unavailable, or there is some unknown assignment issue). Past
> proposals to improve this include HBASE-13556 / HBASE-14190 (Assign system
> tables ahead of user region assignment), HBASE-13557 (Special WAL handling
> for system tables), and HBASE-14623 (Implement dedicated WAL for system
> tables).
> This JIRA proposes another way to address the master initialization failure:
> the namespace service is used by only a handful of operations (e.g. create
> table / namespace DDL / get namespace API / some RS group DDL). Only the quota
> manager depends on it, and quota management is off by default. Therefore, the
> namespace service is not really needed for the master to be functional, so we
> could start it asynchronously without blocking master startup.
>