[jira] [Commented] (HBASE-9457) Master could fail start if region server with system table is down

Francis Liu (JIRA) Sun, 15 Sep 2013 12:20:03 -0700

    [ 
https://issues.apache.org/jira/browse/HBASE-9457?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13767881#comment-13767881
 ]


Francis Liu commented on HBASE-9457:
------------------------------------

Looked at the patch. It seems the zk namespace data gets stale with this 
approach. Since the namespace table is lazily loaded, this causes the namespace 
zk information to be lazily loaded as well. Which causes the data read by zkNM 
clients to be stale until one of the namespace apis are invoked which seems to 
be unbounded.

I think there are two other options:

1. move zkNM to a region observer coprocessor
2. roll namespace table into the meta handler classes (metaWAL, metaSSH, etc)

If time is a concern we can just remove zkNM and address it in a separate jira.

Good catch on the race condition with the janitor. I'll address that in 
HBASE-9540.
                
> Master could fail start if region server with system table is down
> ------------------------------------------------------------------
>
>                 Key: HBASE-9457
>                 URL: https://issues.apache.org/jira/browse/HBASE-9457
>             Project: HBase
>          Issue Type: Bug
>          Components: master, Region Assignment
>            Reporter: Jimmy Xiang
>            Assignee: Jimmy Xiang
>            Priority: Critical
>         Attachments: trunk-9457.patch, trunk-9457_v2.1.patch, 
> trunk-9457_v2.2.patch, trunk-9457_v2.patch
>
>
> In the region server holding the system table is killed while master is 
> starting, master will hang there waiting for system table to be assigned 
> which won't happen.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HBASE-9457) Master could fail start if region server with system table is down

Reply via email to