[ https://issues.apache.org/jira/browse/OMID-304?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17937325#comment-17937325 ]
Istvan Toth edited comment on OMID-304 at 3/24/25 7:13 AM: ----------------------------------------------------------- Omid Zookeeper Authentication Design Document h1. Motivation Traditionally Big Data clusters did not focus on security, and relied on firewalls and limited access for security. The requirements are changing, and many users require encryption and/or authorization for all network traffic. While this is foremost a regulatory requirement, implementing access controls does increase security and protects against attacks. h1. Scope This applies to Omid's direct uses of ZK: * HA * ZK timestamp store The ZK used by HBase is outside the scope, unless it interferes with the above use cases. Zookeeper has a very flexible authentication and authorization framework, but as Omid is tied to HBase, only the authorization method supported by HBase (SASL) is considered. h1. Subjects and permissions Omid is a distributed system, its code is running in three environments: * TSO server * HBase coprocessors * HBase client The ZK timestamp store is only ever accessed from the TSOprocess (if configured, as it is not the default) The ZK HA nodes are only modified by the TSO process, and are read by the HBase client and coprocessor components. h1. Authenticating Omid The Omid HBase client already supports logging into a Hadoop/HBase cluster. The same Subject is used for authenticating to ZK. Unless problems are found, the current authentication logic will be used without changes. {color:#FF0000}Q: Does the TSO log on to HBase (Kerberos) ? {color}It has to when it uses HBaseCommitTable, but what happens when InMemoryCommitTable is configured ? Do we even care ? h1. The HBase solution for ACLs HBase uses two roles, a superuser , and the client role, and marks some znodes as "client readable" Full access is provided to the creator of all Znodes. (usually the hbase superuser) Full access is provided for all Znodes to all principals in defined in the *hbase.superuser* property For Znodes which are _isClientReadable_ (as returned by the ZNodePaths class), a world readable permission is also added. h1. Setting the ACLs We can pretty much copy the HBase logic for this. Timestamp store Znodes are only used by the TSO processes, so only the TSO principal needs to access them, we can treat them as the HBase treats the non client readable Znodes. The HA Znodes are only modified by the TSO processes, and only read by everyone else, so they can be treated as client readable Znodes. HBase uses some helper classes for handling the ACLs,Curator has built-in support for that, so we only need to implement the Curator hooks. was (Author: stoty): Omid Zookeeper Authentication Design Document h1. Motivation Traditionally Big Data clusters did not focus on security, and relied on firewalls and limited access for security. The requirements are changing, and many users require encryption and/or authorization for all network traffic. While this is foremost a regulatory requirement, implementing access controls does increase security and protects against attacks. h1. Scope This applies to Omid's direct uses of ZK: * HA * ZK timestamp store The ZK used by HBase is outside the scope, unless it interferes with the above use cases. Zookeeper has a very flexible authentication and authorization framework, but as Omid is tied to HBase, only the authorization method supported by HBase (SASL) is considered. h1. Subjects and permissions Omid is a distributed system, its code is running in three environments: * TSO server * HBase coprocessors * HBase client {color:#FF0000}The first task is identifying which of these environments are accessing ZK, and what operations they perform, for both (ZK, timestamp) use cases.{color} Based on this, we need to group the ZNodes created by Omid into classes which can have the same ACLs applied to them. h1. Authenticating Omid The Omid HBase client already supports logging into a Hadoop/HBase cluster. The same Subject is used for authenticating to ZK. Unless problems are found, the current authentication logic will be used without changes. {color:#FF0000}Q: Does TSO log on to HBase when the ZK timestamp store is configured ?{color} If not, then we may need to change that. h1. Setting the ACLs See general considerations above for the different subjects and required permissions. We can define roles, and use fixed permissions for each role and class of ZNodes, and specify a list of subjects per role. This can be achieved by adding configurable lists for each class. As all ZK access is done via Curator, we can implement this with an ACLProvider. For example, if for the HA ZNodes, we want an ACL list like this: SASL:omid_tso_subject@realm : CRDWA SASL:hbase_server_subject@realm: R SASL:phoenix_client_subject@realm: R Then we can define a HA_TSO role (CRDWA) and a HA_CLIENT (R) role ( fixed in code), and the configuration could look something like: subjects: HA_TSO: * SASL:omid_tso_subject@realm HA_CLIENT: * SASL:hbase_server_subject@realm * SASL:phoenix_client_subject@realm > Support Zookeper Authorization > ------------------------------ > > Key: OMID-304 > URL: https://issues.apache.org/jira/browse/OMID-304 > Project: Phoenix Omid > Issue Type: New Feature > Reporter: Istvan Toth > Assignee: Istvan Toth > Priority: Major > > Omid uses ZK directly both for HA and for its timestamp storage, but does not > support authorization. > This is not acceptable in some use cases. > Add support for ZK authorization. -- This message was sent by Atlassian Jira (v8.20.10#820010)