Re: Ranger + Hive

Odon Copon Tue, 26 Mar 2019 14:41:19 -0700

Hi Bosco,
I was reading this link:
https://community.hortonworks.com/articles/175133/how-hiveserver2-and-ranger-interact-internals.html
And it was specifically mentioning the Hiveserver2, so that's why I was
wondering if just the Hiveserver2 needed to run the SSSD.


As a good practice do you mean is mandatory?

Thanks.

On Tue, 26 Mar 2019 at 21:28, Don Bosco Durai <bo...@apache.org> wrote:

> Hi Odon
>
>
>
> As a good practice, each node should have the SSSD installed and
> configured.
>
>
>
> If you are doing PoC or testing out, then at least the master nodes should
> have it configured.
>
>
>
> Bosco
>
>
>
>
>
> *From: *Odon Copon <odonco...@gmail.com>
> *Reply-To: *<user@ranger.apache.org>
> *Date: *Tuesday, March 26, 2019 at 2:07 AM
> *To: *<user@ranger.apache.org>
> *Subject: *Re: Ranger + Hive
>
>
>
> Good point. Not sure which components need to have the users and groups
> from ldap.
>
> Just Hiveserver2? Any other Hadoop component?
>
> Is there any link to that information?
>
>
>
> Thanks
>
>
>
> On Tue, 26 Mar 2019, 01:30 Don Bosco Durai, <bo...@apache.org> wrote:
>
> If you already have LDAP, then you should install SSSD on all nodes. SSSD
> will only materialize the users when requested for.
>
>
>
> I think, in your case, if you are only using Hive, then you would just
> need SSSD on the server which is running HiveServer2
>
>
>
> Depending on the users you want to set policies in Ranger, you can apply
> filters during user sync.
>
>
>
> Bosco
>
>
>
>
>
> *From: *Odon Copon <odonco...@gmail.com>
> *Reply-To: *<user@ranger.apache.org>
> *Date: *Monday, March 25, 2019 at 3:33 PM
> *To: *<user@ranger.apache.org>
> *Subject: *Re: Ranger + Hive
>
>
>
> Hi Bosco,
>
> Thank you for your help and the information provided.
>
>
>
> I don't want to have all users and groups as part of the server, that's
> why I'm looking for the mapping option with LDAP.
>
> My groups are changing rapidly and I'm not considering having something
> like Ansible adding and removing users and groups from the server
> constantly.
>
>
>
> Does it make sense?
>
> Thanks
>
> On Mon, 25 Mar 2019, 21:58 Don Bosco Durai, <bo...@apache.org> wrote:
>
> Hi Odon
>
>
>
> If you are not using Kerberos, then it is much simpler. You don’t need do
> a lot…
>
>
>
> Do you even need groups or group level policies? If so, you just need to
> create OS users and assign the groups you want to on the server where Hive
> Server2 is running
>
>
>
> > Why do you say "LdapGroupsMapping is not recommended". It seems the only
> way to ingest and use information from LDAP.
>
> By default, Hadoop will go to the OS and get the groups for the user. So
> if you are doing SSSD (or similar technology), then it will get the groups
> from LDAP for you. So you don’t need to do any configuration in the
> core-site.xml.
>
>
>
> Check this article :
> https://www.cloudera.com/documentation/enterprise/5-9-x/topics/cm_sg_ldap_grp_mappings.html
>
>
>
> Bosco
>
>
>
>
>
>
>
> *From: *Odon Copon <odonco...@gmail.com>
> *Reply-To: *<user@ranger.apache.org>
> *Date: *Monday, March 25, 2019 at 10:09 AM
> *To: *<user@ranger.apache.org>
> *Subject: *Re: Ranger + Hive
>
>
>
> Hi Bosco,
>
> Thanks for your help.
>
> For this test I'm not using Kerberos, I'm just testing a simple pipeline
> with Hive+Ranger and some external tables in S3 and see what are the
> requirements.
>
> From your comments, I understand I need to setup SSSD as explained in the
> link you provided:
> https://docs.hortonworks.com/HDPDocuments/Ambari-2.6.2.0/bk_ambari-security/content/setting_up_hadoop_group_mappping_for_ldap_ad.html
>
> and having also the sync for Ranger would allow me to create policies.
>
>
>
> Why do you say "LdapGroupsMapping is not recommended". It seems the only
> way to ingest and use information from LDAP.
>
>
>
> Thanks.
>
>
>
> On Mon, 25 Mar 2019 at 16:47, Don Bosco Durai <bo...@apache.org> wrote:
>
> There are few things:
>
>    1. In Kerberos/secure mode, users needs to be materialized on each
>    node. If you are using AD/LDAP, then you can use SSSD (or equivalent), else
>    you need to create the users explicitly on each node using ansible or
>    puppet or manually…
>    2. The group mapping can be via LDAP or by groups from unix (SSSD will
>    also do this you). FYI, LdapGroupsMapping is not recommended due to
>    performance reasons. FYI, if you are using SSSD, it will get the groups
>    from LDAP/AD
>    3. In Kerberos/secure mode, you need to materialize users on each node
>    regardless whether you are accessing S3 or HDFS. This is a YARN
>    requirement. So the that the YARN job process will run as the end user.
>    4. The users and groups in Ranger are just for convenience to create
>    policy. Having it or not in Ranger, doesn’t affect the service. However,
>    you will not be able to create the policies in Ranger. During testing or
>    PoC, if you don’t want to sync, you can manually add to Ranger to using
>    Ranger Admin UI
>
>
>
> Bosco
>
>
>
>
>
> *From: *Odon Copon <odonco...@gmail.com>
> *Reply-To: *<user@ranger.apache.org>
> *Date: *Monday, March 25, 2019 at 8:36 AM
> *To: *<user@ranger.apache.org>
> *Subject: *Ranger + Hive
>
>
>
> Hi,
>
> On my last test using HDFS + Ranger I had to sync my LDAP groups with
> Hadoop based on the following link:
> https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.6.5/bk_security/content/setting_up_hadoop_group_mappping_for_ldap_ad.html
>
>
>
> That means users and groups had to be in Ranger and Hadoop cluster to make
> policies to work.
>
> But what about Hive + Ranger?
>
> Is that mapping also required?
>
> do I need users also to be mapped in Hadoop cluster?
>
> what if tables are in S3 instead of HDFS per example?
>
>
>
> Thanks.
>
>

Re: Ranger + Hive

Reply via email to