Madhan Neethiraj created RANGER-3923:
----------------------------------------
Summary: Dataset policies
Key: RANGER-3923
URL: https://issues.apache.org/jira/browse/RANGER-3923
Project: Ranger
Issue Type: New Feature
Components: Ranger
Reporter: Madhan Neethiraj
Assignee: Madhan Neethiraj
Given that the primary business value of Apache Ranger is to enable sharing of
resources, it will help if Apache Ranger provides an abstraction, a dataset,
that makes a set of resources/data across services the unit of sharing, instead
of one or more resources in each service. This has several benefits, such as:
# A single policy to manage access to data in multiple services - like HBase,
Hive, Snowflake, Kafka, Google BigQuery, AWS S3, AWS Redshift, ADLS-Gen2. This
enables authorization to be centered around a purpose, like:
* Marketing Campaign 2022 dataset
* Sales 2021 dataset
* CA Claims 2021 dataset
# Enables different sets of users to manage sharing of data into a dataset and
to manage access to the data in the dataset:
* Data owners share data into a dataset, with necessary masking, row-filters
and schedules; they can update the share details, or stop sharing into the
dataset.
* Dataset admins manage who has access to the data in the dataset. This
relieves data owners from having to micromanage access to the shared data, for
example when a user needs access to the data in multiple services to
participate in a project.
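
To make the split of responsibilities above concrete, here is a minimal Python
sketch of the idea. It is not Ranger's API or the design in the attached
document: the class names (SharedResource, Dataset), the method names, and the
example resource names are all hypothetical, and schedules are left out for
brevity. It only illustrates that data owners control what is shared (and with
which mask or row-filter), while dataset admins control who has access.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Set


@dataclass(frozen=True)
class SharedResource:
    """One resource a data owner shares into a dataset (hypothetical model)."""
    service: str                      # e.g. "hive", "kafka", "s3"
    resource: str                     # service-specific resource name
    mask: Optional[str] = None        # masking chosen by the data owner, if any
    row_filter: Optional[str] = None  # row-filter chosen by the data owner, if any


@dataclass
class Dataset:
    """A named set of resources across services, used as the unit of sharing."""
    name: str
    shares: List[SharedResource] = field(default_factory=list)  # data owners
    members: Set[str] = field(default_factory=set)              # dataset admins

    # Operations performed by data owners
    def share(self, shared: SharedResource) -> None:
        self.shares.append(shared)

    def stop_sharing(self, service: str, resource: str) -> None:
        self.shares = [s for s in self.shares
                       if (s.service, s.resource) != (service, resource)]

    # Operations performed by dataset admins
    def grant(self, user: str) -> None:
        self.members.add(user)

    def revoke(self, user: str) -> None:
        self.members.discard(user)

    def lookup(self, user: str, service: str,
               resource: str) -> Optional[SharedResource]:
        """Return the share that applies to this user, or None if no access."""
        if user not in self.members:
            return None
        for s in self.shares:
            if s.service == service and s.resource == resource:
                return s
        return None


# Hypothetical usage: one dataset spanning two services.
sales = Dataset("Sales 2021")
sales.share(SharedResource("hive", "sales.orders_2021", row_filter="region = 'CA'"))
sales.share(SharedResource("s3", "sales-bucket/2021/"))
sales.grant("alice")
```

In this sketch, granting "alice" access once gives her both the Hive and S3
resources, and the data owner's row-filter travels with the Hive share rather
than being configured per user.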
The attached document has more details on this new abstraction, including a
number of questions and answers that help explain various aspects of this
feature. Please read and add your comments/suggestions.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)