[jira] [Updated] (ATLAS-4808) Automatic data classification Support by Atlas

2023-11-06 Thread Jagadesh Kiran N (Jira)


 [ 
https://issues.apache.org/jira/browse/ATLAS-4808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jagadesh Kiran N updated ATLAS-4808:

Description: 
a.   We are maintaining a Datalake , Data we store in folders in HDFS and from 
there hive takes the data and stores in tables.

In one of the Hive table we have bunch of columns. one of the column contains 
PII Data. Can Atlas automatically scan data inside and if it finds any PII 
information ( like name, email , phone number etc ) ,

It needs to mark data classification( classify columns in attribute )  for that 
object as PII  and then propagate if any other child objects created from that 
by asynchronously scan data automatically.

The Scan process classify a column as PII or relevant info.

b. Also do atlas support include / exclude option for tables , topics etc ?

  was:
a.   We are maintaining a Datalake , Data we store in folders in HDFS and from 
there hive takes the data and stores in tables.

In one of the Hive table we have bunch of columns. one of the column contains 
PII Data. Can Atlas product will automatically scan data inside and if it finds 
any PII information ( like name, email , phone number etc ) ,

It needs to mark data classification( classify columns in attribute )  for that 
object as PII  and then propagate if any other child objects created from that 
by asynchronously scan data automatically.

The Scan process classify a column as PII or relevant info.

b. Also do atlas support include / exclude option for tables , topics etc ?


> Automatic data classification Support by Atlas
> --
>
> Key: ATLAS-4808
> URL: https://issues.apache.org/jira/browse/ATLAS-4808
> Project: Atlas
>  Issue Type: Wish
>Reporter: Jagadesh Kiran N
>Priority: Minor
>
> a.   We are maintaining a Datalake , Data we store in folders in HDFS and 
> from there hive takes the data and stores in tables.
> In one of the Hive table we have bunch of columns. one of the column contains 
> PII Data. Can Atlas automatically scan data inside and if it finds any PII 
> information ( like name, email , phone number etc ) ,
> It needs to mark data classification( classify columns in attribute )  for 
> that object as PII  and then propagate if any other child objects created 
> from that by asynchronously scan data automatically.
> The Scan process classify a column as PII or relevant info.
> b. Also do atlas support include / exclude option for tables , topics etc ?



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Updated] (ATLAS-4808) Automatic data classification Support by Atlas

2023-11-06 Thread Jagadesh Kiran N (Jira)


 [ 
https://issues.apache.org/jira/browse/ATLAS-4808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jagadesh Kiran N updated ATLAS-4808:

Description: 
a.   We are maintaining a Datalake , Data we store in folders in HDFS and from 
there hive takes the data and stores in tables.

In one of the Hive table we have bunch of columns. one of the column contains 
PII Data. Can Atlas product will automatically scan data inside and if it finds 
any PII information ( like name, email , phone number etc ) ,

It needs to mark data classification( classify columns in attribute )  for that 
object as PII  and then propagate if any other child objects created from that 
by asynchronously scan data automatically.

The Scan process classify a column as PII or relevant info.

b. Also do atlas support include / exclude option for tables , topics etc ?

  was:
a.   We are maintaining a Datalake , Data we store in folders in HDFS and from 
there hive takes the data and stores in tables.

In one of the Hive table we have bunch of columns. one of them is  child_id 
which contains PII Data. Can Atlas product will automatically scan data inside 
and if it finds any PII information ( like name, email , phone number etc ) ,

It needs to mark data classification( classify columns in attribute )  for that 
object as PII  and then propagate if any other child objects created from that 
by asynchronously scan data automatically.

The Scan process classify a column as PII or relevant info.

b. Also do atlas support include / exclude option for tables , topics etc ?


> Automatic data classification Support by Atlas
> --
>
> Key: ATLAS-4808
> URL: https://issues.apache.org/jira/browse/ATLAS-4808
> Project: Atlas
>  Issue Type: Wish
>Reporter: Jagadesh Kiran N
>Priority: Minor
>
> a.   We are maintaining a Datalake , Data we store in folders in HDFS and 
> from there hive takes the data and stores in tables.
> In one of the Hive table we have bunch of columns. one of the column contains 
> PII Data. Can Atlas product will automatically scan data inside and if it 
> finds any PII information ( like name, email , phone number etc ) ,
> It needs to mark data classification( classify columns in attribute )  for 
> that object as PII  and then propagate if any other child objects created 
> from that by asynchronously scan data automatically.
> The Scan process classify a column as PII or relevant info.
> b. Also do atlas support include / exclude option for tables , topics etc ?



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (ATLAS-4808) Automatic data classification Support by Atlas

2023-11-06 Thread Jagadesh Kiran N (Jira)
Jagadesh Kiran N created ATLAS-4808:
---

 Summary: Automatic data classification Support by Atlas
 Key: ATLAS-4808
 URL: https://issues.apache.org/jira/browse/ATLAS-4808
 Project: Atlas
  Issue Type: Wish
Reporter: Jagadesh Kiran N


a.   We are maintaining a Datalake , Data we store in folders in HDFS and from 
there hive takes the data and stores in tables.

In one of the Hive table we have bunch of columns. one of them is  child_id 
which contains PII Data. Can Atlas product will automatically scan data inside 
and if it finds any PII information ( like name, email , phone number etc ) ,

It needs to mark data classification( classify columns in attribute )  for that 
object as PII  and then propagate if any other child objects created from that 
by asynchronously scan data automatically.

The Scan process classify a column as PII or relevant info.

b. Also do atlas support include / exclude option for tables , topics etc ?



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


Re: Review Request 74562: ATLAS-4788 : Encrpytion of clear text kafka password in application.properties

2023-11-06 Thread chaitali

---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/74562/
---

(Updated Nov. 6, 2023, 3:44 p.m.)


Review request for atlas, Jayendra Parab, Paresh Devalia, and Sheetal Shah.


Summary (updated)
-

ATLAS-4788 : Encrpytion of clear text kafka password in application.properties


Bugs: ATLAS-4788
https://issues.apache.org/jira/browse/ATLAS-4788


Repository: atlas


Description
---

atlas.jaas.KafkaClient.option.username=username
atlas.jaas.KafkaClient.option.password=

We have to encrypt this passsword using jceks file

./cputil.py  -k atlas.jaas.KafkaClient.option.password -p P@$$w0rd -r 
jceks://file/home/project/atlas/kafka.jceks


Diffs
-

  common/src/main/java/org/apache/atlas/utils/KafkaUtils.java 167442259 
  common/src/test/java/org/apache/atlas/utils/KafkaUtilsTest.java 562e28ae1 


Diff: https://reviews.apache.org/r/74562/diff/10/


Testing
---

test case added
https://ci-builds.apache.org/job/Atlas/job/PreCommit-ATLAS-Build-Test/1498/consoleFull


Thanks,

chaitali



[jira] [Updated] (ATLAS-4788) Encrpytion of clear text kafka password in application.properties

2023-11-06 Thread chaitali borole (Jira)


 [ 
https://issues.apache.org/jira/browse/ATLAS-4788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

chaitali borole updated ATLAS-4788:
---
Summary: Encrpytion of  clear text kafka password in application.properties 
 (was: Encrpytion of kafka password in ap+plication.properties)

> Encrpytion of  clear text kafka password in application.properties
> --
>
> Key: ATLAS-4788
> URL: https://issues.apache.org/jira/browse/ATLAS-4788
> Project: Atlas
>  Issue Type: Improvement
>Affects Versions: 3.0.0
>Reporter: chaitali borole
>Assignee: chaitali borole
>Priority: Major
> Fix For: 3.0.0
>
>
> atlas.jaas.KafkaClient.option.username=username
> atlas.jaas.KafkaClient.option.password=
> We have to encrypt this passsword using jceks file



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Updated] (ATLAS-4788) Encrpytion of kafka password in ap+plication.properties

2023-11-06 Thread chaitali borole (Jira)


 [ 
https://issues.apache.org/jira/browse/ATLAS-4788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

chaitali borole updated ATLAS-4788:
---
Summary: Encrpytion of kafka password in ap+plication.properties  (was: 
Kafka password is in clear text in application.properties)

> Encrpytion of kafka password in ap+plication.properties
> ---
>
> Key: ATLAS-4788
> URL: https://issues.apache.org/jira/browse/ATLAS-4788
> Project: Atlas
>  Issue Type: Improvement
>Affects Versions: 3.0.0
>Reporter: chaitali borole
>Assignee: chaitali borole
>Priority: Major
> Fix For: 3.0.0
>
>
> atlas.jaas.KafkaClient.option.username=username
> atlas.jaas.KafkaClient.option.password=
> We have to encrypt this passsword using jceks file



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Commented] (ATLAS-4788) Kafka password is in clear text in application.properties

2023-11-06 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/ATLAS-4788?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17783234#comment-17783234
 ] 

ASF subversion and git services commented on ATLAS-4788:


Commit 55519c0c13e19187ef3496d5f9f775e90cd6dbb0 in atlas's branch 
refs/heads/branch-2.0 from chaitali
[ https://gitbox.apache.org/repos/asf?p=atlas.git;h=55519c0c1 ]

ATLAS-4788 : Kafka password is in clear text in application.properties

Signed-off-by: Pinal Shah 


> Kafka password is in clear text in application.properties
> -
>
> Key: ATLAS-4788
> URL: https://issues.apache.org/jira/browse/ATLAS-4788
> Project: Atlas
>  Issue Type: Improvement
>Affects Versions: 3.0.0
>Reporter: chaitali borole
>Assignee: chaitali borole
>Priority: Major
> Fix For: 3.0.0
>
>
> atlas.jaas.KafkaClient.option.username=username
> atlas.jaas.KafkaClient.option.password=
> We have to encrypt this passsword using jceks file



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Commented] (ATLAS-4788) Kafka password is in clear text in application.properties

2023-11-06 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/ATLAS-4788?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17783233#comment-17783233
 ] 

ASF subversion and git services commented on ATLAS-4788:


Commit 14adbad94a0de95f6d390b2d8c80d9bd914c0438 in atlas's branch 
refs/heads/master from chaitali
[ https://gitbox.apache.org/repos/asf?p=atlas.git;h=14adbad94 ]

ATLAS-4788 : Kafka password is in clear text in application.properties

Signed-off-by: Pinal Shah 


> Kafka password is in clear text in application.properties
> -
>
> Key: ATLAS-4788
> URL: https://issues.apache.org/jira/browse/ATLAS-4788
> Project: Atlas
>  Issue Type: Improvement
>Affects Versions: 3.0.0
>Reporter: chaitali borole
>Assignee: chaitali borole
>Priority: Major
> Fix For: 3.0.0
>
>
> atlas.jaas.KafkaClient.option.username=username
> atlas.jaas.KafkaClient.option.password=
> We have to encrypt this passsword using jceks file



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


Re: Review Request 74562: ATLAS-4788 : Kafka password is in clear text in application.properties

2023-11-06 Thread Mandar Ambawane

---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/74562/#review225936
---


Ship it!




Ship It!

- Mandar Ambawane


On Nov. 6, 2023, 11:13 a.m., chaitali wrote:
> 
> ---
> This is an automatically generated e-mail. To reply, visit:
> https://reviews.apache.org/r/74562/
> ---
> 
> (Updated Nov. 6, 2023, 11:13 a.m.)
> 
> 
> Review request for atlas, Jayendra Parab, Paresh Devalia, and Sheetal Shah.
> 
> 
> Bugs: ATLAS-4788
> https://issues.apache.org/jira/browse/ATLAS-4788
> 
> 
> Repository: atlas
> 
> 
> Description
> ---
> 
> atlas.jaas.KafkaClient.option.username=username
> atlas.jaas.KafkaClient.option.password=
> 
> We have to encrypt this passsword using jceks file
> 
> ./cputil.py  -k atlas.jaas.KafkaClient.option.password -p P@$$w0rd -r 
> jceks://file/home/project/atlas/kafka.jceks
> 
> 
> Diffs
> -
> 
>   common/src/main/java/org/apache/atlas/utils/KafkaUtils.java 167442259 
>   common/src/test/java/org/apache/atlas/utils/KafkaUtilsTest.java 562e28ae1 
> 
> 
> Diff: https://reviews.apache.org/r/74562/diff/10/
> 
> 
> Testing
> ---
> 
> test case added
> https://ci-builds.apache.org/job/Atlas/job/PreCommit-ATLAS-Build-Test/1498/consoleFull
> 
> 
> Thanks,
> 
> chaitali
> 
>



Re: Review Request 74562: ATLAS-4788 : Kafka password is in clear text in application.properties

2023-11-06 Thread chaitali

---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/74562/
---

(Updated Nov. 6, 2023, 11:13 a.m.)


Review request for atlas, Jayendra Parab, Paresh Devalia, and Sheetal Shah.


Bugs: ATLAS-4788
https://issues.apache.org/jira/browse/ATLAS-4788


Repository: atlas


Description
---

atlas.jaas.KafkaClient.option.username=username
atlas.jaas.KafkaClient.option.password=

We have to encrypt this passsword using jceks file

./cputil.py  -k atlas.jaas.KafkaClient.option.password -p P@$$w0rd -r 
jceks://file/home/project/atlas/kafka.jceks


Diffs
-

  common/src/main/java/org/apache/atlas/utils/KafkaUtils.java 167442259 
  common/src/test/java/org/apache/atlas/utils/KafkaUtilsTest.java 562e28ae1 


Diff: https://reviews.apache.org/r/74562/diff/10/


Testing
---

test case added
https://ci-builds.apache.org/job/Atlas/job/PreCommit-ATLAS-Build-Test/1498/consoleFull


Thanks,

chaitali