Hi,

Given the recent GDPR policy (https://eugdpr.org/), some community members might be wondering about anonymization and data privacy features that we could add to MADlib.

There is a JIRA here: https://issues.apache.org/jira/browse/MADLIB-911. It would be great if folks could add comments or thoughts on which direction to take.

Porting the PDL Tools library function (http://pivotalsoftware.github.io/PDLTools/group__grp__anonymization.html) might be useful, but it is probably just a minimum. For one thing, I do not know what kind of hashing they do. More comprehensive integrations like https://arx.deidentifier.org/, as suggested in the JIRA, would be great but would be a lot of work.

Frank
