klockla opened a new pull request, #1728: URL: https://github.com/apache/stormcrawler/pull/1728
This PR adds a bolt that removes Personally Identifiable Information (PII). The PiiBolt is used together with a class implementing the PiiInterface, which provides the actual PII removal logic. The PR also implements the PresidioRedactor class, which uses Microsoft Presidio ( https://microsoft.github.io/presidio/ ) as a PII back-end. It can be configured for different PII entities (names, phone numbers, locations, etc.) and different languages, depending on how the back-end is deployed.
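To illustrate the design described above (a processing step that delegates redaction to a pluggable implementation), here is a minimal sketch in plain Java. It does not reproduce the code in this PR: the Redactor interface is a hypothetical stand-in for PiiInterface (whose real signature may differ), RedactingStage stands in for the bolt without any Storm dependency, and RegexRedactor is a toy regex-based back-end used instead of Presidio.

```java
import java.util.regex.Pattern;

public class PiiSketch {

    /** Hypothetical stand-in for the PR's PiiInterface; the real signature may differ. */
    interface Redactor {
        String redact(String text, String language);
    }

    /** Toy regex-based redactor for illustration only; it is NOT Presidio. */
    static class RegexRedactor implements Redactor {
        private static final Pattern EMAIL =
                Pattern.compile("[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\\.[A-Za-z]{2,}");
        private static final Pattern PHONE =
                Pattern.compile("\\+?\\d[\\d\\s().-]{7,}\\d");

        @Override
        public String redact(String text, String language) {
            // The language argument is ignored here; a real back-end would
            // presumably use it to pick language-specific recognizers.
            String out = EMAIL.matcher(text).replaceAll("<EMAIL>");
            return PHONE.matcher(out).replaceAll("<PHONE>");
        }
    }

    /** Plays the role of the bolt: holds a redactor and applies it to each text. */
    static class RedactingStage {
        private final Redactor redactor;

        RedactingStage(Redactor redactor) {
            this.redactor = redactor;
        }

        String process(String text, String language) {
            return redactor.redact(text, language);
        }
    }

    public static void main(String[] args) {
        RedactingStage stage = new RedactingStage(new RegexRedactor());
        System.out.println(
                stage.process("Contact jane.doe@example.org or +1 555 123 4567.", "en"));
    }
}
```

Keeping the redaction logic behind an interface means the processing step does not need to know which back-end is in use, so a Presidio-backed implementation could be swapped in for the toy one without touching the stage itself.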
