tolleybot opened a new pull request, #34616:
URL: https://github.com/apache/arrow/pull/34616

   ### Rationale for this change
   
   The purpose of this pull request is to support modular encryption in the new 
Database API.  See 
[https://docs.google.com/document/d/13EysCNC6-Nu9wnJ8YpdzmD-aMLn4i2KXUJTNqIihy7A/edit#](url)
 for supporting document.
   
   
   ### What changes are included in this PR?
   
   I made improvements to the C++ and Python code to enable the file writer and 
file reader to use customized encryption settings for each file. Previously, 
the Dataset API applied the same encryption properties to all saved files, but 
now I've updated the code to allow for greater flexibility. In the Python code, 
I've added support for the changes by updating the ParquetFormat class to 
accept DatasetEncryptionConfiguration and DatasetDecryptionConfiguration 
structures. With these changes, you can pass the format object to the 
write_dataset function, giving you the ability to set unique encryption 
properties for each file in your Dataset.
   
   ### Are these changes tested?
   
   Yes, unit tests are included. I have also included a python sample project.
   
   ### Are there any user-facing changes?
   
   Yes, as stated above the ParquetFormat class has optional parameters for 
DatasetEncryptionConfiguration and DatasetDecryptionConfiguration through 
setters and getters.
   MakeReaderProperties now optionally takes in a filesystem object and a path. 
  
   
   <!--
   If there are any breaking changes to public APIs, please uncomment the line 
below and explain which changes are breaking.
   -->
   <!-- **This PR includes breaking changes to public APIs.** -->
   
   <!--
   Please uncomment the line below (and provide explanation) if the changes fix 
either (a) a security vulnerability, (b) a bug that caused incorrect or invalid 
data to be produced, or (c) a bug that causes a crash (even when the API 
contract is upheld). We use this to highlight fixes to issues that may affect 
users without their knowledge. For this reason, fixing bugs that cause errors 
don't count, since those are usually obvious.
   -->
   <!-- **This PR contains a "Critical Fix".** -->


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to