RyogaWan commented on issue #41431:
URL: https://github.com/apache/arrow/issues/41431#issuecomment-2095591833
> ```
> def test_encrypted_parquet_dataset():
> source_enc_parquet = "./test.enc.parquet"
> crypt_factory = pe.CryptoFactory(kms_client_factory)
> encryption_config = pe.EncryptionConfiguration(
> footer_key=FOOTER_KEY_NAME,
> column_keys={
> FOOTER_KEY_NAME: rb.schema.names, // <-- something wrong here
> },
> encryption_algorithm="AES_GCM_V1",
> data_key_length_bits=256,
> )
> kms_connection_config = pe.KmsConnectionConfig()
> ```
>
> @RyogaWan Could you double check the suspicious line above? It seems that
we need to use COL_KEY_NAME list as in
>
>
https://github.com/apache/arrow/blob/4cf44b4bc3ab053b03c937d3327d43c105790462/python/examples/dataset/write_dataset_encrypted.py#L43-L53
I think this is an identifier for column_keys used in KmsClient to get key
to wrap or unwrap real key that encrypt dataset. In the example, I want to use
same key for column and footer, so i just used FOOTER_KEY_NAME in the dict. Is
there anything i‘m misunderstanding? And I apologize for the confusion I have
caused.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]