Anuj Modi created HADOOP-18910:
----------------------------------

             Summary: ABFS: Adding Support for MD5 Hash based integrity 
verification of the request content during transport 
                 Key: HADOOP-18910
                 URL: https://issues.apache.org/jira/browse/HADOOP-18910
             Project: Hadoop Common
          Issue Type: Sub-task
          Components: fs/azure
            Reporter: Anuj Modi
            Assignee: Anuj Modi


Azure Storage supports Content-MD5 request headers in both the Read and Append APIs.
Read: [Path - Read - REST API (Azure Storage Services) | Microsoft 
Learn|https://learn.microsoft.com/en-us/rest/api/storageservices/datalakestoragegen2/path/read]
Append: [Path - Update - REST API (Azure Storage Services) | Microsoft 
Learn|https://learn.microsoft.com/en-us/rest/api/storageservices/datalakestoragegen2/path/update]

This change adds client-side support for them. In a Read request, the client 
sends the appropriate header, in response to which the server returns the MD5 
hash of the data it sends back. The client then compares this value with the 
MD5 hash it computes from the data received.
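
The read-side comparison described above could be sketched roughly as follows. 
This is only an illustration, not the ABFS driver code: the class and method 
names are hypothetical, and it assumes the server returns the hash as a 
Base64-encoded MD5 digest, which is the usual Content-MD5 encoding.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.Base64;

// Hypothetical helper; the names are illustrative and not taken from the ABFS driver.
class ReadChecksumSketch {

  /** Returns the raw MD5 digest of buffer[offset, offset + length). */
  static byte[] computeMd5(byte[] buffer, int offset, int length) {
    try {
      MessageDigest md5 = MessageDigest.getInstance("MD5");
      md5.update(buffer, offset, length);
      return md5.digest();
    } catch (NoSuchAlgorithmException e) {
      // MD5 is a required algorithm on every standard Java platform.
      throw new IllegalStateException("MD5 is not available", e);
    }
  }

  /** Returns the Base64 form of the digest, as carried in a Content-MD5 header. */
  static String computeMd5Base64(byte[] buffer, int offset, int length) {
    return Base64.getEncoder().encodeToString(computeMd5(buffer, offset, length));
  }

  /**
   * Compares the hash reported by the server (assumed Base64) with the hash
   * of the bytes actually received. Returns false on mismatch or bad input.
   */
  static boolean matchesServerMd5(byte[] buffer, int offset, int length,
      String serverMd5Base64) {
    if (serverMd5Base64 == null) {
      return false;
    }
    try {
      byte[] expected = Base64.getDecoder().decode(serverMd5Base64);
      return MessageDigest.isEqual(expected, computeMd5(buffer, offset, length));
    } catch (IllegalArgumentException e) {
      return false; // header value was not valid Base64
    }
  }

  public static void main(String[] args) {
    byte[] received = "hello world".getBytes(StandardCharsets.UTF_8);
    String fromServer = computeMd5Base64(received, 0, received.length);
    System.out.println(matchesServerMd5(received, 0, received.length, fromServer));
  }
}
```

A caller would typically fail the read (for example by throwing an 
IOException) when matchesServerMd5 returns false, rather than hand possibly 
corrupted bytes to the application.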

In an Append request, the client computes the MD5 hash of the data it is 
sending to the server and specifies it in the appropriate header. On finding 
that header, the server compares it with the MD5 hash it computes over the 
data received.
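
On the append side, the client-side step could look roughly like the sketch 
below. The class name, method names and the checksumEnabled flag are 
hypothetical, and the header map stands in for whatever request object the 
driver really uses; the only assumptions about the protocol are that the hash 
travels in the Content-MD5 header as a Base64-encoded MD5 digest.

```java
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.Base64;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper; the names are illustrative and not taken from the ABFS driver.
class AppendChecksumSketch {

  static final String CONTENT_MD5 = "Content-MD5";

  /** Returns the Base64-encoded MD5 digest of payload[offset, offset + length). */
  static String md5Base64(byte[] payload, int offset, int length) {
    try {
      MessageDigest md5 = MessageDigest.getInstance("MD5");
      md5.update(payload, offset, length);
      return Base64.getEncoder().encodeToString(md5.digest());
    } catch (NoSuchAlgorithmException e) {
      // MD5 is a required algorithm on every standard Java platform.
      throw new IllegalStateException("MD5 is not available", e);
    }
  }

  /**
   * Builds the headers for an append request. The MD5 header is added only
   * when checksum validation is enabled, so the default path pays no
   * hashing cost.
   */
  static Map<String, String> buildAppendHeaders(byte[] payload, int offset,
      int length, boolean checksumEnabled) {
    Map<String, String> headers = new LinkedHashMap<>();
    headers.put("Content-Length", Integer.toString(length));
    if (checksumEnabled) {
      headers.put(CONTENT_MD5, md5Base64(payload, offset, length));
    }
    return headers;
  }
}
```

The hash covers only the slice of the buffer that is actually sent, since the 
server computes its own hash over exactly the bytes it receives.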

All of this checksum validation is guarded behind a config. The config is 
disabled by default because, with "https", the integrity of data in transit is 
already preserved. It is introduced as an additional data integrity check, and 
enabling it will have a performance impact.

Users can enable or disable this check by setting the following config to 
*"true"* or *"false"* respectively. 
*Config: "fs.azure.enable.checksum.validation"*
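
For illustration, enabling the check through a Hadoop configuration file such 
as core-site.xml might look like the fragment below. This assumes the key is 
read like any other Hadoop boolean property; the description text is 
illustrative only.

```xml
<property>
  <name>fs.azure.enable.checksum.validation</name>
  <value>true</value>
  <description>Illustrative: turns on MD5-based validation of read and
    append payloads. Leave unset or false to keep the default.</description>
</property>
```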



--
This message was sent by Atlassian Jira
(v8.20.10#820010)