[
https://issues.apache.org/jira/browse/HADOOP-13887?focusedWorklogId=597348&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-597348
]
ASF GitHub Bot logged work on HADOOP-13887:
-------------------------------------------
Author: ASF GitHub Bot
Created on: 16/May/21 07:56
Start Date: 16/May/21 07:56
Worklog Time Spent: 10m
Work Description: mehakmeet commented on pull request #2706:
URL: https://github.com/apache/hadoop/pull/2706#issuecomment-841783066
Had merge conflicts so had to force push.
Tests:
```
[ERROR] Tests run: 1430, Failures: 1, Errors: 34, Skipped: 538
```
Scale:
```
[ERROR] Tests run: 151, Failures: 3, Errors: 21, Skipped: 29
```
Most errors are MultiPart upload related:
```
com.amazonaws.SdkClientException: Invalid part size: part sizes for
encrypted multipart uploads must be multiples of the cipher block size (16)
with the exception of the last part.
```
Simply adding 16(Padding length) to multipart upload block size won't work.
The part sizes need to be a multiple of 16, so it has that restriction for CSE.
Also, one more thing to note here is that it assumes the last part to be an
exception, which makes me believe that multipart upload in CSE has to be
sequential(or can we parallel upload the starting parts and then upload the
last part?)? So, potentially another constraint while uploading could have
performance impacts here apart from the HEAD calls being required while
downloading/listing.
@steveloughran
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]
Issue Time Tracking
-------------------
Worklog Id: (was: 597348)
Time Spent: 2h (was: 1h 50m)
> Encrypt S3A data client-side with AWS SDK (S3-CSE)
> --------------------------------------------------
>
> Key: HADOOP-13887
> URL: https://issues.apache.org/jira/browse/HADOOP-13887
> Project: Hadoop Common
> Issue Type: Sub-task
> Components: fs/s3
> Affects Versions: 2.8.0
> Reporter: Jeeyoung Kim
> Assignee: Igor Mazur
> Priority: Minor
> Labels: pull-request-available
> Attachments: HADOOP-13887-002.patch, HADOOP-13887-007.patch,
> HADOOP-13887-branch-2-003.patch, HADOOP-13897-branch-2-004.patch,
> HADOOP-13897-branch-2-005.patch, HADOOP-13897-branch-2-006.patch,
> HADOOP-13897-branch-2-008.patch, HADOOP-13897-branch-2-009.patch,
> HADOOP-13897-branch-2-010.patch, HADOOP-13897-branch-2-012.patch,
> HADOOP-13897-branch-2-014.patch, HADOOP-13897-trunk-011.patch,
> HADOOP-13897-trunk-013.patch, HADOOP-14171-001.patch, S3-CSE Proposal.pdf
>
> Time Spent: 2h
> Remaining Estimate: 0h
>
> Expose the client-side encryption option documented in Amazon S3
> documentation -
> http://docs.aws.amazon.com/AmazonS3/latest/dev/UsingClientSideEncryption.html
> Currently this is not exposed in Hadoop but it is exposed as an option in AWS
> Java SDK, which Hadoop currently includes. It should be trivial to propagate
> this as a parameter passed to the S3client used in S3AFileSystem.java
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]