[
https://issues.apache.org/jira/browse/HADOOP-12020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17554637#comment-17554637
]
Steve Loughran commented on HADOOP-12020:
-----------------------------------------
ooh, this breaks all the s3 select tests if you ask for reduced redundancy.
will file a jira
{code}
testSelectOddRecordsIgnoreHeaderV1(org.apache.hadoop.fs.s3a.select.ITestS3Select)
Time elapsed: 1.42 s <<< ERROR!
org.apache.hadoop.fs.s3a.AWSBadRequestException: SELECT * FROM S3OBJECT s WHERE
s._5 = 'TRUE' on
s3a://stevel-london/fork-0004/test/testSelectOddRecordsIgnoreHeaderV1.csv:
com.amazonaws.services.s3.model.AmazonS3Exception: We do not support
REDUCED_REDUNDANCY storage class. Please check the service documentation and
try again. (Service: Amazon S3; Status Code: 400; Error Code:
UnsupportedStorageClass; Request ID: 6P6ZCHS8Z0ZYXEK7; S3 Extended Request ID:
K382PGoq6l0YtKTgU3FmZyj6SsrTanbXRA5+BNIt4yqLPcB9Li97Lu2GeCBBsJLmnKbKdyeyRQI=;
Proxy: null), S3 Extended Request ID:
K382PGoq6l0YtKTgU3FmZyj6SsrTanbXRA5+BNIt4yqLPcB9Li97Lu2GeCBBsJLmnKbKdyeyRQI=:UnsupportedStorageClass:
We do not support REDUCED_REDUNDANCY storage class. Please check the service
documentation and try again. (Service: Amazon S3; Status Code: 400; Error Code:
UnsupportedStorageClass; Request ID: 6P6ZCHS8Z0ZYXEK7; S3 Extended Request ID:
K382PGoq6l0YtKTgU3FmZyj6SsrTanbXRA5+BNIt4yqLPcB9Li97Lu2GeCBBsJLmnKbKdyeyRQI=;
Proxy: null)
{code}
> Support AWS S3 reduced redundancy storage class
> -----------------------------------------------
>
> Key: HADOOP-12020
> URL: https://issues.apache.org/jira/browse/HADOOP-12020
> Project: Hadoop Common
> Issue Type: Sub-task
> Components: fs/s3
> Affects Versions: 2.7.0
> Environment: Hadoop on AWS
> Reporter: Yann Landrin-Schweitzer
> Assignee: Monthon Klongklaew
> Priority: Major
> Labels: pull-request-available
> Fix For: 3.3.4
>
> Time Spent: 3h 50m
> Remaining Estimate: 0h
>
> Amazon S3 uses, by default, the NORMAL_STORAGE class for s3 objects.
> This offers, according to Amazon's material, 99.99999999% reliability.
> For many applications, however, the 99.99% reliability offered by the
> REDUCED_REDUNDANCY storage class is amply sufficient, and comes with a
> significant cost saving.
> HDFS, when using the legacy s3n protocol, or the new s3a scheme, should
> support overriding the default storage class of created s3 objects so that
> users can take advantage of this cost benefit.
> This would require minor changes of the s3n and s3a drivers, using
> a configuration property fs.s3n.storage.class to override the default storage
> when desirable.
> This override could be implemented in Jets3tNativeFileSystemStore with:
> S3Object object = new S3Object(key);
> ...
> if(storageClass!=null) object.setStorageClass(storageClass);
> It would take a more complex form in s3a, e.g. setting:
> InitiateMultipartUploadRequest initiateMPURequest =
> new InitiateMultipartUploadRequest(bucket, key, om);
> if(storageClass !=null ) {
> initiateMPURequest =
> initiateMPURequest.withStorageClass(storageClass);
> }
> and similar statements in various places.
--
This message was sent by Atlassian Jira
(v8.20.7#820007)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]