[ 
https://issues.apache.org/jira/browse/HADOOP-19272?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Steve Loughran resolved HADOOP-19272.
-------------------------------------
    Resolution: Fixed

> S3A: AWS SDK 2.25.53 warnings logged about transfer manager not using CRT 
> client
> --------------------------------------------------------------------------------
>
>                 Key: HADOOP-19272
>                 URL: https://issues.apache.org/jira/browse/HADOOP-19272
>             Project: Hadoop Common
>          Issue Type: Bug
>          Components: fs/s3
>    Affects Versions: 3.4.0, 3.5.0
>            Reporter: Steve Loughran
>            Assignee: Steve Loughran
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 3.5.0, 3.4.2
>
>         Attachments: output.txt
>
>
> When an S3 transfer manager is created for renaming/download a new message is 
> logged telling off the caller for not using the CRT client.
> {code}
> 5645:2024-09-13 16:29:17,375 [setup] WARN  s3.S3TransferManager 
> (LoggerAdapter.java:warn(225)) - The provided S3AsyncClient is an instance of 
> MultipartS3AsyncClient, and thus multipart download feature is not enabled. 
> To benefit from all features, consider using 
> S3AsyncClient.crtBuilder().build() instead.
> {code}
> This is a change in the SDK to tell us developers off -yet it is visible to 
> end users who don't benefit from it and for which it only creates confusion.
> It appears to have been downgraded to debug in the AWS trunk code in PR "S3 
> Async Client - Multipart download (#5164) -but:
> * it is too late to upgrade and qualify a new version for 3.4.1; downgrading 
> is all we can do
> * there is no guarantee this log message or similar will reoccur.
> Plan
> 1. Revert from 3.4.1
> 2. lift code from cloudstore library which uses reflection to access and 
> manipulate log4j logs where present
> 3. downgrade all transfer manager log levels to NONE. 
> 4. File an AWS report about how this is an incompatible regression, identify 
> how their process can evolve, particularly in the area of code guidelines 
> about safe logging use.
> I also intend to tighten up our review process to support more rigorous 
> detection of new .warn() messages in the AWS SDK. I'm going to propose that 
> as well as requiring review of our test/CLI output, we require ripgrep scans 
> of .warn(/.error( in SDK source, audit of any new changes. by saving the 
> output of the previous iteration, it'll be straightforward to identify new 
> changes -but not changes in codepaths which change their frequency of 
> appearance.
> I think we should revisit whether or not to move off the xfer manager in the 
> past. We've discussed it in the past, and avoided it just due to maintenance 
> costs. However, it is pushing maintenance costs anyway.
> meanwhile: no new AWS SDK updates until we are confident we have our 
> processes under control.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: common-dev-unsubscr...@hadoop.apache.org
For additional commands, e-mail: common-dev-h...@hadoop.apache.org

Reply via email to