adamreeve commented on PR #47219: URL: https://github.com/apache/arrow/pull/47219#issuecomment-3134854354
This currently causes some test failures: https://github.com/apache/arrow/actions/runs/16586770398/job/46913552380?pr=47219#step:7:5290 Failing tests are: ``` [ FAILED ] TestS3FS.CopyFile [ FAILED ] TestS3FS.Move [ FAILED ] TestS3FSHTTPS.SSECustomerKeyMatch [ FAILED ] TestS3FSHTTPS.SSECustomerKeyMismatch [ FAILED ] TestS3FSHTTPS.SSECustomerKeyMissing [ FAILED ] TestS3FSHTTPS.SSECustomerKeyCopyFile [ FAILED ] TestS3FSGeneric.MoveFile [ FAILED ] TestS3FSGeneric.CopyFile [ FAILED ] TestS3FSGeneric.CopyFiles [ FAILED ] TestS3FSGeneric.SpecialChars ``` I started looking into the move/copy failures. Example of an HTTP request from the standard S3 client that works: ``` PUT /bucket/newfile HTTP/1.1\r\n Request Method: PUT Request URI: /bucket/newfile Request Version: HTTP/1.1 Host: 127.0.0.1:35423\r\n Accept: */*\r\n amz-sdk-invocation-id: 33994FF0-2F65-4FCE-88B0-9F3CF46E54E6\r\n amz-sdk-request: attempt=1\r\n […]authorization: AWS4-HMAC-SHA256 Credential=minio/20250730/ap-southeast-2/s3/aws4_request, SignedHeaders=amz-sdk-invocation-id;amz-sdk-request;content-length;content-type;host;x-amz-api-version;x-amz-content-sha256;x-amz-copy-source;x content-length: 0\r\n [Content length: 0] content-type: application/xml\r\n user-agent: aws-sdk-cpp/1.11.587 ua/2.1 api/S3 os/Linux#6.15.7-200.fc42.x86_64 lang/c++#C++11 md/aws-crt#0.32.8-dev+84e33412 md/arch#x86_64 md/GCC#15.1.1 m/Z,b,P\r\n x-amz-api-version: 2006-03-01\r\n x-amz-content-sha256: e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855\r\n x-amz-copy-source: bucket/somefile\r\n x-amz-date: 20250730T043553Z\r\n \r\n ``` But with the S3-CRT client, it sends a HEAD request that doesn't have the bucket name in the URI but it's prepended to the host ``` HEAD /somefile HTTP/1.1\r\n Host: bucket.0.0.1:41883\r\n User-Agent: CRTS3NativeClient/0.1.x platform/unknown\r\n X-Amz-Date: 20250730T044539Z\r\n x-amz-content-sha256: UNSIGNED-PAYLOAD\r\n Authorization: AWS4-HMAC-SHA256 Credential=minio/20250730/ap-southeast-2/s3/aws4_request, SignedHeaders=host;x-amz-content-sha256;x-amz-date, Signature=1d61d8dd959fe984e3d3ecaf4ebbd670c05485e277b3861d1e4fbdc26ef24900\r\n \r\n ``` Minio responds with a 404: ``` HTTP/1.1 404 Not Found\r\n Accept-Ranges: bytes\r\n Content-Length: 0\r\n Server: MinIO\r\n Strict-Transport-Security: max-age=31536000; includeSubDomains\r\n Vary: Origin\r\n Vary: Accept-Encoding\r\n X-Amz-Id-2: c96c702f27084562bfa0b12e8b5bc8aca70aebe488ce0201c23ff563c1935e13\r\n X-Amz-Request-Id: 1856EDF47B209DD2\r\n X-Content-Type-Options: nosniff\r\n X-Minio-Error-Code: NoSuchBucket\r\n X-Minio-Error-Desc: "The specified bucket does not exist"\r\n X-Ratelimit-Limit: 12117\r\n X-Ratelimit-Remaining: 12117\r\n X-Xss-Protection: 1; mode=block\r\n Date: Wed, 30 Jul 2025 04:45:39 GMT\r\n \r\n ``` -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
