adamreeve commented on PR #47219:
URL: https://github.com/apache/arrow/pull/47219#issuecomment-3134854354

   This currently causes some test failures: 
https://github.com/apache/arrow/actions/runs/16586770398/job/46913552380?pr=47219#step:7:5290
   
   Failing tests are:
   ```
   [  FAILED  ] TestS3FS.CopyFile
   [  FAILED  ] TestS3FS.Move
   [  FAILED  ] TestS3FSHTTPS.SSECustomerKeyMatch
   [  FAILED  ] TestS3FSHTTPS.SSECustomerKeyMismatch
   [  FAILED  ] TestS3FSHTTPS.SSECustomerKeyMissing
   [  FAILED  ] TestS3FSHTTPS.SSECustomerKeyCopyFile
   [  FAILED  ] TestS3FSGeneric.MoveFile
   [  FAILED  ] TestS3FSGeneric.CopyFile
   [  FAILED  ] TestS3FSGeneric.CopyFiles
   [  FAILED  ] TestS3FSGeneric.SpecialChars
   ```
   
   I started looking into the move/copy failures.
   
   Example of an HTTP request from the standard S3 client that works:
   ```
   PUT /bucket/newfile HTTP/1.1\r\n
       Request Method: PUT
       Request URI: /bucket/newfile
       Request Version: HTTP/1.1
   Host: 127.0.0.1:35423\r\n
   Accept: */*\r\n
   amz-sdk-invocation-id: 33994FF0-2F65-4FCE-88B0-9F3CF46E54E6\r\n
   amz-sdk-request: attempt=1\r\n
    […]authorization: AWS4-HMAC-SHA256 
Credential=minio/20250730/ap-southeast-2/s3/aws4_request, 
SignedHeaders=amz-sdk-invocation-id;amz-sdk-request;content-length;content-type;host;x-amz-api-version;x-amz-content-sha256;x-amz-copy-source;x
   content-length: 0\r\n
       [Content length: 0]
   content-type: application/xml\r\n
   user-agent: aws-sdk-cpp/1.11.587 ua/2.1 api/S3 
os/Linux#6.15.7-200.fc42.x86_64 lang/c++#C++11 md/aws-crt#0.32.8-dev+84e33412 
md/arch#x86_64 md/GCC#15.1.1 m/Z,b,P\r\n
   x-amz-api-version: 2006-03-01\r\n
   x-amz-content-sha256: 
e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855\r\n
   x-amz-copy-source: bucket/somefile\r\n
   x-amz-date: 20250730T043553Z\r\n
   \r\n
   ```
   
   But with the S3-CRT client, it sends a HEAD request that doesn't have the 
bucket name in the URI but it's prepended to the host
   ```
   HEAD /somefile HTTP/1.1\r\n
   Host: bucket.0.0.1:41883\r\n
   User-Agent: CRTS3NativeClient/0.1.x platform/unknown\r\n
   X-Amz-Date: 20250730T044539Z\r\n
   x-amz-content-sha256: UNSIGNED-PAYLOAD\r\n
   Authorization: AWS4-HMAC-SHA256 
Credential=minio/20250730/ap-southeast-2/s3/aws4_request, 
SignedHeaders=host;x-amz-content-sha256;x-amz-date, 
Signature=1d61d8dd959fe984e3d3ecaf4ebbd670c05485e277b3861d1e4fbdc26ef24900\r\n
   \r\n
   ```
   
   Minio responds with a 404:
   ```
   HTTP/1.1 404 Not Found\r\n
   Accept-Ranges: bytes\r\n
   Content-Length: 0\r\n
   Server: MinIO\r\n
   Strict-Transport-Security: max-age=31536000; includeSubDomains\r\n
   Vary: Origin\r\n
   Vary: Accept-Encoding\r\n
   X-Amz-Id-2: 
c96c702f27084562bfa0b12e8b5bc8aca70aebe488ce0201c23ff563c1935e13\r\n
   X-Amz-Request-Id: 1856EDF47B209DD2\r\n
   X-Content-Type-Options: nosniff\r\n
   X-Minio-Error-Code: NoSuchBucket\r\n
   X-Minio-Error-Desc: "The specified bucket does not exist"\r\n
   X-Ratelimit-Limit: 12117\r\n
   X-Ratelimit-Remaining: 12117\r\n
   X-Xss-Protection: 1; mode=block\r\n
   Date: Wed, 30 Jul 2025 04:45:39 GMT\r\n
   \r\n
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to