Mikhail Tolmachev created ARROW-14640:
-----------------------------------------
Summary: reading data from S3
Key: ARROW-14640
URL: https://issues.apache.org/jira/browse/ARROW-14640
Project: Apache Arrow
Issue Type: Bug
Components: R
Affects Versions: 6.0.0
Reporter: Mikhail Tolmachev
I am trying to read data directly from S3. In my work pipeline I work under
proxy, so I set system environments HTTP_PROXY and HTTPS_PROXY. Also I have set
AWS_ACCESS_KEY_ID and AWS_SECRET_ACCESS_KEY. I have constructed links for
reading:
{code:java}
s3_uri <- paste0("s3://", accessKeyId, ":", URLencode(secretAccessKey, reserved
= T),"@", bucketName, "/", path)
s3_uri_short <- paste0("s3://", bucketName, "/", path)
bucketS3 <- arrow::s3_bucket(bucket = bucketName, access_key = accessKeyId,
secret_key = URLencode(secretAccessKey, reserved =
T)){code}
But none of them worked for arrow::read_parquet
{code:java}
df <- arrow::read_parquet(s3_uri)
df <- arrow::read_parquet(s3_uri_short)
df <- arrow::read_parquet(bucketS3$path(x = path)){code}
Error
{code:java}
IOError: When reading information for key AWS Error [code 15]: No response
body. with address : {code}
How can I fix the problem or maybe get more informative output ? Thanks for
attention.
I knew about same issue https://issues.apache.org/jira/browse/ARROW-12126 but I
can't reopen it.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)