[
https://issues.apache.org/jira/browse/ARROW-12026?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17307288#comment-17307288
]
Karthikeyan Janakiraman edited comment on ARROW-12026 at 3/23/21, 6:03 PM:
---------------------------------------------------------------------------
Hey [~npr], thanks for the suggestion. I'm not sure I got the syntax right,
so it would be great if you could verify it. I tried it in two different
ways and got the respective errors below.
{code:java}
> df <- read_parquet("s3://my_bucket/test-parquet/refinement.parquet?scheme=http&endpoint_override=my_proxy.com%3A9099")
Error: IOError: Path does not exist 'my_bucket/test-parquet/refinement.parquet'
> df <- read_parquet("s3://my_bucket/test-parquet/refinement.parquet?region=eu-west-1?scheme=http&endpoint_override=my_proxy.com%3A9099")
Error: IOError: When reading information for key
'test-parquet/refinement.parquet' in bucket 'my_bucket': AWS Error [code 99]:
Unable to connect to endpoint with address : 172.22.65.50
>
{code}
All I need is for the connection to reach AWS from my local machine via the
proxy that I specify.
Note: the first command above returns a "Path does not exist" error, but I
can confirm that the path exists in the bucket.
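One detail that may matter in the second call: its URI contains two "?" characters. Under general URI syntax the query starts at the first "?", and any later "?" is an ordinary character, so "scheme=http" may end up inside the region value instead of being read as its own option. The base R sketch below only illustrates that splitting. `parse_query` is a hypothetical helper, not part of arrow, and whether Arrow's own URI parser behaves the same way is an assumption.

```r
# Hypothetical helper (not an arrow function): split a URI's query string
# into named key/value pairs using base R only.
parse_query <- function(uri) {
  # Drop everything up to and including the FIRST "?"; later "?" stay in the query.
  query <- sub("^[^?]*\\?", "", uri)
  # Options are separated by "&".
  items <- strsplit(query, "&", fixed = TRUE)[[1]]
  # Split each option at its first "=" only.
  keys <- sub("=.*$", "", items)
  raw_values <- sub("^[^=]*=", "", items)
  # Percent-decode each value (e.g. "%3A" becomes ":").
  values <- vapply(raw_values, utils::URLdecode, character(1), USE.NAMES = FALSE)
  setNames(values, keys)
}

uri <- paste0(
  "s3://my_bucket/test-parquet/refinement.parquet",
  "?region=eu-west-1?scheme=http&endpoint_override=my_proxy.com%3A9099"
)
parsed <- parse_query(uri)
print(parsed)
```

Here `parsed` has only two entries, `region` and `endpoint_override`, and the `region` value is "eu-west-1?scheme=http". Replacing the second "?" with "&" would yield three separate options.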
> [R] NotImplemented: Got S3 URI but Arrow compiled without S3 support
> --------------------------------------------------------------------
>
> Key: ARROW-12026
> URL: https://issues.apache.org/jira/browse/ARROW-12026
> Project: Apache Arrow
> Issue Type: Bug
> Components: R
> Affects Versions: 3.0.0
> Environment: QA
> Reporter: Karthikeyan Janakiraman
> Priority: Trivial
>
> I followed the steps below, but I get the error in the summary when I try to
> read a Parquet file from S3.
>
> 1. export LIBARROW_MINIMAL=false
> {code:java}
> [root@c1cce557dba3 tmp]# printenv | grep LIBARROW_MINIMAL
> LIBARROW_MINIMAL=false{code}
>
> 2. Install arrow
> {code:java}
> R CMD INSTALL arrow_3.0.0.tar.gz
> {code}
>
> 3. Start an R prompt and load arrow
> {code:java}
> > library('arrow')
> Attaching package: ‘arrow’
>
> The following object is masked from ‘package:utils’:
>
>     timestamp
> >
> {code}
> 4. When I try to read a Parquet file from an S3 bucket, I see the error below:
>
> {code:java}
> > df <- read_parquet("s3://my_bucket/test-parquet/refinement.parquet")
> Error: NotImplemented: Got S3 URI but Arrow compiled without S3 support
> {code}
>