[ 
https://issues.apache.org/jira/browse/ARROW-12026?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17307288#comment-17307288
 ] 

Karthikeyan Janakiraman edited comment on ARROW-12026 at 3/23/21, 6:03 PM:
---------------------------------------------------------------------------

Hey [~npr], Thanks for the suggestion. Not sure if I got it right on the 
syntax, it will be great if you could verify it. I tried it in two different 
ways and seeing the respective errors below. 
{code:java}
> df <- 
> read_parquet("s3://my_bucket/test-parquet/refinement.parquet?scheme=http&endpoint_override=my_proxy.com%3A9099")
Error: IOError: Path does not exist 'my_bucket/test-parquet/refinement.parquet'
> df <- 
> read_parquet("s3://my_bucket/test-parquet/refinement.parquet?region=eu-west-1?scheme=http&endpoint_override=my_proxy.com%3A9099")
Error: IOError: When reading information for key 
'test-parquet/refinement.parquet' in bucket 'my_bucket': AWS Error [code 99]: 
Unable to connect to endpoint with address : 172.22.65.50
> 

{code}
All I need is the connection to reach out to AWS from my local via the proxy 
that I specify. 

Note - There is a path does not exist error for one of the above command, but I 
can confirm the path exist in the bucket. 


was (Author: karthik_j):
Hey [~npr], Thanks for the suggestion. Not sure if I got it right on the 
syntax, it will be great if you could verify it. I tried it in two different 
ways and seeing the respective errors below. 


{code:java}
> df <- 
> read_parquet("s3://my_bucket/test-parquet/refinement.parquet?scheme=http&endpoint_override=my_proxy.com%3A9099")
Error: IOError: Path does not exist 
'cof-st-mosaic-apps-qa-eea-legoland-eu-west-1/test-parquet/refinement.parquet'
> df <- 
> read_parquet("s3://my_bucket/test-parquet/refinement.parquet?region=eu-west-1?scheme=http&endpoint_override=my_proxy.com%3A9099")
Error: IOError: When reading information for key 
'test-parquet/refinement.parquet' in bucket 'my_bucket': AWS Error [code 99]: 
Unable to connect to endpoint with address : 172.22.65.50
> 

{code}

All I need is the connection to reach out to AWS from my local via the proxy 
that I specify. 

Note - There is a path does not exist error for one of the above command, but I 
can confirm the path exist in the bucket. 

> [R] NotImplemented: Got S3 URI but Arrow compiled without S3 support
> --------------------------------------------------------------------
>
>                 Key: ARROW-12026
>                 URL: https://issues.apache.org/jira/browse/ARROW-12026
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: R
>    Affects Versions: 3.0.0
>         Environment: QA
>            Reporter: Karthikeyan Janakiraman
>            Priority: Trivial
>
> I have followed below steps however seeing the error on summary when I try to 
> read parquet from S3. 
>  
> 1. export LIBARROW_MINIMAL=false
> {code:java}
> [root@c1cce557dba3 tmp]# printenv | grep LIBARROW_MINIMAL
> LIBARROW_MINIMAL=false{code}
>  
> 2. Install arrow 
> {code:java}
> R CMD INSTALL arrow_3.0.0.tar.gz
> {code}
>   
> 3. Get into R prompt and load arrow
> {code:java}
> > library('arrow')
> Attaching package: ‘arrow’The following object is masked from 
> ‘package:utils’:    timestamp
> > 
> {code}
> 4. When I try to read a parquet from S3 bucket seeing below error, 
>  
> {code:java}
> > df <- read_parquet("s3://my_bucket/test-parquet/refinement.parquet")
> Error: NotImplemented: Got S3 URI but Arrow compiled without S3 support
> {code}
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to