[
https://issues.apache.org/jira/browse/ARROW-12026?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17306992#comment-17306992
]
Karthikeyan Janakiraman commented on ARROW-12026:
-------------------------------------------------
I am running into a new issue now; I can raise a separate ticket to track it if
required. I need to use a proxy to connect to AWS S3 from my environment.
I have set the proxy using Sys.setenv in the R terminal, but I still
cannot reach S3. Could you please advise? Will read_parquet use the proxy
to connect to S3?
{code:java}
> Sys.setenv(http_proxy="http://proxy:9099")
> Sys.setenv(https_proxy="http://proxy:9099")
> Sys.setenv(HTTPS_PROXY="http://proxy:9099")
> Sys.setenv(HTTP_PROXY="http://proxy:9099")
> Sys.getenv("http_proxy")
[1] "http://proxy:9099"
> df <- read_parquet("s3://my_bucket/test-parquet/refinement.parquet?region=eu-west-1")
Error: IOError: When reading information for key 'test-parquet/refinement.parquet' in bucket 'my_bucket': AWS Error [code 99]: Unable to connect to endpoint with address : 52.218.57.8
>
{code}
> [R] NotImplemented: Got S3 URI but Arrow compiled without S3 support
> --------------------------------------------------------------------
>
> Key: ARROW-12026
> URL: https://issues.apache.org/jira/browse/ARROW-12026
> Project: Apache Arrow
> Issue Type: Bug
> Components: R
> Affects Versions: 3.0.0
> Environment: QA
> Reporter: Karthikeyan Janakiraman
> Priority: Trivial
>
> I followed the steps below, but I see the error in the summary when I try to
> read a Parquet file from S3.
>
> 1. export LIBARROW_MINIMAL=false
> {code:java}
> [root@c1cce557dba3 tmp]# printenv | grep LIBARROW_MINIMAL
> LIBARROW_MINIMAL=false{code}
>
> 2. Install arrow
> {code:java}
> R CMD INSTALL arrow_3.0.0.tar.gz
> {code}
>
> 3. Start an R prompt and load arrow
> {code:java}
> > library('arrow')
> Attaching package: ‘arrow’
> The following object is masked from ‘package:utils’:
>     timestamp
> >
> {code}
> 4. When I try to read a Parquet file from the S3 bucket, I see the error below:
>
> {code:java}
> > df <- read_parquet("s3://my_bucket/test-parquet/refinement.parquet")
> Error: NotImplemented: Got S3 URI but Arrow compiled without S3 support
> {code}
>