[ 
https://issues.apache.org/jira/browse/ARROW-9261?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Roee Shlomo updated ARROW-9261:
-------------------------------
    Description: 
https://issues.apache.org/jira/browse/ARROW-9109 introduced S3 support in 
manylinux wheels. However, when trying to use S3FileSystem it fails with

 
{code:java}
Traceback (most recent call last):
 File "<string>", line 1, in <module>
 File "pyarrow/_fs.pyx", line 597, in pyarrow._fs.FileSystem.open_input_stream
 File "pyarrow/error.pxi", line 122, in 
pyarrow.lib.pyarrow_internal_check_status
 File "pyarrow/error.pxi", line 99, in pyarrow.lib.check_status
OSError: When reading information for key 'xxxxx' in bucket 'xxxxx': AWS Error 
[code 99]: curlCode: 77, Problem with the SSL CA cert (path? access rights?) 
with address{code}
It seems like it can't find the SSL CA cert directory that is installed in the 
runtime machine (tested on Ubuntu 16.04 and Ubuntu 18.04). It always searches 
in /etc/pki/tls/certs/ca-bundle.crt probably because the wheels are built on 
centos, whereas in Ubuntu the path is /etc/ssl/certs/ca-certificates.crt and is 
different on other distributions.

Reproduce with:
{code:java}
virtualenv -p python3.8 arrowenv
source arrowenv/bin/activate
pip install --extra-index-url https://repo.fury.io/arrow-nightlies/ --pre 
pyarrow
python -c "from pyarrow.fs import S3FileSystem; fs = S3FileSystem(); 
fs.open_input_stream('mybucket/myfile')"{code}
 

 

  was:
https://issues.apache.org/jira/browse/ARROW-9109 introduced S3 support in 
manylinux wheels. However, when trying to use S3FileSystem it fails with

 
{code:java}
Traceback (most recent call last):
 File "<string>", line 1, in <module>
 File "pyarrow/_fs.pyx", line 597, in pyarrow._fs.FileSystem.open_input_stream
 File "pyarrow/error.pxi", line 122, in 
pyarrow.lib.pyarrow_internal_check_status
 File "pyarrow/error.pxi", line 99, in pyarrow.lib.check_status
OSError: When reading information for key 'xxxxx' in bucket 'xxxxx': AWS Error 
[code 99]: curlCode: 77, Problem with the SSL CA cert (path? access rights?) 
with address{code}
 

It seems like it can't find the SSL CA cert directory that is installed in the 
runtime machine (tested on Ubuntu 16.04 and Ubuntu 18.04). 

Reproduce with:

 
{code:java}
virtualenv -p python3.8 arrowenv
source arrowenv/bin/activate
pip install --extra-index-url https://repo.fury.io/arrow-nightlies/ --pre 
pyarrow
python -c "from pyarrow.fs import S3FileSystem; fs = S3FileSystem(); 
fs.open_input_stream('mybucket/myfile')"{code}
 

 


> [Python][Packaging] S3FileSystem curl errors in manylinux wheels
> ----------------------------------------------------------------
>
>                 Key: ARROW-9261
>                 URL: https://issues.apache.org/jira/browse/ARROW-9261
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: Packaging, Python
>            Reporter: Roee Shlomo
>            Priority: Major
>
> https://issues.apache.org/jira/browse/ARROW-9109 introduced S3 support in 
> manylinux wheels. However, when trying to use S3FileSystem it fails with
>  
> {code:java}
> Traceback (most recent call last):
>  File "<string>", line 1, in <module>
>  File "pyarrow/_fs.pyx", line 597, in pyarrow._fs.FileSystem.open_input_stream
>  File "pyarrow/error.pxi", line 122, in 
> pyarrow.lib.pyarrow_internal_check_status
>  File "pyarrow/error.pxi", line 99, in pyarrow.lib.check_status
> OSError: When reading information for key 'xxxxx' in bucket 'xxxxx': AWS 
> Error [code 99]: curlCode: 77, Problem with the SSL CA cert (path? access 
> rights?) with address{code}
> It seems like it can't find the SSL CA cert directory that is installed in 
> the runtime machine (tested on Ubuntu 16.04 and Ubuntu 18.04). It always 
> searches in /etc/pki/tls/certs/ca-bundle.crt probably because the wheels are 
> built on centos, whereas in Ubuntu the path is 
> /etc/ssl/certs/ca-certificates.crt and is different on other distributions.
> Reproduce with:
> {code:java}
> virtualenv -p python3.8 arrowenv
> source arrowenv/bin/activate
> pip install --extra-index-url https://repo.fury.io/arrow-nightlies/ --pre 
> pyarrow
> python -c "from pyarrow.fs import S3FileSystem; fs = S3FileSystem(); 
> fs.open_input_stream('mybucket/myfile')"{code}
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to