[ 
https://issues.apache.org/jira/browse/ARROW-7208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16978177#comment-16978177
 ] 

Joris Van den Bossche commented on ARROW-7208:
----------------------------------------------

The {{ParquetFile}} object expects a single file, not a directory of files (the 
{{read_table}} function can handle both). 
If you want to use the object interface for a directory of files, you need to 
use {{pq.ParquetDataset}} instead.

A better error message would be useful though.

> Arrow using ParquetFile class
> -----------------------------
>
>                 Key: ARROW-7208
>                 URL: https://issues.apache.org/jira/browse/ARROW-7208
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: Python
>    Affects Versions: 0.15.1
>            Reporter: Roelant Stegmann
>            Priority: Major
>
> We are somehow seeing the same errors. We are working with pyarrow 0.15.1, trying to 
> access a folder of `parquet` files generated with Amazon Athena.
> ```python
> table2 = pq.read_table('C:/Data/test-parquet')
> ```
> works fine in contrast to
> ```python
> parquet_file = pq.ParquetFile('C:/Data/test-parquet')
> # parquet_file.read_row_group(0)
> ```
> which raises
> `ArrowIOError: Failed to open local file 'C:/Data/test-parquet', error: 
> Access is denied.`



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
