[
https://issues.apache.org/jira/browse/ARROW-7208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16978177#comment-16978177
]
Joris Van den Bossche commented on ARROW-7208:
----------------------------------------------
The {{ParquetFile}} object expects a single file, not a directory of files (the
{{read_table}} can handle both).
If you want to use the object interface for a directory of files, you need to
use {{pq.ParquetDataset}}.
A better error message would be useful though.
> Arrow using ParquetFile class
> -----------------------------
>
> Key: ARROW-7208
> URL: https://issues.apache.org/jira/browse/ARROW-7208
> Project: Apache Arrow
> Issue Type: Bug
> Components: Python
> Affects Versions: 0.15.1
> Reporter: Roelant Stegmann
> Priority: Major
>
> Somehow have the same errors. We are working with pyarrow 0.15.1, trying to
> access a folder of `parquet` files generated with Amazon Athena.
> ```python
> table2 = pq.read_table('C:/Data/test-parquet')
> ```
> works fine in contrast to
> ```python
> parquet_file = pq.ParquetFile('C:/Data/test-parquet')
> # parquet_file.read_row_group(0)
> ```
> which raises
> `ArrowIOError: Failed to open local file 'C:/Data/test-parquet', error:
> Access is denied.`
--
This message was sent by Atlassian Jira
(v8.3.4#803005)