[
https://issues.apache.org/jira/browse/PARQUET-1343?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16527609#comment-16527609
]
Uwe L. Korn commented on PARQUET-1343:
--------------------------------------
This sounds like your file really got corrupted. When it was working some days
ago and you did not change artefact versions that this rather sounds like a
data problem.
> Unable to read a parquet file
> ------------------------------
>
> Key: PARQUET-1343
> URL: https://issues.apache.org/jira/browse/PARQUET-1343
> Project: Parquet
> Issue Type: Bug
> Environment: Linux x86
> anaconda python 3.6
> pyarrow version 0.9.0
> Reporter: Alok
> Priority: Minor
>
> I am unable to read a parquet file that was made after converting a csv to a
> parquet file using pyarrow. Following is the error
> ArrowIOError Traceback (most recent call last) <timed exec> in <module>()
> ~/anaconda3/lib/python3.6/site-packages/pyarrow/parquet.py in
> read_table(source, columns, nthreads, metadata, use_pandas_metadata) 937
> return fs.read_parquet(source, columns=columns, metadata=metadata) 938 -->
> 939 pf = ParquetFile(source, metadata=metadata) 940 return
> pf.read(columns=columns, nthreads=nthreads, 941
> use_pandas_metadata=use_pandas_metadata)
> ~/anaconda3/lib/python3.6/site-packages/pyarrow/parquet.py in __init__(self,
> source, metadata, common_metadata) 62 self.reader = ParquetReader() 63
> source = _ensure_file(source) ---> 64 self.reader.open(source,
> metadata=metadata) 65 self.common_metadata = common_metadata 66
> self._nested_paths_by_prefix = self._build_nested_paths() _parquet.pyx in
> pyarrow._parquet.ParquetReader.open() error.pxi in pyarrow.lib.check_status()
> ArrowIOError: Invalid parquet file. Corrupt footer.
>
> I was able to read and write parquet file earlier (about a few days ago) and
> how its stopped working
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)