Emiel Botman created ARROW-12007:
------------------------------------
Summary: Loading parquet file returns "Invalid UTF8 payload" error
Key: ARROW-12007
URL: https://issues.apache.org/jira/browse/ARROW-12007
Project: Apache Arrow
Issue Type: Bug
Components: Format
Affects Versions: 3.0.0
Reporter: Emiel Botman
While loading one specific Parquet file with arrow::read_parquet(file = file),
the following error is returned:
Error in parquet___arrow___FileReader__ReadTable1(self) :
Invalid: Invalid UTF8 payload
I was able to load several other Parquet files; only this specific file
fails, so I presume the cause is something in the contents of this file. Is
there any known bug in the handling of UTF-8 encoding?
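
For illustration only, here is a minimal sketch of what an invalid UTF-8 payload looks like at the byte level. It assumes, without confirmation from the failing file, that a string column holds text written in another encoding such as Latin-1. The helper name first_invalid_utf8 and the sample values are hypothetical; the code uses only the Python standard library and does not call Arrow.

```python
from typing import Optional


def first_invalid_utf8(data: bytes) -> Optional[int]:
    """Return the byte offset of the first invalid UTF-8 sequence, or None."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError as err:
        # err.start is the index of the first byte that could not be decoded.
        return err.start
    return None


# Hypothetical sample values, not taken from the failing file.
valid = "café".encode("utf-8")      # b'caf\xc3\xa9', well-formed UTF-8
latin1 = "café".encode("latin-1")   # b'caf\xe9', not well-formed UTF-8

print(first_invalid_utf8(valid))    # None
print(first_invalid_utf8(latin1))   # 3
```

If the values in the file were written this way, that might explain why only this file fails while the others load.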