tschaub opened a new issue, #38503: URL: https://github.com/apache/arrow/issues/38503
### Describe the bug, including details regarding any error messages, version, and platform.

I'm uncertain if this is user error, an issue with the Go packages, or an issue with the C++ reader. I'll work on trying to put together a more minimal reproduction, but for now, I've put together a test that demonstrates the issue here: https://github.com/tschaub/parquet-issue

I'm trying to use the `pqarrow` package to read an input Parquet file, transform some of the data, and write an output Parquet file. In the [linked test case](https://github.com/tschaub/parquet-issue), there is no transformation step. So the test uses a `pqarrow.FileReader`, gets a `pqarrow.RowGroupReader` for each row group, reads each column as an `arrow.Chunked`, and uses a `pqarrow.ArrowColumnWriter` to write out the same.

When I try to use the C++ `parquet-reader` to read in the output file, I see the following error:

```shell
# parquet-reader output.parquet > /dev/null
Parquet error: Malformed levels. min: 2 max: 2 out of range. Max Level: 1
```

This same test passes for other Parquet files. I'm trying to narrow down the issue by filtering the data to only include a subset of the columns or just a few rows, but I haven't succeeded in creating a subset of the data that reproduces the issue yet (in part because I don't have good tools to filter the data without altering the schema).

### Component(s)

Go, Parquet

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at: [email protected]
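For context on the error message quoted in the report, the following is a minimal Go sketch of the kind of range check that produces a "levels out of range" failure. It is an illustration only, not the C++ reader's actual code: the function name `checkLevels` and the error wording are hypothetical. It assumes that levels are small integers and that a column's levels must lie between 0 and that column's maximum level, which is 1 for an optional, non-nested column.

```go
package main

import "fmt"

// checkLevels is a hypothetical helper: it returns an error when any level
// falls outside the inclusive range [0, maxLevel].
func checkLevels(levels []int16, maxLevel int16) error {
	if len(levels) == 0 {
		return nil
	}
	lo, hi := levels[0], levels[0]
	for _, l := range levels[1:] {
		if l < lo {
			lo = l
		}
		if l > hi {
			hi = l
		}
	}
	if lo < 0 || hi > maxLevel {
		return fmt.Errorf("malformed levels: min %d, max %d out of range, max level %d", lo, hi, maxLevel)
	}
	return nil
}

func main() {
	// Levels 0 and 1 are valid when the maximum level is 1.
	fmt.Println(checkLevels([]int16{0, 1, 1}, 1))
	// Levels of 2 exceed a maximum level of 1, as in the reported error.
	fmt.Println(checkLevels([]int16{2, 2}, 1))
}
```

Under these assumptions, levels of 2 against a maximum of 1 would suggest that the levels were written for a more deeply nested or more optional column than the one the reader's schema describes.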
