Andy Grove created ARROW-10999:
----------------------------------
Summary: [Rust] TPC-H parquet files cannot be read by Apache Spark
Key: ARROW-10999
URL: https://issues.apache.org/jira/browse/ARROW-10999
Project: Apache Arrow
Issue Type: Bug
Components: Rust
Reporter: Andy Grove
Assignee: Andy Grove
Fix For: 3.0.0
The TPC-H parquet files generated by the benchmark crate cannot be read by
Apache Spark because they use unsigned ints, which cannot be read in Spark (I
am guessing because Java only has signed ints).
I would like to use the same data sets for benchmarking DataFusion, Apache
Spark, and other tools.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)