Andy Grove created ARROW-10999:
----------------------------------

             Summary: [Rust] TPC-H parquet files cannot be read by Apache Spark
                 Key: ARROW-10999
                 URL: https://issues.apache.org/jira/browse/ARROW-10999
             Project: Apache Arrow
          Issue Type: Bug
          Components: Rust
            Reporter: Andy Grove
            Assignee: Andy Grove
             Fix For: 3.0.0


The TPC-H parquet files generated by the benchmark crate cannot be read by 
Apache Spark because they use unsigned ints, which cannot be read in Spark (I 
am guessing because Java only has signed ints).

I would  like to use the same data sets for benchmarking DataFusion, Apache 
Spark, and other tools.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to