Dandandan opened a new pull request #9010: URL: https://github.com/apache/arrow/pull/9010
Some performance improvements for the csv writer * Use lexical core for numeric types * Allow setting batch size in convert (slightly faster reading) * Avoid allocation of vec PR: `cargo run --release --bin tpch -- convert --input /home/danielheres/Code/gdd/arrow/rust/benchmarks/tpch-dbgen --output ./output --format csv -s 20000` Orders / lineitems: ``` Conversion completed in 2050 ms Conversion completed in 16955 ms ``` Master `cargo run --release --bin tpch -- convert --input /home/danielheres/Code/gdd/arrow/rust/benchmarks/tpch-dbgen --output ./output --format csv` ``` Conversion completed in 2336 ms Conversion completed in 19070 ms ``` ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected]
