jhorstmann opened a new pull request, #7823: URL: https://github.com/apache/arrow-rs/pull/7823
# Which issue does this PR close?

Prerequisite for investigating parquet writing performance (#7822).

# Rationale for this change

The benchmark should measure the CPU overhead of parquet writing, not the OS or filesystem parts of it. Running the benchmark showed that writing to the file adds nearly 50% overhead, which makes profiling more difficult by hiding the bottlenecks inside the parquet code itself.

# What changes are included in this PR?

Use a `Vec` instead of an unbuffered `File` as the sink.

# Are these changes tested?

Tested by running the benchmark.

# Are there any user-facing changes?

No
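
As a rough illustration of the change described under "What changes are included in this PR?", here is a minimal standard-library sketch of writing to an in-memory `Vec<u8>` instead of a `File`. It is not the benchmark code from this PR: `write_chunks` and `encode_to_memory` are hypothetical names, and `write_chunks` only stands in for a writer that is generic over `std::io::Write`.

```rust
use std::io::{self, Write};

/// Hypothetical stand-in for an encoder: writes `chunks` copies of `chunk`
/// to any sink that implements `Write`.
fn write_chunks<W: Write>(sink: &mut W, chunk: &[u8], chunks: usize) -> io::Result<()> {
    for _ in 0..chunks {
        // With an unbuffered `File`, each call here could become a syscall;
        // with a `Vec<u8>` it only appends to memory.
        sink.write_all(chunk)?;
    }
    sink.flush()
}

/// Runs the stand-in encoder against an in-memory sink and returns the bytes.
/// Assumption: the expected output size is known, so the buffer is preallocated.
fn encode_to_memory(chunk: &[u8], chunks: usize) -> io::Result<Vec<u8>> {
    let mut sink: Vec<u8> = Vec::with_capacity(chunk.len() * chunks);
    write_chunks(&mut sink, chunk, chunks)?;
    Ok(sink)
}
```

Because `Vec<u8>` implements `Write`, code that is generic over the sink needs no other changes, and the measured time no longer includes OS or filesystem work.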
