lidavidm commented on issue #13980: URL: https://github.com/apache/arrow/issues/13980#issuecomment-1228389797
The benchmarks are nowhere near comparable. The C++ version spawns 16 threads, as you can see in the output. The Java version spawns one thread per Endpoint, and defaults to two endpoints. The Java version uses smaller batches (4095 * 4 * 8 ~= 128 KiB) whereas the C++ one uses ~256 KiB. The Java version also appears to suffer from JVM warmup. The Java thread also isn't clear, but it appears to be summing the two threads together. It does seem the per-thread performance is not quite as good, but there's so many differences between the two benchmarks that I wouldn't take this as anything remotely definitive. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
