nealrichardson commented on pull request #8650: URL: https://github.com/apache/arrow/pull/8650#issuecomment-756304407
Running benchmarks on the character vectors, using 10 iterations the performance is basically the same as on master. I haven't tested factors, but that file does test integer, numeric, integer64, and character, with variation in the number of NAs. So, something with equivalent performance, but that allows chunking and parallelization by columns (which the old converter does not, IIRC), would in practice have significant benefits. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected]
