MaxGekk commented on pull request #31858: URL: https://github.com/apache/spark/pull/31858#issuecomment-800870879
> I believe it's better to use default buffer size for stability, I agree. > potentially better performance, etc. in any event. It would be nice to re-run CSV benchmarks. Though we can do that separately from this PR. > It will also fix uniVocity/univocity-parsers#449 Could you update PR's description as it doesn't fix the issue. > ^ is a regression compared to Spark 2.4 Are you sure about this. The related code in uniVocity is old. I guess 2.4 might have the same problem. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
