[GitHub] [spark] MaxGekk commented on pull request #31858: [SPARK-34768][SQL] Respect the default input buffer size in Univocity

GitBox Wed, 17 Mar 2021 00:45:10 -0700


MaxGekk commented on pull request #31858:
URL: https://github.com/apache/spark/pull/31858#issuecomment-800870879



   >  I believe it's better to use default buffer size for stability,
   
   I agree.
   
   > potentially better performance, etc. in any event.
   
   It would be nice to re-run CSV benchmarks. Though we can do that separately 
from this PR.
   
   > It will also fix uniVocity/univocity-parsers#449
   
   Could you update PR's description as it doesn't fix the issue.
   
   > ^ is a regression compared to Spark 2.4
   
   Are you sure about this. The related code in uniVocity is old. I guess 2.4 
might have the same problem.


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] [spark] MaxGekk commented on pull request #31858: [SPARK-34768][SQL] Respect the default input buffer size in Univocity

Reply via email to