pitrou commented on PR #13442:
URL: https://github.com/apache/arrow/pull/13442#issuecomment-1171172497

   > Each row is about 10 MB of JSON.
   
   So 16MB is just barely adequate and may be too small for other similar 
datasets?
   
   Keep in mind that the block size is not merely used for type inference, it's 
used as a unit of work for batching and parallelization. A large value could be 
detrimental to performance.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to