saulpw opened a new pull request, #13442: URL: https://github.com/apache/arrow/pull/13442
Parsing JSONL with large objects fails with previous 1MB block_size. Experimentation is required to find a workable block_size. These days JSON blobs are more frequently bigger than 1MB. 16MB is a more reasonable balance between amount of memory allocated for the buffer, and number of JSON files in the wild that can be parsed immediately and without friction. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org