cyb70289 commented on issue #14229: URL: https://github.com/apache/arrow/issues/14229#issuecomment-1282463476
Maybe you need more patience :-) I've attached a dataset created with the code below. The Parquet file is about 15 MB; zipped, it is only 60 KB. I saw RSS jump above 30 GB while loading the file, so I'm not sure a laptop can read it.

[test.parquet.zip](https://github.com/apache/arrow/files/9811264/test.parquet.zip)

```python
import numpy as np
n_rows = 10800000
# n_rows references to one 200-element all-False boolean array
data = [np.zeros(200, dtype='bool')] * n_rows
```
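
For scale, here is a rough back-of-the-envelope estimate of how much memory the raw values in the snippet above should need. It is only a sketch under stated assumptions: that the column ends up as an Arrow list of booleans with bit-packed values and 32-bit offsets (the file's actual schema isn't shown here), and it ignores validity bitmaps and any allocator overhead. The variable names are illustrative.

```python
# Rough size estimate for 10.8M rows of 200 booleans each.
n_rows = 10_800_000
row_len = 200

n_values = n_rows * row_len                # total boolean values

numpy_bytes = n_values                     # NumPy bool uses 1 byte per value
arrow_value_bytes = (n_values + 7) // 8    # Arrow booleans are bit-packed
# Assumption: list<bool> column with int32 offsets (n_rows + 1 entries).
arrow_offset_bytes = (n_rows + 1) * 4

GIB = 1024 ** 3
print(f"values:            {n_values:,}")
print(f"as NumPy bools:    {numpy_bytes / GIB:.2f} GiB")
print(f"as Arrow bitmap:   {arrow_value_bytes / GIB:.2f} GiB")
print(f"Arrow offsets:     {arrow_offset_bytes / GIB:.2f} GiB")
```

Under these assumptions the decoded data is on the order of a few GiB at most, which is well below the 30 GB RSS observed while loading.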
