alamb commented on issue #15067: URL: https://github.com/apache/datafusion/issues/15067#issuecomment-2706951343
To trigger this error you need a slow internet connection and a parquet file where the row group is "large" The https://datasets.clickhouse.com/hits_compatible/athena_partitioned/hits_1.parquet file is particularly about in this regard (it has only three row groups with X rows). ```sql > select distinct row_group_id, row_group_num_rows from parquet_metadata('hits_1.parquet'); +--------------+--------------------+ | row_group_id | row_group_num_rows | +--------------+--------------------+ | 2 | 593202 | | 1 | 344064 | | 0 | 62734 | +--------------+--------------------+ 3 row(s) fetched. Elapsed 0.007 seconds. ``` -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
