alamb commented on issue #15067:
URL: https://github.com/apache/datafusion/issues/15067#issuecomment-2706951343

   To trigger this error you need a slow internet connection and a parquet file 
where the row group is "large" 
   
   The 
https://datasets.clickhouse.com/hits_compatible/athena_partitioned/hits_1.parquet
 file is particularly about in this regard (it has only three row groups with X 
rows). 
   
   ```sql
   > select distinct row_group_id, row_group_num_rows from 
parquet_metadata('hits_1.parquet');
   +--------------+--------------------+
   | row_group_id | row_group_num_rows |
   +--------------+--------------------+
   | 2            | 593202             |
   | 1            | 344064             |
   | 0            | 62734              |
   +--------------+--------------------+
   3 row(s) fetched.
   Elapsed 0.007 seconds.
   ```
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to