alamb opened a new issue, #5708:
URL: https://github.com/apache/arrow-datafusion/issues/5708

   ### Describe the bug
   
   Parquet row group pruning incorrectly prunes out row groups when columns 
names have `.` in them
   
   ### To Reproduce
   
   Use this file:  
[spans.zip](https://github.com/apache/arrow-datafusion/files/11055059/spans.zip)
   
   Run using datafusion-cli:
   
   ```sql
   ❯ SELECT "service.name" FROM 'spans.parquet';
   +--------------+
   | service.name |
   +--------------+
   | frontend     |
   +--------------+
   1 row in set. Query took 0.002 seconds.
   ❯ SELECT "service.name" FROM 'spans.parquet' WHERE "service.name" = 
'frontend'
   0 rows in set. Query took 0.002 seconds.
   ```
   
   ### Expected behavior
   
   
   However if I disable row group pruning the same query works as expected and 
returns a single row
   
   ```sql
   ❯ set datafusion.execution.parquet.pruning=false;
   0 rows in set. Query took 0.000 seconds.
   ❯ SELECT "service.name" FROM 'spans.parquet' WHERE "service.name" = 
'frontend';
   +--------------+
   | service.name |
   +--------------+
   | frontend     |
   +--------------+
   ```
   
   ### Additional context
   
   https://github.com/influxdata/influxdb_iox/issues/7225


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to