alamb commented on code in PR #16086:
URL: https://github.com/apache/datafusion/pull/16086#discussion_r2098034125


##########
datafusion/datasource-parquet/src/opener.rs:
##########
@@ -178,7 +182,7 @@ impl FileOpener for ParquetOpener {
             // Build predicates for this specific file
             let (pruning_predicate, page_pruning_predicate) = build_pruning_predicates(
                 predicate.as_ref(),
-                &physical_file_schema,
+                &logical_file_schema,

Review Comment:
   This is the actual change (from the physical to the logical schema). I am calling this out because it took me a while to spot it; at first I thought this was only a name change.



##########
datafusion/datasource-parquet/src/opener.rs:
##########
@@ -55,8 +55,9 @@ pub(super) struct ParquetOpener {
     pub limit: Option<usize>,
     /// Optional predicate to apply during the scan
     pub predicate: Option<Arc<dyn PhysicalExpr>>,
-    /// Schema of the output table
-    pub table_schema: SchemaRef,
+    /// Schema of the output table without partition columns.

Review Comment:
   I verified that when the code changes are reverted, these tests fail:
   
   ```shell
   cargo test --all-features -p datafusion -- parquet
   ...
   
   ---- datasource::physical_plan::parquet::tests::evolved_schema_column_type_filter_ints stdout ----
   
   thread 'datasource::physical_plan::parquet::tests::evolved_schema_column_type_filter_ints' panicked at datafusion/core/src/datasource/physical_plan/parquet.rs:927:9:
   assertion `left == right` failed
     left: 1
    right: 0
   note: run with `RUST_BACKTRACE=1` environment variable to display a backtrace
   
   ---- datasource::physical_plan::parquet::tests::evolved_schema_column_type_filter_strings stdout ----
   
   thread 'datasource::physical_plan::parquet::tests::evolved_schema_column_type_filter_strings' panicked at datafusion/core/src/datasource/physical_plan/parquet.rs:885:9:
   assertion `left == right` failed
     left: 1
    right: 0
   
   
   failures:
       datasource::physical_plan::parquet::tests::evolved_schema_column_type_filter_ints
       datasource::physical_plan::parquet::tests::evolved_schema_column_type_filter_strings
   ```



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

