alamb commented on code in PR #11994:
URL: https://github.com/apache/datafusion/pull/11994#discussion_r1717433490


##########
datafusion/core/src/datasource/physical_plan/parquet/mod.rs:
##########
@@ -144,6 +142,29 @@ pub use writer::plan_to_parquet;
 /// * User provided  [`ParquetAccessPlan`]s to skip row groups and/or pages
 ///   based on external information. See "Implementing External Indexes" below
 ///
+/// # Predicate Pushdown

Review Comment:
   I tried to consolidate the description of the predicate pushdown that 
`ParquetExec` performs.



##########
datafusion/core/src/datasource/physical_plan/parquet/row_filter.rs:
##########
@@ -312,8 +369,10 @@ fn size_of_columns(
     Ok(total_size)
 }
 
-/// For a given set of `Column`s required for predicate `Expr` determine whether all
-/// columns are sorted. Sorted columns may be queried more efficiently in the presence of
+/// For a given set of `Column`s required for predicate `Expr` determine whether
+/// all columns are sorted.
+///
+/// Sorted columns may be queried more efficiently in the presence of
 /// a PageIndex.
 fn columns_sorted(

Review Comment:
   It is interesting that we never connected up the `columns_sorted` 
information. Is this on your list, @thinkharderdev?
   
   Should I file a ticket to do this?



##########
datafusion/core/src/datasource/physical_plan/parquet/row_filter.rs:
##########
@@ -15,6 +15,49 @@
 // specific language governing permissions and limitations
 // under the License.
 
+//! Utilities to push down of DataFusion filter predicates (any DataFusion

Review Comment:
   This is mostly the same content, reformatted and made more concise.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

