gene-bordegaray commented on code in PR #18919:
URL: https://github.com/apache/datafusion/pull/18919#discussion_r2563335803
##########
datafusion/datasource/src/file_scan_config.rs:
##########
@@ -196,6 +199,11 @@ pub struct FileScanConfig {
/// would be incorrect if there are filters being applied, thus this
should be accessed
/// via [`FileScanConfig::statistics`].
pub(crate) statistics: Statistics,
+ /// Preserve partition column value boundaries when forming file groups.
+ pub preserve_partition_values: bool,
+ /// Cached result of key_partition_exprs computation to avoid repeated work
+ #[allow(clippy::type_complexity)]
+ key_partition_exprs_cache: OnceLock<Option<Vec<Arc<dyn PhysicalExpr>>>>,
Review Comment:
Caches results of compute_key_partition_exprs() which is expensive and
called multiple times (output_partitioning() and eq_properties())
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]