Copilot commented on code in PR #19596:
URL: https://github.com/apache/datafusion/pull/19596#discussion_r2656645941


##########
datafusion/datasource/src/mod.rs:
##########
@@ -133,6 +134,8 @@ pub struct PartitionedFile {
     /// When set via [`Self::with_statistics`], partition column statistics 
are automatically
     /// computed from [`Self::partition_values`] with exact 
min/max/null_count/distinct_count.
     pub statistics: Option<Arc<Statistics>>,
+    /// A known ordering of the data in this file.

Review Comment:
   The documentation for the `ordering` field could be more descriptive. 
Consider expanding it to match the detail provided in the `with_ordering` 
method documentation, explaining that it represents the lexicographical sort 
order of the data and is typically inferred from file metadata (e.g., Parquet 
sorting_columns).
   ```suggestion
       /// The known lexicographical ordering of the rows in this file, if any.
       ///
       /// This describes how the data within the file is sorted with respect 
to one or more
       /// columns, and is used by the optimizer for planning operations that 
depend on input
       /// ordering (e.g. merges, sorts, and certain aggregations).
       ///
       /// When available, this is typically inferred from file-level metadata 
exposed by the
       /// underlying format (for example, Parquet `sorting_columns`), but it 
may also be set
       /// explicitly via [`Self::with_ordering`].
   ```



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to