sanjibansg commented on a change in pull request #12560:
URL: https://github.com/apache/arrow/pull/12560#discussion_r828244263
##########
File path: python/pyarrow/_dataset.pyx
##########
@@ -2019,9 +2023,12 @@ cdef class Scanner(_Weakrefable):
dataset : Dataset
Dataset to scan.
columns : list of str or dict, default None
- The columns to project. This can be a list of column names to include
- (order and duplicates will be preserved), or a dictionary with
- {new_column_name: expression} values for more advanced projections.
+ The columns to project. This can be a list of column names to
+ include (order and duplicates will be preserved) which may contain the
+ augmented fields such as `batch_index`, `fragment_index`,
+ `last_in_fragment` and `filename`, or a dictionary
+ with {new_column_name: expression} values for more advanced
+ projections.
Review comment:
I am not very sure about the common template doctoring in `_dataset.pyx`.
Do you mean something like this?
https://github.com/apache/arrow/blob/4f2e5cc6669aa68dcdf9c876115cf7d1146c8cd5/python/pyarrow/_dataset.pyx#L352
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]