amol- commented on code in PR #13409:
URL: https://github.com/apache/arrow/pull/13409#discussion_r917795065
##########
python/pyarrow/_exec_plan.pyx:
##########
@@ -95,6 +98,23 @@ cdef execplan(inputs, output_type, vector[CDeclaration]
plan, c_bool use_threads
c_in_table, 1 << 20)
c_input_node_opts = static_pointer_cast[CExecNodeOptions,
CTableSourceNodeOptions](
c_tablesourceopts)
+ elif isinstance(ipt, FilteredDataset):
Review Comment:
There shouldn't be any API on Dataset at the moment that should allow you to
restrict the columns before executing a join through the exec plan. We don't
yet provide any "projection" capability for `FilteredDataset` itself, only for
scanner.
Making ScanOptions seems like a good idea btw, would make easy to expand the
capabilities in the future. Adding projections for example would be just one
more passed option.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]