bkietz commented on pull request #10397:
URL: https://github.com/apache/arrow/pull/10397#issuecomment-853380645


   >  How do you see things evolving? Do you think the various operations 
achieved by a scanner today will be achieved by an execution plan? For example, 
will ScanBatches, CountRows, etc. create and execute an execution plan instead 
of maintaining the dual paths?
   
   I'd like the ExecPlan to be usable enough to replace all filtering and 
projection currently in Scanner. So for example ScanBatches could assemble an 
ExecPlan to handle filtering and projection then receive and reorder batches; 
never needing to explicitly evaluate an expression.
   
   Ultimately, I'm not positive we'll keep Scanner. It's possible we could 
simplify the dataset module to a factory for source/sink nodes. In that case, 
anything which currently builds a Scanner would instead produce an ExecPlan. 
We'll see


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Reply via email to