pitrou commented on a change in pull request #11210:
URL: https://github.com/apache/arrow/pull/11210#discussion_r724211082



##########
File path: cpp/src/arrow/compute/exec/exec_plan.h
##########
@@ -243,6 +248,128 @@ class ARROW_EXPORT ExecNode {
   NodeVector outputs_;
 };
 
+/// \brief MapNode is an ExecNode type class which process a task like 
filter/project
+/// (See SubmitTask method) to each given ExecBatch object, which have one 
input, one
+/// output, and are pure functions on the input
+///
+/// A simple parallel runner is created with a "map_fn" which is just a 
function that
+/// takes a batch in and returns a batch.  This simple parallel runner also 
needs an
+/// executor (use simple synchronous runner if there is no executor)
+
+class MapNode : public ExecNode {
+ public:
+  MapNode(ExecPlan* plan, std::vector<ExecNode*> inputs,
+          std::shared_ptr<Schema> output_schema)
+      : ExecNode(plan, std::move(inputs), /*input_labels=*/{"target"},
+                 std::move(output_schema),
+                 /*num_outputs=*/1) {
+    executor_ = plan_->exec_context()->executor();
+  }
+
+  void ErrorReceived(ExecNode* input, Status error) override {
+    DCHECK_EQ(input, inputs_[0]);
+    outputs_[0]->ErrorReceived(this, std::move(error));
+  }

Review comment:
       As a general concern, we should minimize the amount of code in `.h` 
files unless inlining is performance-critical. Here, my guess is that it isn't 
(and furthermore, these are virtual methods so are unlikely to be inlined in 
the first place).




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to