nealrichardson commented on code in PR #13700:
URL: https://github.com/apache/arrow/pull/13700#discussion_r930125712


##########
cpp/src/arrow/compute/exec/options.h:
##########
@@ -430,6 +430,23 @@ class ARROW_EXPORT SelectKSinkNodeOptions : public SinkNodeOptions {
   SelectKOptions select_k_options;
 };
 
+/// \brief Make a node which selects a range of rows passed through it
+///
+/// All batches pushed to this node will be accumulated, then selected, by the given

Review Comment:
   If I just want to select a range of rows, why accumulate all batches? In R,
we currently implement `head()` by killing the plan once we've collected
enough rows: https://github.com/apache/arrow/blob/master/r/R/query-engine.R#L241
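
   A rough sketch of the early-termination idea, in plain C++ rather than the Arrow ExecNode API. `Batch`, `BatchSource`, and `FetchRange` are hypothetical names invented for illustration, and the pull-based source is an assumption, not how the exec plan actually delivers batches:

```cpp
#include <cassert>
#include <cstdint>
#include <functional>
#include <vector>

// Hypothetical stand-in for a record batch: just a vector of row values.
using Batch = std::vector<int>;

// Hypothetical pull-based source: fills `out` and returns true,
// or returns false once the input is exhausted.
using BatchSource = std::function<bool(Batch*)>;

// Collect rows [offset, offset + limit) in arrival order and stop pulling
// from the source as soon as the range is filled, instead of accumulating
// every batch first.
std::vector<int> FetchRange(const BatchSource& next, int64_t offset, int64_t limit) {
  assert(offset >= 0 && limit >= 0);
  std::vector<int> rows;
  int64_t seen = 0;  // rows consumed from the source so far
  Batch batch;
  // The size check comes first, so no batch is pulled once the range is full.
  while (static_cast<int64_t>(rows.size()) < limit && next(&batch)) {
    for (int value : batch) {
      if (seen >= offset && static_cast<int64_t>(rows.size()) < limit) {
        rows.push_back(value);
      }
      ++seen;
    }
  }
  return rows;
}
```

   With batches of three rows each, `FetchRange(src, 4, 3)` only needs to pull three batches, however many the source could produce. This only works when no ordering is required; a sorted fetch still has to see all of the input.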



##########
cpp/src/arrow/compute/exec/options.h:
##########
@@ -430,6 +430,23 @@ class ARROW_EXPORT SelectKSinkNodeOptions : public SinkNodeOptions {
   SelectKOptions select_k_options;
 };
 
+/// \brief Make a node which selects a range of rows passed through it
+///
+/// All batches pushed to this node will be accumulated, then selected, by the given
+/// fields. Then sorted batches will be forwarded to the generator in sorted order and

Review Comment:
   How is FetchAndSort different from TopK?



