kerwin-zk commented on code in PR #5675:
URL: https://github.com/apache/incubator-gluten/pull/5675#discussion_r1596242830
##########
cpp/velox/shuffle/VeloxShuffleWriter.cc:
##########
@@ -376,8 +386,145 @@ arrow::Status VeloxShuffleWriter::partitioningAndDoSplit(facebook::velox::RowVec
return arrow::Status::OK();
}
-arrow::Status VeloxShuffleWriter::stop() {
+arrow::Status VeloxShuffleWriter::doSort(facebook::velox::RowVectorPtr rv, int64_t memLimit) {
+ currentInputColumnBytes_ += rv->estimateFlatSize();
+ batches_.push_back(rv);
Review Comment:
@zhouyuan In the HashPartitioner's compute, the information for each row of
the RowVector is encapsulated into an int, and the ints for rows that share a
PartitionId are put into that partition's Vector in the map. This is
equivalent to replacing the Sort with a HashMap.
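
As a rough illustration of the grouping idea described above (not the actual HashPartitioner code), the sketch below buckets row indices by partition id in a hash map. The function name `groupRowsByPartition`, the use of a plain row index as the per-row int, and the `std::unordered_map` container are all assumptions made for this example.

```cpp
#include <cstdint>
#include <unordered_map>
#include <vector>

// Hypothetical helper, not part of Gluten or Velox.
// Input: the partition id computed for each row of one batch.
// Output: for every partition id, the row indices that belong to it,
// in their original row order. No sort over the rows is needed.
std::unordered_map<uint32_t, std::vector<int32_t>> groupRowsByPartition(
    const std::vector<uint32_t>& partitionIds) {
  std::unordered_map<uint32_t, std::vector<int32_t>> rowsByPartition;
  const auto numRows = static_cast<int32_t>(partitionIds.size());
  for (int32_t row = 0; row < numRows; ++row) {
    // operator[] creates an empty vector the first time a partition id is seen.
    rowsByPartition[partitionIds[row]].push_back(row);
  }
  return rowsByPartition;
}
```

Each row is visited once, so grouping is linear in the number of rows, whereas sorting the rows by partition id would be O(n log n). Within a partition the rows keep their input order because they are appended as they are encountered.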
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]