Re: [PR] [#2652] feat(spark): Introduce records number check to ensure end-to-end data consistency [uniffle]

via GitHub Thu, 13 Nov 2025 00:12:41 -0800


zuston commented on code in PR #2653:
URL: https://github.com/apache/uniffle/pull/2653#discussion_r2522195598



##########
client-spark/spark3/src/main/java/org/apache/spark/shuffle/writer/RssShuffleWriter.java:
##########
@@ -933,14 +949,12 @@ public Option<MapStatus> stop(boolean success) {
             bitmapSplitNum,
             reportDuration);
         shuffleWriteMetrics.incWriteTime(TimeUnit.MILLISECONDS.toNanos(reportDuration));
+
         // todo: we can replace the dummy host and port with the real shuffle server which we prefer
         // to read
         final BlockManagerId blockManagerId =
             BlockManagerId.apply(
-                appId + "_" + taskId,
-                DUMMY_HOST,
-                DUMMY_PORT,
-                Option.apply(Long.toString(taskAttemptId)));
+                appId + "_" + taskId, DUMMY_HOST, DUMMY_PORT, Option.apply(createShuffleInfo()));

Review Comment:
   > Would it be better to send this data to ShuffleServer? This can easily cause an OOM of the Driver.
   
   I have to say you are right. For a huge stage, such as 100000 tasks with 10000 partitions, the driver will OOM. So I will have to make this design workable by storing this data on the shuffle server instead.
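
   To make the scale concrete, here is a rough back-of-the-envelope sketch. It assumes every task reports one entry per partition and that each entry costs 8 bytes (a single `long`); both are assumptions for illustration, not the actual encoding produced by `createShuffleInfo()`. The class and method names are hypothetical.

```java
public class DriverMemoryEstimate {

    /**
     * Bytes held on the driver if every task reports one fixed-size
     * entry per partition. The per-entry size is an assumption.
     */
    static long estimateBytes(long tasks, long partitions, long bytesPerEntry) {
        // multiplyExact throws ArithmeticException instead of silently overflowing.
        long entries = Math.multiplyExact(tasks, partitions);
        return Math.multiplyExact(entries, bytesPerEntry);
    }

    public static void main(String[] args) {
        long tasks = 100_000L;      // stage size from the comment above
        long partitions = 10_000L;  // partition count from the comment above
        long bytesPerEntry = 8L;    // assumed: one long per (task, partition) pair

        long bytes = estimateBytes(tasks, partitions, bytesPerEntry);
        System.out.println("entries = " + Math.multiplyExact(tasks, partitions));
        System.out.println("bytes   = " + bytes);
    }
}
```

   Under these assumptions the stage yields 1,000,000,000 entries and 8,000,000,000 bytes of raw payload, before any object or string overhead in `MapStatus`, which is why keeping it on the driver is not viable.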



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.


