jerryshao commented on a change in pull request #25552: [SPARK-28849][CORE] Add 
a number to control transferTo calls to avoid infinite loop in some occasional 
cases
URL: https://github.com/apache/spark/pull/25552#discussion_r317088993
 
 

 ##########
 File path: core/src/main/scala/org/apache/spark/util/Utils.scala
 ##########
 @@ -417,16 +418,19 @@ private[spark] object Utils extends Logging {
       input: FileChannel,
       output: WritableByteChannel,
       startPosition: Long,
-      bytesToCopy: Long): Unit = {
+      bytesToCopy: Long,
+      numTransferToCalls: Int): Unit = {
     val outputInitialState = output match {
       case outputFileChannel: FileChannel =>
         Some((outputFileChannel.position(), outputFileChannel))
       case _ => None
     }
     var count = 0L
+    var num = 0
     // In case transferTo method transferred less data than we have required.
-    while (count < bytesToCopy) {
+    while (count < bytesToCopy && num < numTransferToCalls) {
       count += input.transferTo(count + startPosition, bytesToCopy - count, output)
 
 Review comment:
   Alright, I will change it to track the return value. But I don't have any 
evidence that the return value is always 0; it may be a very small value, and I 
would want to fail fast in such a scenario. 
   
   In our tests on a normal cluster, 1 or 2 `transferTo` calls were enough to 
copy more than 200MB of data. If the copy cannot finish within 10k calls, I 
would rather fail it than keep waiting.
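
   To make the fail-fast idea concrete, here is a minimal sketch, not the code in this PR. The names `BoundedTransfer`, `copyWithLimit` and `maxTransferToCalls` are hypothetical, and throwing an `IOException` when the limit is reached is an assumption about how the failure would be reported:

```scala
import java.io.IOException
import java.nio.channels.{FileChannel, WritableByteChannel}

// Hypothetical helper, for illustration only; not part of Utils.scala.
object BoundedTransfer {

  /**
   * Copies `bytesToCopy` bytes from `input`, starting at `startPosition`,
   * into `output`. Returns the number of transferTo calls that were made.
   * Throws IOException if the copy is still incomplete after
   * `maxTransferToCalls` calls (assumed failure mode).
   */
  def copyWithLimit(
      input: FileChannel,
      output: WritableByteChannel,
      startPosition: Long,
      bytesToCopy: Long,
      maxTransferToCalls: Int): Int = {
    var count = 0L
    var calls = 0
    while (count < bytesToCopy) {
      if (calls >= maxTransferToCalls) {
        throw new IOException(
          s"Copied only $count of $bytesToCopy bytes after $calls transferTo calls")
      }
      // transferTo may move fewer bytes than requested, possibly zero,
      // so every call counts against the limit regardless of its return value.
      count += input.transferTo(count + startPosition, bytesToCopy - count, output)
      calls += 1
    }
    calls
  }
}
```

   Counting calls rather than only zero-byte returns also covers the case where each call makes tiny but non-zero progress, which is the scenario described above.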
