pnowojski commented on a change in pull request #13718:
URL: https://github.com/apache/flink/pull/13718#discussion_r511867252



##########
File path: 
flink-runtime/src/main/java/org/apache/flink/runtime/io/network/api/serialization/SpanningWrapper.java
##########
@@ -286,11 +292,21 @@ private FileChannel createSpillingChannel() throws 
IOException {
                // try to find a unique file name for the spilling channel
                int maxAttempts = 10;
                for (int attempt = 0; attempt < maxAttempts; attempt++) {
-                       String directory = 
tempDirs[rnd.nextInt(tempDirs.length)];
+                       int dirIndex = rnd.nextInt(tempDirs.length);

Review comment:
       maybe instead of random picking a directory per every attempt, just 
randomly pick a starting directory index and then per each attempt increase it 
by one. +/- something like that:
   ```
   int initialDirIndex = rnd.nextInt(...);
   for (int attempt = 0; attempt < maxAttempts; attempt++) {
     int dirIndex = (initialDirIndex + attempt) % tempDirs.length;
     ...
   }
   ```
   ?

##########
File path: 
flink-runtime/src/main/java/org/apache/flink/runtime/io/network/api/serialization/SpanningWrapper.java
##########
@@ -286,11 +292,21 @@ private FileChannel createSpillingChannel() throws 
IOException {
                // try to find a unique file name for the spilling channel
                int maxAttempts = 10;
                for (int attempt = 0; attempt < maxAttempts; attempt++) {
-                       String directory = 
tempDirs[rnd.nextInt(tempDirs.length)];
+                       int dirIndex = rnd.nextInt(tempDirs.length);
+                       String directory = tempDirs[dirIndex];
                        File file = new File(directory, randomString(rnd) + 
".inputchannel");
-                       if (file.createNewFile()) {
-                               spillFile = new RefCountedFile(file);
-                               return new RandomAccessFile(file, 
"rw").getChannel();
+                       try {
+                               if (file.createNewFile()) {
+                                       spillFile = new RefCountedFile(file);
+                                       return new RandomAccessFile(file, 
"rw").getChannel();
+                               }
+                       } catch (IOException e) {
+                               // if there is no tempDir left to try
+                               if (tempDirs.length <= 1) {
+                                       throw e;
+                               }
+                               LOG.warn("Caught an IOException when creating 
spill file: " + directory + ". Attempt " + attempt, e);
+                               tempDirs = (String[]) 
ArrayUtils.remove(tempDirs, dirIndex);

Review comment:
       If we settle on such trivial approach to the problem (without temporary 
blacklisting), I wouldn't remove the failed dir from the `tempDir`, but just 
keep it and re-try next time?




----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to