Balaji Varadarajan created HUDI-1497:
----------------------------------------

             Summary: Timeout Exception during getFileStatus() 
                 Key: HUDI-1497
                 URL: https://issues.apache.org/jira/browse/HUDI-1497
             Project: Apache Hudi
          Issue Type: Sub-task
          Components: Writer Core
            Reporter: Balaji Varadarajan


Seeing this happening when running RFC-15 branch in long running mode. There 
could be a resource leak as I am seeing this consistently after every 1 or 2 
hour period runs.  The below log shows it is during accessing bootstrap index 
but I am seeing it in getFileStatus() for other files too.

 

 

Caused by: java.io.InterruptedIOException: getFileStatus on 
s3://robinhood-encrypted-hudi-data-cove/dummy/balaji/sickle/public/client_ledger_clientledgerbalance/test_v4/.hoodie/.aux/.bootstrap/.partitions/00000000-0000-0000-0000-000000000000-0_1-0-1_00000000000001.hfile:
 com.amazonaws.SdkClientException: Unable to execute HTTP request: Timeout 
waiting for connection from poolCaused by: java.io.InterruptedIOException: 
getFileStatus on 
s3://robinhood-encrypted-hudi-data-cove/dummy/balaji/sickle/public/client_ledger_clientledgerbalance/test_v4/.hoodie/.aux/.bootstrap/.partitions/00000000-0000-0000-0000-000000000000-0_1-0-1_00000000000001.hfile:
 com.amazonaws.SdkClientException: Unable to execute HTTP request: Timeout 
waiting for connection from pool at 
org.apache.hadoop.fs.s3a.S3AUtils.translateException(S3AUtils.java:141) at 
org.apache.hadoop.fs.s3a.S3AUtils.translateException(S3AUtils.java:117) at 
org.apache.hadoop.fs.s3a.S3AFileSystem.s3GetFileStatus(S3AFileSystem.java:1859) 
at 
org.apache.hadoop.fs.s3a.S3AFileSystem.innerGetFileStatus(S3AFileSystem.java:1823)
 at 
org.apache.hadoop.fs.s3a.S3AFileSystem.getFileStatus(S3AFileSystem.java:1763) 
at org.apache.hadoop.fs.FileSystem.exists(FileSystem.java:1627) at 
org.apache.hadoop.fs.s3a.S3AFileSystem.exists(S3AFileSystem.java:2500) at 
org.apache.hudi.common.fs.HoodieWrapperFileSystem.exists(HoodieWrapperFileSystem.java:549)
 at 
org.apache.hudi.common.bootstrap.index.HFileBootstrapIndex.<init>(HFileBootstrapIndex.java:102)
 ... 33 moreCaused by: com.amazonaws.SdkClientException: Unable to execute HTTP 
request: Timeout waiting for connection from pool at 
com.amazonaws.http.AmazonHttpClient$RequestExecutor.handleRetryableException(AmazonHttpClient.java:1113)
 at 
com.amazonaws.http.AmazonHttpClient$RequestExecutor.executeHelper(AmazonHttpClient.java:1063)
 at 
com.amazonaws.http.AmazonHttpClient$RequestExecutor.doExecute(AmazonHttpClient.java:743)
 at 
com.amazonaws.http.AmazonHttpClient$RequestExecutor.executeWithTimer(AmazonHttpClient.java:717)
 at 
com.amazonaws.http.AmazonHttpClient$RequestExecutor.execute(AmazonHttpClient.java:699)
 at 
com.amazonaws.http.AmazonHttpClient$RequestExecutor.access$500(AmazonHttpClient.java:667)
 at 
com.amazonaws.http.AmazonHttpClient$RequestExecutionBuilderImpl.execute(AmazonHttpClient.java:649)
 at com.amazonaws.http.AmazonHttpClient.execute(AmazonHttpClient.java:513) at 
com.amazonaws.services.s3.AmazonS3Client.invoke(AmazonS3Client.java:4229) at 
com.amazonaws.services.s3.AmazonS3Client.invoke(AmazonS3Client.java:4176) at 
com.amazonaws.services.s3.AmazonS3Client.getObjectMetadata(AmazonS3Client.java:1253)
 at 
org.apache.hadoop.fs.s3a.S3AFileSystem.getObjectMetadata(S3AFileSystem.java:1053)
 at 
org.apache.hadoop.fs.s3a.S3AFileSystem.s3GetFileStatus(S3AFileSystem.java:1841) 
... 39 more



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to