hudi-bot opened a new issue, #16202:
URL: https://github.com/apache/hudi/issues/16202

   Of late, we are incurring timeouts for job module 4 in azure for UT module. 
   
   Its very strange from the logs. There are only 15 tests triggered and all 15 
of them are complete. And the test is just stuck in there. after 3 hour 40 mins 
or so, entire job run is cancelled and we don't see any logs in between. 
   
    
   
   Excerpt from logs:
   {code:java}
   2023-08-23T08:16:30.2299637Z [INFO] Tests run: 2, Failures: 0, Errors: 0, 
Skipped: 0, Time elapsed: 0.004 s - in org.apache.hudi.hadoop.TestAnnotation
   2023-08-23T08:16:30.2319408Z [INFO] Running 
org.apache.hudi.hadoop.hive.TestHoodieCombineHiveInputFormat
   2023-08-23T08:16:30.2421972Z Formatting using clusterid: testClusterID
   2023-08-23T08:16:30.2570394Z 31120 [Listener at localhost/41109] WARN  
org.apache.hadoop.conf.Configuration [] - No unit for dfs.heartbeat.interval(3) 
assuming SECONDS
   2023-08-23T08:16:30.2574429Z 31121 [Listener at localhost/41109] WARN  
org.apache.hadoop.conf.Configuration [] - No unit for 
dfs.namenode.safemode.extension(0) assuming MILLISECONDS
   2023-08-23T08:16:30.5588638Z 31422 [Listener at localhost/41109] WARN  
org.apache.hadoop.conf.Configuration [] - No unit for dfs.heartbeat.interval(3) 
assuming SECONDS
   2023-08-23T08:16:30.5589693Z 31422 [Listener at localhost/41109] WARN  
org.apache.hadoop.conf.Configuration [] - No unit for 
dfs.namenode.safemode.extension(0) assuming MILLISECONDS
   2023-08-23T08:16:30.6633485Z 31526 [Listener at localhost/33189] WARN  
org.apache.hadoop.conf.Configuration [] - No unit for 
dfs.datanode.outliers.report.interval(1800000) assuming MILLISECONDS
   2023-08-23T08:16:34.4630382Z 35326 [Listener at localhost/41177] WARN  
org.apache.hadoop.hdfs.server.datanode.DirectoryScanner [] - DirectoryScanner: 
shutdown has been called
   2023-08-23T08:16:34.5845964Z 35447 [BP-118616970-10.1.148.0-1692778590260 
heartbeating to localhost/127.0.0.1:33189] WARN  
org.apache.hadoop.hdfs.server.datanode.IncrementalBlockReportManager [] - 
IncrementalBlockReportManager interrupted
   2023-08-23T08:16:34.5895718Z 35447 [BP-118616970-10.1.148.0-1692778590260 
heartbeating to localhost/127.0.0.1:33189] WARN  
org.apache.hadoop.hdfs.server.datanode.DataNode [] - Ending block pool service 
for: Block pool BP-118616970-10.1.148.0-1692778590260 (Datanode Uuid 
eaf09d45-eb56-49ca-bec3-2da8a8f7bd14) service to localhost/127.0.0.1:33189
   2023-08-23T08:16:34.5952216Z 35458 
[refreshUsed-/tmp/hdfs-test-service16927785902315510072575960611105/data/data2/current/BP-118616970-10.1.148.0-1692778590260]
 WARN  org.apache.hadoop.fs.CachingGetSpaceUsed [] - Thread Interrupted waiting 
to refresh disk information: sleep interrupted
   2023-08-23T08:16:34.5971425Z 35458 
[refreshUsed-/tmp/hdfs-test-service16927785902315510072575960611105/data/data1/current/BP-118616970-10.1.148.0-1692778590260]
 WARN  org.apache.hadoop.fs.CachingGetSpaceUsed [] - Thread Interrupted waiting 
to refresh disk information: sleep interrupted
   2023-08-23T08:16:34.6255412Z 35488 [2085456234@qtp-936129838-1 - Acceptor0 
HttpServer2$SelectChannelConnectorWithSafeStartup@localhost:45383] WARN  
org.apache.hadoop.http.HttpServer2 [] - HttpServer Acceptor: isRunning is 
false. Rechecking.
   2023-08-23T08:16:34.6289510Z 35492 [2085456234@qtp-936129838-1 - Acceptor0 
HttpServer2$SelectChannelConnectorWithSafeStartup@localhost:45383] WARN  
org.apache.hadoop.http.HttpServer2 [] - HttpServer Acceptor: isRunning is false
   2023-08-23T08:16:34.7378002Z [WARNING] Tests run: 4, Failures: 0, Errors: 0, 
Skipped: 1, Time elapsed: 4.503 s - in 
org.apache.hudi.hadoop.hive.TestHoodieCombineHiveInputFormat
   2023-08-23T12:00:25.4660529Z ##[error]The operation was canceled.
   2023-08-23T12:00:25.4674798Z ##[section]Finishing: UT other modules {code}
   Just check for log timing for last 3 lines. 
   
    
   
   Ref:  
https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_apis/build/builds/19423/logs/21
   
    
   
    
   
   ## JIRA info
   
   - Link: https://issues.apache.org/jira/browse/HUDI-6755
   - Type: Bug
   - Epic: https://issues.apache.org/jira/browse/HUDI-4302


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to