lucabem commented on issue #7487:
URL: https://github.com/apache/hudi/issues/7487#issuecomment-1396742280

   Hi @yihua.
   
   Im facing the same error on EMR on EKS using Hudi 0.12.1. Im having slow 
download S3 file on first stages (check if is empty stage), downloading 300MB 
parquet on 1 hour.
   
   I have tried your s3-debug.log4j.properties but it doesnt give me anythin 
unless this logs:
   ```
   ERROR StatusLogger Reconfiguration failed: No configuration found for 
'18b4aac2' at 'null' in 'null'
   ERROR StatusLogger Reconfiguration failed: No configuration found for 
'Default' at 'null' in 'null'
   ```
   In my case, EMR ends fine but sometimes it shows me logs 
   ```
   23/01/16 10:56:05 INFO GlobalS3Executor: ReadTimeout File: 
landing/dms/full/my_system/my_table/LOAD00000003.parquet; Range: [62489073, 
62491066]
   Use default timeout configuration to retry for read timeout 
com.amazon.ws.emr.hadoop.fs.shaded.com.amazonaws.SdkClientException: Unable to 
execute HTTP request: Read timed out
   ```
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to