lucabem commented on issue #7487: URL: https://github.com/apache/hudi/issues/7487#issuecomment-1396742280
Hi @yihua. Im facing the same error on EMR on EKS using Hudi 0.12.1. Im having slow download S3 file on first stages (check if is empty stage), downloading 300MB parquet on 1 hour. I have tried your s3-debug.log4j.properties but it doesnt give me anythin unless this logs: ``` ERROR StatusLogger Reconfiguration failed: No configuration found for '18b4aac2' at 'null' in 'null' ERROR StatusLogger Reconfiguration failed: No configuration found for 'Default' at 'null' in 'null' ``` In my case, EMR ends fine but sometimes it shows me logs ``` 23/01/16 10:56:05 INFO GlobalS3Executor: ReadTimeout File: landing/dms/full/my_system/my_table/LOAD00000003.parquet; Range: [62489073, 62491066] Use default timeout configuration to retry for read timeout com.amazon.ws.emr.hadoop.fs.shaded.com.amazonaws.SdkClientException: Unable to execute HTTP request: Read timed out ``` -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
