kimberlyamandalu opened a new issue #2696:
URL: https://github.com/apache/hudi/issues/2696


   I am getting the following errors when running a PySpark Streaming 
job in AWS Glue that upserts to a Hudi dataset in S3.
   
   Glue version: 2
   Python version: 3
   Spark version: 2
   Hudi version: 0.7.0
   
   
   `ERROR BaseTableMetadata: Failed to retrieve files in partition 
s3://mybucket/mydb/mytable/myear=2020/mmonth=3 from metadata`
   
   `ERROR FileSystemViewHandler: Got runtime exception servicing request 
partition=myear%3D2020%2Fmmonth%3D3&maxinstant=20210318163420&basepath=s3%3A%2F%2Fmybucket%2Fmydb%2Fmytable%2F&lastinstantts=20210318171415&timelinehash=c2fb0afdb102e7c3c543109e7df1c954b5aea0e738db1e4d644d9a832baf3d0b`
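
For readability, the percent-encoded request parameters in the `FileSystemViewHandler` error above can be decoded. This is a minimal sketch using only the Python standard library. The variable names are illustrative, and the query string is copied verbatim from the log line:

```python
from urllib.parse import parse_qs

# Query string copied verbatim from the FileSystemViewHandler error above.
raw_query = (
    "partition=myear%3D2020%2Fmmonth%3D3"
    "&maxinstant=20210318163420"
    "&basepath=s3%3A%2F%2Fmybucket%2Fmydb%2Fmytable%2F"
    "&lastinstantts=20210318171415"
    "&timelinehash=c2fb0afdb102e7c3c543109e7df1c954b5aea0e738db1e4d644d9a832baf3d0b"
)

# parse_qs percent-decodes each value and returns {name: [values]};
# every parameter appears once here, so keep only the first value.
params = {name: values[0] for name, values in parse_qs(raw_query).items()}

for name, value in params.items():
    print(f"{name} = {value}")
```

Decoded, `partition` is `myear=2020/mmonth=3` and `basepath` is `s3://mybucket/mydb/mytable/`, so this request targets the same partition named in the `BaseTableMetadata` error.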
   
   `WARN ExceptionMapper: Uncaught exception`
   
   There are many of these in my log file. I don't see any failed tasks in the 
Spark UI, but I do see some jobs running for a long time. I have posted 
screenshots of the Spark UI for one micro-batch below.
   
   Can I get some assistance to troubleshoot what might be going on?
   
   
![image](https://user-images.githubusercontent.com/25435575/111694901-da637880-8808-11eb-9855-53a9bfe85f54.png)
   
![image](https://user-images.githubusercontent.com/25435575/111695102-1696d900-8809-11eb-85a2-3be2b5461c37.png)
   
![image](https://user-images.githubusercontent.com/25435575/111695139-21ea0480-8809-11eb-9d1c-672dc467b1f7.png)
   



