Gatsby-Lee opened a new issue #4747:
URL: https://github.com/apache/hudi/issues/4747
I use Apache Hudi ( v0.9 ) in AWS Glue.
Since I enabled Hudi Metadata Table, I started seeing FileNotFoundException
like below.
* I enabled "hoodie.metadata.enable=True" from the very beginning.
* I use Zoopkeeper as Lock Provider.
I guess the "FileNotFoundException" happened because the content of Hudi
Metadata is out-dated.
Since I use AWS Glue, I don't have a way to run Hudi Metadata CLI.
Is there a way to rebuild the Hudi Metadata again by running a Glue ( or
Spark Job ) ?
Thank you
Gatsby
```
Caused by: java.io.FileNotFoundException: No such file or directory
's3://staging/events_v0/org_id=89/06a89e17-296b-4cf4-932f-684a95524090-0_22-8747-78425_20220204032706.parquet'
at
com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem.getFileStatus(S3NativeFileSystem.java:532)
at
com.amazon.ws.emr.hadoop.fs.EmrFileSystem.getFileStatus(EmrFileSystem.java:694)
at
org.apache.parquet.hadoop.util.HadoopInputFile.fromPath(HadoopInputFile.java:61)
at
org.apache.parquet.hadoop.ParquetFileReader.readFooter(ParquetFileReader.java:456)
at
org.apache.parquet.hadoop.ParquetFileReader.readFooter(ParquetFileReader.java:441)
at
org.apache.hudi.common.util.ParquetUtils.readMetadata(ParquetUtils.java:176)
```
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]