rubenssoto opened a new issue #2664: URL: https://github.com/apache/hudi/issues/2664
Hello Guys, I have a pipeline that reads csv file and writes to Hudi, much simple. The csv files are from an external application, yesterday I realized that the application creates a csv file even if has no data, so my pipeline read an empty csv and writes to Hudi, because of that, my tables have a lot of empty commits and all my parquet files disappeared. [20210311130632.commit.zip](https://github.com/apache/hudi/files/6123250/20210311130632.commit.zip) <img width="1403" alt="Captura de Tela 2021-03-11 às 10 26 09" src="https://user-images.githubusercontent.com/36298331/110794317-61867e80-8254-11eb-8d21-b6a4ebb1cbbc.png"> Am I have to put an empty dataframe validation in my pipe? Thank you so much ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected]
