rubenssoto opened a new issue #2664:
URL: https://github.com/apache/hudi/issues/2664


   Hello Guys,
   
   I have a pipeline that reads csv file and writes to Hudi, much simple. The 
csv files are from an external application, yesterday I realized that the 
application creates a csv file even if has no data, so my pipeline read an 
empty csv and writes to Hudi, because of that, my tables have a lot of empty 
commits and all my parquet files disappeared.
   
   
   
[20210311130632.commit.zip](https://github.com/apache/hudi/files/6123250/20210311130632.commit.zip)
   
   <img width="1403" alt="Captura de Tela 2021-03-11 às 10 26 09" 
src="https://user-images.githubusercontent.com/36298331/110794317-61867e80-8254-11eb-8d21-b6a4ebb1cbbc.png";>
   
   
   Am I have to put an empty dataframe validation in my pipe?
   
   Thank you so much
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to