cdmikechen commented on issue #1073: [HUDI-377] Adding Delete() support to 
DeltaStreamer
URL: https://github.com/apache/incubator-hudi/pull/1073#issuecomment-568702329
 
 
   I have another question: if Hudi also syncs to a Hive table, should the 
`_hoodie_delete_marker` column be added to that table or not?
   
   Let's assume a scenario: messages arrive from Oracle via Kafka into Hudi, and 
each one has a column named `op_type` with three possible values: `I` (insert), 
`U` (update), and `D` (delete). 
   In this case, we need to add a column named `_hoodie_delete_marker` and set 
it to true for `D` and to null for `I` and `U`. But the user may not want Hive to 
show `_hoodie_delete_marker`, because this field has no meaning for them; they 
already have another field that represents it.
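
   To make the scenario concrete, here is a minimal sketch of that extra step. It uses plain Python dicts rather than any Hudi or Spark API, and the helper name `add_delete_marker` and the sample rows are made up for illustration.

```python
# Plain-Python sketch; records are dicts, not Hudi or Spark rows.
DELETE_MARKER = "_hoodie_delete_marker"


def add_delete_marker(record):
    """Return a copy of the record with the marker derived from op_type."""
    out = dict(record)
    # `D` maps to True; `I` and `U` map to None (null), as in the scenario.
    out[DELETE_MARKER] = True if record.get("op_type") == "D" else None
    return out


# Illustrative sample rows (assumed shape, not real data).
rows = [
    {"id": 1, "op_type": "I"},
    {"id": 2, "op_type": "U"},
    {"id": 3, "op_type": "D"},
]
marked = [add_delete_marker(r) for r in rows]
```

   Every written record now carries both `op_type` and `_hoodie_delete_marker`, which is the duplication a Hive user would see.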
   
   So I wonder whether we could add two attributes to `DataSourceWriteOptions` 
that tell Hudi which field indicates a delete and which value of that field means delete?
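
   A rough sketch of what I mean, again in plain Python. The option keys `delete.field` and `delete.value` are hypothetical names used only for illustration; they are not existing Hudi options, and how Hudi would consume them is left open.

```python
# Hypothetical option names for illustration; NOT existing Hudi options.
options = {"delete.field": "op_type", "delete.value": "D"}


def is_delete(record, opts):
    """True when the configured field holds the configured delete value."""
    return record.get(opts["delete.field"]) == opts["delete.value"]


def split_by_operation(records, opts):
    """Split records into (upserts, deletes) without adding a marker column."""
    upserts = [r for r in records if not is_delete(r, opts)]
    deletes = [r for r in records if is_delete(r, opts)]
    return upserts, deletes


# Illustrative sample rows (assumed shape, not real data).
rows = [
    {"id": 1, "op_type": "I"},
    {"id": 2, "op_type": "U"},
    {"id": 3, "op_type": "D"},
]
upserts, deletes = split_by_operation(rows, options)
```

   With something like this, the records keep their original schema, so nothing extra would show up in the Hive table.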
   
   
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services
