wangxianghu commented on pull request #2452:
URL: https://github.com/apache/hudi/pull/2452#issuecomment-817089012


   > can we just have this implemented as a replace of the partition where 
all files are replaced by an empty list. cleaner would automatically clean the 
partition that way. Love to keep all of our tooling to be flexible at the file 
level, working with existing actions and timeline
   
   Hi @vinothchandar @nsivabalan, sorry for the delay; I have been busy these days.
   The current implementation leverages the API 
`org.apache.hudi.client.SparkRDDWriteClient#deletePartitions` provided by 
@lw309637554. IIUC, it is already implemented as a replace of the partition 
where all files are replaced by empty data.
   
   BTW, I have come to think that the clean strategy is better suited to TTL 
scenarios than to this one.
   Partition deletion is a relatively fixed, one-shot use case, so it 
seems better to provide it as a tool. That way, the user only needs to 
specify the partitions to delete, plus a small amount of configuration such as 
table information, and the partitions are deleted.
   
   In the TTL scenario, hoodie can check for outdated partitions on each 
commit (as part of cleaning), and that check has to keep running for the 
lifetime of the table, so it is well suited to being implemented as a clean strategy.
   WDYT?
   cc @n3nash @lw309637554 @yanghua 



