[I] [SUPPORT] What is the behavior of compact inline and compact schedule inline [hudi]

via GitHub Sat, 21 Sep 2024 22:44:18 -0700


bithw1 opened a new issue, #11982:
URL: https://github.com/apache/hudi/issues/11982


   Hi, I am using Hudi 0.15.0 and Spark 3.3.2.
   
   I created a Hudi table with the Spark SQL below.
   
   With `hoodie.compact.inline='false'` and 
`hoodie.compact.schedule.inline='true'`, I want to schedule a compaction 
immediately after each write and run the compaction in a separate Spark job 
using `HoodieCompactor`:
   
   ```
   CREATE TABLE IF NOT EXISTS hudi_mor_12 (
     a INT,
     b INT,
     c INT
   ) 
   
   USING hudi
   
   tblproperties(
   type='mor',
   primaryKey='a',
   preCombineField='c',
   hoodie.compact.inline='false',
   hoodie.compact.schedule.inline='true', 
   hoodie.compact.inline.max.delta.commits='2'
   )
   
   ```
   
   After the table was created, I ran the following inserts and updates in the 
spark-sql CLI:
   
   ```
   insert into hudi_mor_12 select 1,1,1; 
   insert into hudi_mor_12 select 2,2,2;
   insert into hudi_mor_12 select 3,3,3;
   insert into hudi_mor_12 select 4,4,4;
   insert into hudi_mor_12 select 5,5,5;
   
   update hudi_mor_12 set b = 11  where a = 1;
   update hudi_mor_12 set b = 22  where a = 2;
   update hudi_mor_12 set b = 33  where a = 3;
   update hudi_mor_12 set b = 44  where a = 4;
   update hudi_mor_12 set b = 55  where a = 5;
   
   ```
   
   I have 5 update operations, which produce 5 delta commits. With these 
operations, I expected 5 compaction requests to be scheduled because of the 
configuration `hoodie.compact.inline='false'` and 
`hoodie.compact.schedule.inline='true'`.
   
   But when I look into the .hoodie directory, there is only one compaction 
request: `.hoodie/20240922130320079.compaction.requested`.
   
   Why is there only one compaction.requested file? I expected a 
compaction.requested file to appear after each delta commit.
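
   To make the check of the .hoodie directory repeatable, the timeline files 
can be counted by action and state. This is a minimal Python sketch, not part 
of Hudi: `summarize_timeline` is a hypothetical helper, and it assumes the 
table sits on a local filesystem and that timeline files are named 
`<instant time>.<action>[.<state>]`, as the file name above suggests.

```python
import re
from collections import Counter
from pathlib import Path

# Assumed naming scheme: <instant time>.<action>[.<state>], for example
# 20240922130320079.compaction.requested. A file without a state suffix
# is counted as "completed".
INSTANT_FILE = re.compile(r"^(\d+)\.([a-z]+)(?:\.(requested|inflight))?$")


def summarize_timeline(hoodie_dir):
    """Count timeline files in a local .hoodie directory by (action, state)."""
    counts = Counter()
    for entry in Path(hoodie_dir).iterdir():
        if not entry.is_file():
            continue  # skip sub-directories
        match = INSTANT_FILE.match(entry.name)
        if match is None:
            continue  # skip files such as hoodie.properties
        _, action, state = match.groups()
        if action == "inflight":
            # Assumption: a bare "<instant>.inflight" file is an inflight commit.
            action, state = "commit", "inflight"
        counts[(action, state or "completed")] += 1
    return counts


def print_summary(hoodie_dir):
    """Print one line per (action, state) pair with its file count."""
    for (action, state), n in sorted(summarize_timeline(hoodie_dir).items()):
        print(f"{action:<14}{state:<11}{n}")
```

   Calling `print_summary` on the table's `.hoodie` directory would list the 
count for `compaction` / `requested` next to the `deltacommit` counts, so the 
single compaction request reported above can be compared directly with the 
number of delta commits.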
   
   
   
   
   


