bithw1 opened a new issue, #11982:
URL: https://github.com/apache/hudi/issues/11982
Hi, I am using Hudi 0.15.0 and Spark 3.3.2.
I created a Hudi table with the Spark SQL below.
With `hoodie.compact.inline='false'` and
`hoodie.compact.schedule.inline='true'`, I want to schedule a compaction
immediately after each write, and run the compaction in a separate Spark job
using `HoodieCompactor`.
```
CREATE TABLE IF NOT EXISTS hudi_mor_12 (
a INT,
b INT,
c INT
)
USING hudi
tblproperties(
type='mor',
primaryKey='a',
preCombineField='c',
hoodie.compact.inline='false',
hoodie.compact.schedule.inline='true',
hoodie.compact.inline.max.delta.commits='2'
)
```
After the table was created, I ran the following inserts and updates in the
spark-sql CLI:
```
insert into hudi_mor_12 select 1,1,1;
insert into hudi_mor_12 select 2,2,2;
insert into hudi_mor_12 select 3,3,3;
insert into hudi_mor_12 select 4,4,4;
insert into hudi_mor_12 select 5,5,5;
update hudi_mor_12 set b = 11 where a = 1;
update hudi_mor_12 set b = 22 where a = 2;
update hudi_mor_12 set b = 33 where a = 3;
update hudi_mor_12 set b = 44 where a = 4;
update hudi_mor_12 set b = 55 where a = 5;
```
That is 5 update operations, i.e. 5 delta commits. With these operations, I
expected 5 compaction requests to be scheduled, because of the configuration
`hoodie.compact.inline='false'` and `hoodie.compact.schedule.inline='true'`.
But when I look into the .hoodie directory, only one compaction request is
there: `.hoodie/20240922130320079.compaction.requested`.
Why is there only one compaction.requested file? I expected a
compaction.requested file after each delta commit.
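
For anyone who wants to reproduce the check, here is a minimal sketch of how the timeline files in `.hoodie` could be counted by action and state, so the number of delta commits can be compared with the number of compaction requests. It assumes the table sits on a local filesystem and that timeline files are named `<instant>.<action>[.<state>]`, as in the file shown above; the function name `summarize_timeline` is hypothetical and not part of Hudi.

```python
from collections import Counter
from pathlib import Path


def summarize_timeline(hoodie_dir):
    """Count timeline files in a .hoodie directory by action and state.

    Hypothetical helper, not a Hudi API. It only inspects file names.
    """
    counts = Counter()
    for entry in Path(hoodie_dir).iterdir():
        # Skip subdirectories such as auxiliary or metadata folders.
        if not entry.is_file():
            continue
        # Split "<instant>.<action>[.<state>]" at the first dot.
        instant, _, action_state = entry.name.partition(".")
        # Keep only files whose prefix is a numeric instant time,
        # which skips files such as hoodie.properties.
        if instant.isdigit() and action_state:
            counts[action_state] += 1
    return dict(counts)
```

Calling `summarize_timeline` on the table's `.hoodie` path returns a dict keyed by suffix (for example `deltacommit` or `compaction.requested`), which makes it easy to see how many requests were scheduled relative to the delta commits.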