ArnavBalyan opened a new pull request, #7825:
URL: https://github.com/apache/paimon/pull/7825

   ### Purpose
      - Today `compaction.optimization-interval` is not respected. When this 
interval set, the full compaction is supposed to run at max once per time 
interval. But this actually runs on every writer recreate much more frequently 
than the time interval, which can silently inflate i/o and CPU
      - The last compaction ts is an inmemory field that starts as `null`. 
However since `MergeTreeWriter` can be recreated many times, a new trigger 
causes it to be constructed with `null` value and the compaction fires more 
frequently.
      - Timestamp of the prior full compaction is today lost across writer 
recreations of MergeTreeWriter, but can be obtained using restoreFiles. 
    - Fix this by computing the most recent creationTimeEpochMillis of files at 
the max LSM level from restoreFiles and passing it to full compaction create.
    - Closes user reported bug #6547.
   ### Tests
    - UT


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to