Hi,

+1 for making the number of retained snapshots configurable.

In addition, we may need to rename the configuration options to:
* snapshot.keep.min-count = 10
* snapshot.keep.duration = 7d

Base is a concept of the Mixed format, so we should not keep it in the common
configuration names.
There is also another issue[1] that focuses on improving configuration values
that use duration units.
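
To make the intended interaction of the two options concrete, here is a minimal
sketch. It assumes a snapshot is expired only when it is both older than
`snapshot.keep.duration` and outside the newest `snapshot.keep.min-count`
snapshots. The function and parameter names are hypothetical and for
illustration only; this is not Amoro's or Iceberg's actual implementation.

```python
from datetime import datetime, timedelta


def snapshots_to_expire(snapshot_times, now, keep_min_count, keep_duration):
    """Return the snapshot timestamps eligible for expiry, newest first.

    Hypothetical helper. Assumed rule: a snapshot is kept if it is among
    the `keep_min_count` newest snapshots OR younger than `keep_duration`.
    """
    newest_first = sorted(snapshot_times, reverse=True)
    cutoff = now - keep_duration
    return [
        t
        for i, t in enumerate(newest_first)
        if i >= keep_min_count and t < cutoff
    ]


# Example: snapshots that are 1, 3, 10, 20 and 30 days old.
now = datetime(2024, 9, 2)
snapshots = [now - timedelta(days=d) for d in (1, 3, 10, 20, 30)]

# snapshot.keep.min-count = 3, snapshot.keep.duration = 7d
expired = snapshots_to_expire(snapshots, now, 3, timedelta(days=7))
```

In this example the 10-day-old snapshot is past the 7d duration but is still
kept, because it is one of the three newest snapshots; only the 20-day-old and
30-day-old snapshots are expired.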

[1] https://github.com/apache/amoro/issues/3119

Best,
Jinsong

On Mon, Sep 2, 2024 at 6:02 PM Paul Lam <paullin3...@gmail.com> wrote:

> +1 for supporting a configurable number of snapshots to keep.
>
> However, Iceberg natively supports `history.expire.min-snapshots-to-keep`.
> Should we directly reuse its value?
>
> Best,
> Paul Lam
>
> > On Sep 2, 2024, at 16:10, Xavier Bai <x...@apache.org> wrote:
> >
> > Hi developers,
> >
> > Currently, when we expire snapshots, we only consider the TTL as a
> > factor. However, Iceberg supports setting a minimum number of snapshots
> > to retain. I believe we should also make this configuration option
> > available (e.g. `snapshot.base.keep.min-count`). For developers and table
> > users, these snapshots can be understood as the update log of the table,
> > making it easier to review the history of updates. If there are
> > no updates for an extended period, Amoro may retain only a single
> > snapshot after cleanup, which could result in a loss of information for
> > users.
> >
> > In addition, the Iceberg community is also working on retaining more
> > historical snapshot information in an additional folder. I believe this
> > is a significant requirement.
> >
> > Best regards,
> > Xu Bai
>
>
