Thanks for starting this discussion.

For some extreme cases, rocksdb is still needed to ensure stability under
low memory conditions, even though it is slower.

> 1. The first one is to add a Maven profile related to RocksDB, allowing
users to manually use this profile to build the project and enable this
feature when needed.

+1, it is a good idea to reserve the user's to choose.

> 2. The second method is to provide a bundled package for RocksDB, allowing
it to be dynamically added at runtime.

Will it be downloaded over the Internet? This is an unstable factor.

Best,
Qishang Zhong

Xavier Bai <x...@apache.org> 于2024年7月9日周二 14:37写道:

> +1 for option 1
>
> Jinsong Zhou <jinsongz...@apache.org> 于2024年7月9日周二 14:25写道:
>
> > Hi,
> >
> > Thanks for the input from xuba. Yes, Indeed, at this stage, we may still
> > need some methods to allow users to add support for RocksDB when needed.
> > However, we can consider removing it from the default installation
> package.
> >
> > In my opinion, there are two possible methods:
> > 1. The first one is to add a Maven profile related to RocksDB, allowing
> > users to manually use this profile to build the project and enable this
> > feature when needed.
> > 2. The second method is to provide a bundled package for RocksDB,
> allowing
> > it to be dynamically added at runtime.
> >
> > Method one is much easier to implement and we should implement it first,
> > and implement method two when we needed later.
> >
> > What do you think?
> >
> > Best,
> > Jinsong
> >
> > On Tue, Jul 9, 2024 at 11:09 AM Xavier Bai <x...@apache.org> wrote:
> >
> > > There are still many optimisers in PROD environments that have rocksDB
> > > storage enabled. Removing dependencies in projects is acceptable, but
> we
> > > should also provide documentation and description of what to do if
> users
> > > want to continue using the feature. For example, there could be support
> > for
> > > users to add dependencies individually, etc.
> > >
> > > Jinsong Zhou <jinsongz...@apache.org> 于2024年7月8日周一 17:36写道:
> > >
> > > > Hi devs,
> > > >
> > > > Recently, I have been working on reducing the size of the Amoro
> > > > installation package. Considering the Amoro installation package is
> > > almost
> > > > 1GB in size, this task really should be done ASAP.
> > > >
> > > > I found the largest dependent of Amoro is the rocksdb lib (more than
> > > 50MB).
> > > > It is used to cache some data to disk storage when the memory is not
> > > > enough. It is originally used to cache iceberg delete records in
> > > > optimizers. But when we have improved the delete records caching with
> > > bloom
> > > > filter, this feature is really not needed anymore.
> > > >
> > > > So I am considering removing the rocksdb dependencies from the
> project
> > to
> > > > reduce the installation package size.
> > > >
> > > > I  am looking forward to hearing any point from anyone regarding this
> > > > issue.
> > > >
> > > > Best regards,
> > > > Jinsong
> > > >
> > >
> >
>


-- 
Best Regards,
Qishang Zhong

Reply via email to