Hi,

I had a short discussion with @QinshangZhong and @YongxiangZhang about this
topic online and we reached the conclusion:
* We may keep the apache/amoro, apache/amoro-flink-optimizer,
apache/amoro-spark-optimizer images.
* We may drop the quick demo image and simplify the existing quick demo
process, depending on the amoro and minio images.
* We may drop the datanode and namenode image.

Best,
Jinsong

On Wed, May 1, 2024 at 11:46 AM BaiyangTX <xiangneb...@163.com> wrote:

>
>
>
>
> Hi,
>
>
> I think apache/amoro  is a good name, ams and optimizer are just internal
> components of amoro, similar to Flink JM/TM, so they can be referred to by
> amoro, and the image does have all its features.
>
>
> Also, for QuickStart, I recommend changing its filesystem to minio or
> using an official HDFS image, which should not be maintained by Amoro.
>
>
> Best regards,
>
> BaiyangTX
>
>
> ---- Replied Message ----
> | From | Qishang Zhong<zhongqish...@gmail.com> |
> | Date | 4/30/2024 10:39 |
> | To | <dev@amoro.apache.org> |
> | Subject | Re: Adjust docker images of apache amoro project |
> Hi,
>
> IMO, I agree with jinsong's point of view. Using FlinkCDC to ingest data
> produces too many Delete Files, which is the vast majority of reasons why
> users choose amoro.
> Therefore, it would be better to provide a quick start image and provide
> the most realistic scene demo to users who are exposed to amoro for the
> first time.
>
> Regarding mirroring, I agree with Baiyang's point of view. Only keep
> production-available mirrors under apache, and for quick start we can put
> them under amoro (including datanode and namenode)
>
> - apache/amoro-ams (Just `amoro` may not be very clear, because amoro also
> includes many things like mixed format, optimizer)
> - apache/amoro-optimizer-flink (maybe need to rename)
> - apache/amoro-optimizer-spark (maybe need to rename)
> - amoro/amoro-quick-demo
> - amoro/amoro-datanode
> - amoro/amoro-datanode
>
> Best regards,
> Qishang Zhong
>
>
> Jinsong Zhou <jinsongz...@apache.org> 于2024年4月30日周二 10:01写道:
>
> Hi,
>
> Executing Insert/Delete/Update SQL through the Terminal can also trigger
> self-optimizing and greatly reduce the difficulty of the quick demo
> process.
> However, CDC streaming ingestion remains a scenario that most users are
> very interested in, and I'm not sure whether it should be removed from the
> quick demo.
> I'd like to hear more inputs from other developers and users.
>
> Best,
> Jinsong
>
> On Mon, Apr 29, 2024 at 3:07 PM Gang Huang <tcodehu...@gmail.com> wrote:
>
> Thanks for your rely.
>
> I agree with the vast majority of opinions. But I think we should
> simplify
> the quick start(whether installation or usage) for a newbie. From this
> point, we can remove amoro-quick-demo image instead of using amoro-ams
> image.
> Users can insert/delete/update rows by ams' Terminal page, then we can
> trigger minor/major processes.
>
>
>
> .
>
> Jinsong Zhou <jinsongz...@apache.org> 于2024年4月29日周一 14:44写道:
>
> Hi,
>
> Thanks a lot for driving this.
>
> I agree that we should keep the amoro-ams(renamed from amoro),
> amoro-flink-optimizer(renamed from flink-optimizer), and
> amoro-spark-optimizer(renamed from spark-optimizer) images.
> Besides, considering that the quick demo now requires additional usage
> of
> the Flink engine to complete CDC data ingestion, we still need the
> amoro-quick-demo image until we plan to adjust the quick demo process.
>
> Best,
> Jinsong
>
> On Sat, Apr 27, 2024 at 1:30 PM BaiyangTX <xiangneb...@163.com> wrote:
>
>
>
> Hi,
> 
> The docker image currently maintained in the project does have room
> for
> further optimization.
> 
> amoro: This is the core image of the project. It provides AMS
> deployment
> in early versions. After this PR will be merged (
> https://github.com/apache/amoro/pull/2695), it can also directly
> provide
> the deployment of Optimizer on K8S. , this should also be the
> recommended
> using in the future. I recommend maintaining this image in the apache
> repo
> in the future, using apache/amoro as the image name. Here you can
> refer
> to
> kyuubi (https://hub.docker.com/r/apache/kyuubi)
> 
> optimizer-flink/optimizer-spark: Provides images of optimizer
> deployment
> under different computing engines. I also recommend that these two
> images
> be maintained under the apache repo, using
> apache/amoro-flink-optimizer
> and
> apache/amoro-spark-optimizer as the image id.
> 
> quickstart: Used to demonstrate the QuickStart part of the official
> website. This image is based on the amoro image and includes
> computing
> engines such as flink/spark and connectors such as iceberg. It is
> recommended to use amoro's repo maintenance for this part and use
> amoro/quickstart as the image name.
> 
> namenode/datanode: These two images are used to provide an HDFS
> environment in the qucikstart demonstration. I suggest modifying the
> current quickstart process, using minio as the quickstart
> environment,
> and
> no longer maintaining these two images.
> 
>  The above are my personal suggestions.
>
>
> Kind Regards,
> baiyangtx
>
>
> ---- Replied Message ----
> | From | Gang Huang<tcodehu...@gmail.com> |
> | Date | 4/26/2024 17:03 |
> | To | <dev@amoro.apache.org> |
> | Subject | Adjust docker images of apache amoro project |
> Hi,
>
> Currently, there are up to 6 docker images in apache amoro project.
> But
> in
> my opinion, only amoro, optimizer-flink and optimizer-spark maybe are
> needed. Furthermore, we have to change the images' final
> names(amoro-ams,
> amoro-optimizer-flink, amoro-optimizer-spark) to identify them when
> uploading them into docker hub.
>
> Thus, we can simplify the quick start process for better user
> experience,
> just like iceberg/risingwave quickstart:
> https://iceberg.apache.org/spark-quickstart/
> https://docs.risingwave.com/docs/current/get-started/
>
> Please feel free to contribute your suggestions.
>
>
>
> Kind Regards,
> Gang Huang
>
>
>
>
>
>
> --
> Best Regards,
> Qishang Zhong
>

Reply via email to