Hi Yuval,

thank you for sharing all the information. I forgot to mention the Lyft 
operator. Thanks for "adding" it to the list.
About the dual cluster approach during upgrade I have some doubts about the 
resource usage. If you are operating some "big" jobs that would mean you always 
have to provide enough resources to run two of them in parallel during the 
upgrade or is there some workaround (downscaling, upscaling) available?
I will further investigate how the option three 
"GoogleCloudPlatform/flink-on-k8s-operator" is implementing the upgrade process.

I agree that there is a need for a community based operator project. It is 
unfortunate that both "relevant" projects (Lyft, GCP) have been more or less 
abandoned. The only thing that is left I can see with some activity is a fork 
of the GCP operator from spotify [0], but there is only one person involved.

Regards,
Niklas

[0] https://github.com/spotify/flink-on-k8s-operator


UNIBERG GmbH 
Simon-von-Utrecht-Straße 85a
20359 Hamburg

niklas.wil...@uniberg.com
Mobile: +49 160 9793 2593
Office: +49 40 2380 6523


UNIBERG GmbH, Dorfstraße 3, 23816 Bebensee 

Registergericht / Register: Amtsgericht Kiel HRB SE-1507
Geschäftsführer / CEO‘s: Andreas Möller, Martin Ulbricht

Informationen zum Datenschutz / Privacy Information: 
https://www.uniberg.com/impressum.html

> On 6. Aug 2021, at 16:59, Yuval Itzchakov <yuva...@gmail.com> wrote:
> 
> Hi Niklas,
> 
> We are currently using the Lyft operator for Flink in production 
> (https://github.com/lyft/flinkk8soperator 
> <https://github.com/lyft/flinkk8soperator>), which is additional alternative. 
> The project itself is pretty much in Zombie state, but commits happen every 
> now and then.
> 
> 1. Native Kubernetes could definitely work with GitOps, it would just require 
> you to do lots of steps "by hand" in terms of application upgrade and 
> rollover.
> 2. We're using Lyfts operator as mentioned above. It mostly works well, there 
> were several issues we had along the way but were mostly resolved. One 
> feature that is missing for us specifically is being able to perform an 
> upgrade by first savepointing and killing the existing cluster and only then 
> deploying a new one (their approach is dual, meaning have two clusters up and 
> running before doing the rollover).
> 3. At it's current state it looks more like a side project than an actively 
> maintained operator.
> 4. Ververica is definitely an option, we haven't tested their operator, not 
> sure about the maturity level yet.
> 
> I think a Flink community based operator for k8s is a much needed project 
> (which I'd be happy to contribute to).
> 
> 
> 
> 
> On Fri, Aug 6, 2021, 14:49 Niklas Wilcke <niklas.wil...@uniberg.com 
> <mailto:niklas.wil...@uniberg.com>> wrote:
> Hi Flink Community,
> 
> I'm currently assessing the situation about how to properly deploy Flink on 
> Kubernetes via GitOps. There are some options available to deploy Flink on 
> Kubernetes, which I would like to discuss.  In general we are looking for an 
> open source or at least unpaid solution, but I don't exclude paid solutions 
> from the beginning.
> I see the following options.
> 
> 1. Kubernetes Standalone [1]
>       * Seems to be deprecated, since the docs state to use Native Kubernetes 
> instead
> 2. Native Kubernetes [2]
>       * Doesn't seem to implement the Kubernetes operator pattern
>       * Seems to require command line activities to be operated / upgraded 
> (not GitOps compatible out of the box)
> 3. "GoogleCloudPlatform/flink-on-k8s-operator" Operator [3]
>       * Seems not to be well maintained / documented
>       * We had some trouble with crashes during configuration changes, but we 
> need to investigate further
>       * There is a "maintained" fork from spotify, which could be an option
> 4. Flink Native Kubernetes Operator [4]
>       * Seems to be a private project from a Flink Committer, which might not 
> be mature enough for a stable operation
> 5. Proprietary Solution Ververica Platform [5]
>       * I didn't try it out yet and have no experience with it
>       * I'm unsure whether the Community Edition is suited for a production 
> environment. (one namespace, no auto scaling, no RBAC, etc.)
> 
> I have the following questions.
> 
> 1. Is the "Native Kubernetes" approach suited to be operated via Gitops and 
> does it have some drawbacks compared to an operator based setup? (e.g. is a 
> rollback during a failed upgrade possible?)
> 2. Are there any experiences with the 
> "GoogleCloudPlatform/flink-on-k8s-operator" or a fork of it in a production 
> environment?
> 3. Is the "Flink Native Kubernetes Operator" an option or is it just a 
> playground project. How is it related to the "Native Kubernetes" setup? Is it 
> going to be "integrated" into Flink?
> 4. Is a proprietary unpaid solution like "Ververica Platform Community 
> Edition" a solution for a production environment or will it definitely lack 
> features I need?
> 
> Any information or feedback is highly appreciated. Thank you very much in 
> advance.
> 
> Kind Regards,
> Niklas Wilcke
> 
> 
> [1] 
> https://ci.apache.org/projects/flink/flink-docs-release-1.13/docs/deployment/resource-providers/standalone/kubernetes/
>  
> <https://ci.apache.org/projects/flink/flink-docs-release-1.13/docs/deployment/resource-providers/standalone/kubernetes/>
> [2] 
> https://ci.apache.org/projects/flink/flink-docs-release-1.13/docs/deployment/resource-providers/native_kubernetes/
>  
> <https://ci.apache.org/projects/flink/flink-docs-release-1.13/docs/deployment/resource-providers/native_kubernetes/>
> [3] https://github.com/GoogleCloudPlatform/flink-on-k8s-operator 
> <https://github.com/GoogleCloudPlatform/flink-on-k8s-operator>
> [4] https://github.com/wangyang0918/flink-native-k8s-operator 
> <https://github.com/wangyang0918/flink-native-k8s-operator>
> [5] https://www.ververica.com/getting-started-flink-ververica 
> <https://www.ververica.com/getting-started-flink-ververica>
> 
> 
> 
> 
> UNIBERG GmbH 
> Simon-von-Utrecht-Straße 85a
> 20359 Hamburg
> 
> niklas.wil...@uniberg.com <mailto:niklas.wil...@uniberg.com>
> Mobile: +49 160 9793 2593
> Office: +49 40 2380 6523
> 
> 
> UNIBERG GmbH, Dorfstraße 3, 23816 Bebensee 
> 
> Registergericht / Register: Amtsgericht Kiel HRB SE-1507
> Geschäftsführer / CEO‘s: Andreas Möller, Martin Ulbricht
> 
> Informationen zum Datenschutz / Privacy Information: 
> https://www.uniberg.com/impressum.html 
> <https://www.uniberg.com/impressum.html>

Attachment: smime.p7s
Description: S/MIME cryptographic signature

Reply via email to