Re: [PR] Creating a Fully Managed Beam Streaming System with Flink Runner on Kubernetes - Part 3 [beam]

via GitHub Tue, 26 Dec 2023 13:50:22 -0800


talatuyarer commented on code in PR #29860:
URL: https://github.com/apache/beam/pull/29860#discussion_r1436624767



##########
website/www/site/content/en/blog/apache-beam-flink-and-kubernetes-part3.md:
##########
@@ -0,0 +1,174 @@
+---
+title:  "Behind the Scenes: Crafting an Autoscaler for Apache Beam in a 
High-Volume Streaming Environment"
+date:   2023-12-21 09:00:00 -0400
+categories:
+  - blog
+authors:
+  - talat
+---
+<!--
+Licensed under the Apache License, Version 2.0 (the "License");
+you may not use this file except in compliance with the License.
+You may obtain a copy of the License at
+http://www.apache.org/licenses/LICENSE-2.0
+Unless required by applicable law or agreed to in writing, software
+distributed under the License is distributed on an "AS IS" BASIS,
+WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+See the License for the specific language governing permissions and
+limitations under the License.
+-->
+
+
+### Introduction to the Design of Our Autoscaler for Apache Beam Jobs
+
+Welcome to the third and final part of our blog series on building a scalable, 
self-managed streaming infrastructure with Beam and Flink. In our previous 
post, we delved into the scale of our streaming platforms, highlighting our 
capacity to manage over 40,000 streaming jobs and process upwards of 10 million 
events per second. This impressive scale sets the stage for the challenge we 
address today: the intricate task of resource allocation in a dynamic streaming 
environment.
+
+In this blog post [Talat Uyarer (Architect / Senior Principal 
Engineer)](https://www.linkedin.com/in/talatuyarer/), [Rishabh Kedia (Principal 
Engineer)](https://www.linkedin.com/in/rishabhkedia/) describe more details 
about our Autoscaler. Imagine a scenario where your streaming system is 
inundated with fluctuating workloads. Our case presents a unique challenge, as 
our customers, equipped with firewalls distributed globally, generate logs at 
various times of the day. This results in workloads that not only vary by time 
but also escalate over time due to changes in settings or the addition of new 
cybersecurity solutions from PANW. Furthermore, updates to our codebase 
necessitate rolling out changes across all streaming jobs, leading to a 
temporary surge in demand as the system processes unprocessed data.
+
+<img class="center-block"
+src="/images/blog/apache-beam-flink-and-kubernetes-part3/resource-allocation.png"
+alt="Resource Allocation">

Review Comment:
   I updated the image



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Re: [PR] Creating a Fully Managed Beam Streaming System with Flink Runner on Kubernetes - Part 3 [beam]

Reply via email to