This is an automated email from the ASF dual-hosted git repository.

xqhu pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/beam.git


The following commit(s) were added to refs/heads/master by this push:
     new 0eb3626b3a2 Add Albertsons case study (#37201)
0eb3626b3a2 is described below

commit 0eb3626b3a21a3911c14cf7efdc106c24c5cc8fa
Author: Danny McCormick <[email protected]>
AuthorDate: Tue Dec 30 19:44:01 2025 -0500

    Add Albertsons case study (#37201)
    
    * Add Albertsons case study
    
    * fix image ref
    
    * Fix table formatting
    
    * early tuesday
---
 .../www/site/content/en/case-studies/albertsons.md | 203 +++++++++++++++++++++
 website/www/site/data/en/quotes.yaml               |   5 +
 .../images/case-study/albertsons/ankurraj.jpeg     | Bin 0 -> 27941 bytes
 .../case-study/albertsons/mohammedjawedkhan.jpeg   | Bin 0 -> 5515 bytes
 .../images/case-study/albertsons/utkarshparekh.png | Bin 0 -> 1239692 bytes
 .../images/case-study/albertsons/vinaydesai.jpeg   | Bin 0 -> 42897 bytes
 .../static/images/logos/powered-by/albertsons.jpg  | Bin 0 -> 192436 bytes
 7 files changed, 208 insertions(+)

diff --git a/website/www/site/content/en/case-studies/albertsons.md 
b/website/www/site/content/en/case-studies/albertsons.md
new file mode 100644
index 00000000000..167275b10c8
--- /dev/null
+++ b/website/www/site/content/en/case-studies/albertsons.md
@@ -0,0 +1,203 @@
+---
+title: "Albertsons: Using Apache Beam for Unified Analytics Ingestion"
+name: "Albertsons: Beam for Analytics Ingestion"
+icon: /images/logos/powered-by/albertsons.jpg
+hasNav: true
+category: study
+cardTitle: "Albertsons: Using Apache Beam for Unified Analytics Ingestion"
+cardDescription: "Apache Beam enabled Albertsons to standardize ingestion into 
a resilient and portable framework, delivering 99.9% reliability at enterprise 
scale across both real-time signals and core business data."
+authorName: "Utkarsh Parekh"
+authorPosition: "Staff Engineer, Data @ Albertsons"
+authorImg: /images/case-study/albertsons/utkarshparekh.png
+publishDate: 2026-01-06T00:04:00+00:00
+---
+<!--
+Licensed under the Apache License, Version 2.0 (the "License");
+you may not use this file except in compliance with the License.
+You may obtain a copy of the License at
+
+http://www.apache.org/licenses/LICENSE-2.0
+
+Unless required by applicable law or agreed to in writing, software
+distributed under the License is distributed on an "AS IS" BASIS,
+WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+See the License for the specific language governing permissions and
+limitations under the License.
+-->
+<!-- div with class case-study-opinion is displayed at the top left area of 
the case study page -->
+<div class="case-study-opinion">
+    <div class="case-study-opinion-img">
+        <img src="/images/logos/powered-by/albertsons.jpg"/>
+    </div>
+    <blockquote class="case-study-quote-block">
+      <p class="case-study-quote-text">
+        “Apache Beam enabled Albertsons to standardize ingestion into a 
resilient and portable framework, delivering 99.9% reliability at enterprise 
scale across both real-time signals and core business data.”
+      </p>
+      <div class="case-study-quote-author">
+        <div class="case-study-quote-author-img">
+            <img src="/images/case-study/albertsons/utkarshparekh.png">
+        </div>
+        <div class="case-study-quote-author-info">
+            <div class="case-study-quote-author-name">
+              Utkarsh Parekh
+            </div>
+            <div class="case-study-quote-author-position">
+              Staff Engineer, Data @ Albertsons
+            </div>
+        </div>
+      </div>
+    </blockquote>
+</div>
+
+<!-- div with class case-study-post is the case study page main content -->
+<div class="case-study-post">
+
+# Albertsons: Using Apache Beam for Unified Analytics Ingestion
+
+## Context
+
+Albertsons Companies is one of the largest retail grocery organizations in 
North America, operating over 2,200 stores and serving millions of customers 
across physical and digital channels.
+
+Apache Beam is the foundation of the **internal Unified Data Ingestion 
framework**, a standardized enterprise ELT platform that delivers both 
streaming and batch data into modern cloud analytics systems. The framework 
uses **both Java and Python Beam SDKs, Dataflow Flex Templates, enabling 
flexibility across workloads. When a capability is not yet supported in the 
Python SDK but is available in the Java SDK, we can seamlessly leverage 
Java-based implementations to deliver the required fu [...]
+
+This unified architecture reduces duplicated logic, standardizes governance, 
and accelerates data enablement across business domains.
+
+## Challenges and Use Cases
+
+Before Apache Beam, ingestion patterns were fragmented across streaming and 
batch pipelines. This led to longer development cycles, inconsistent data 
quality, and increased operational overhead.
+
+The framework’s architecture emphasizes object-oriented principles including 
single responsibility, modularity, and separation of concerns. This enables 
reusable Beam transforms, configurable IO connectors, and clean abstractions 
between orchestration and execution layers.
+
+Beam enabled:
+
+- Unified development for real-time and scheduled ingestion
+- Standardized connectivity to enterprise systems
+- Reliable governance and observability baked into pipelines
+
+
+The framework supports:
+
+- **Real-time streaming analytics** from operational and digital signals
+- **Batch ingestion** from mission-critical enterprise systems
+- **File-based ingestion** for vendor and financial datasets
+- **Legacy MQ ingestion** using JMSIO-based connectors
+
+To scale efficiently, the framework features **Apache Airflow dynamic DAG 
creation.**
+
+Metadata-driven ingestion jobs generate DAGs automatically at runtime, and 
**BashOperator** is used to submit **Dataflow** jobs for consistent execution, 
security, and monitoring.
+
+Common Beam transforms include Impulse, windowing, grouping, and batching 
optimizations.
+
+<blockquote class="case-study-quote-block case-study-quote-wrapped">
+  <p class="case-study-quote-text">
+    In Albertsons we utilized Apache Beam to write an in-house framework that 
enabled our data engineering teams to create robust data pipelines through a 
consistent - single interface. The framework helped reduce the overall 
development cycle since we templatized the various data integration patterns. 
Having a custom framework gave us flexibility to prioritize and configure 
multiple technologies/integration points like Kafka, Files, Managed Queues, 
Databases (Oracle, DB2, Azure SQL etc. [...]
+  </p>
+  <div class="case-study-quote-author">
+    <div class="case-study-quote-author-img">
+        <img src="/images/case-study/albertsons/mohammedjawedkhan.jpeg">
+    </div>
+    <div class="case-study-quote-author-info">
+        <div class="case-study-quote-author-name">
+          Mohammed Jawed Khan
+        </div>
+        <div class="case-study-quote-author-position">
+          Principal Data Engineer @ Albertsons
+        </div>
+    </div>
+  </div>
+</blockquote>
+
+## Technical Data
+
+Apache Beam pipelines operate at enterprise scale:
+
+- Hundreds of production pipelines
+- Terabytes of data processed weekly, including thousands of streaming events 
per second.
+
+All ingestion paths adhere to internal security controls and support 
**tokenization** for PII and sensitive data protection using Protegrity.
+
+## Results
+
+Apache Beam has significantly improved the reliability, reusability, and speed 
of Albertsons’ data  platforms:
+
+{{< table >}}
+| Area                   | Outcome                                             
|
+| ---------------------- | --------------------------------------------------- 
|
+| Reliability            | **99.9%+ uptime** for data ingestion                
|
+| Developer Productivity | Pipelines created faster via standardized templates 
|
+| Operational Efficiency | **Autoscaling** optimizes resource utilization      
|
+| Business Enablement    | Enables **real-time decisioning**                   
|
+{{< /table >}}
+
+### Business Impact
+
+Beam enabled one unified ingestion framework that supports both streaming and 
batch workloads - eliminating fragmentation and delivering trusted signals to 
analytics.
+
+<blockquote class="case-study-quote-block case-study-quote-wrapped">
+  <p class="case-study-quote-text">
+    Integrating Apache Beam into our in-house ELT platform has reduced 
engineering effort and operational overhead, while improving efficiency at 
scale. Teams can now focus more on delivering business outcomes instead of 
managing infrastructure.
+  </p>
+  <div class="case-study-quote-author">
+    <div class="case-study-quote-author-img">
+        <img src="/images/case-study/albertsons/vinaydesai.jpeg">
+    </div>
+    <div class="case-study-quote-author-info">
+        <div class="case-study-quote-author-name">
+          Vinay Desai
+        </div>
+        <div class="case-study-quote-author-position">
+          Director Engineering @ Albertsons
+        </div>
+    </div>
+  </div>
+</blockquote>
+
+<blockquote class="case-study-quote-block case-study-quote-wrapped">
+  <p class="case-study-quote-text">
+    By leveraging Apache Beam into the ACI platform, we achieved a significant 
reduction in downtime. The adoption of reusable features further minimized the 
risk of production issues.
+  </p>
+  <div class="case-study-quote-author">
+    <div class="case-study-quote-author-img">
+        <img src="/images/case-study/albertsons/ankurraj.jpeg">
+    </div>
+    <div class="case-study-quote-author-info">
+        <div class="case-study-quote-author-name">
+          Ankur Raj
+        </div>
+        <div class="case-study-quote-author-position">
+          Director , Data Engineering Operations @ Albertsons
+        </div>
+    </div>
+  </div>
+</blockquote>
+
+## Infrastructure
+
+{{< table >}}
+| Component              | Detail                                        |
+| ---------------------- | --------------------------------------------- |
+| Cloud                  | Google Cloud Platform                         |
+| Runner                 | DataflowRunner                                |
+| Beam SDKs              | Java & Python                                 |
+| Workflow Orchestration | Apache Airflow with dynamic DAG creation      |
+| Deployment             | BashOperator submits Dataflow jobs            |
+| Sources                | Kafka, JDBC systems, files, MQ, APIs          |
+| Targets                | BigQuery, GCS, Kafka                          |
+| Observability          | Centralized logging, alerting, retry patterns |
+{{< /table >}}
+
+Deployment is portable across Dev, QA, and Prod environments.
+
+## Beam Community & Evolution
+
+Beam community resources supported the framework’s growth through:
+
+- Slack & developer channels
+- Documentation
+- Beam Summit participation
+
+<!-- case_study_feedback adds feedback buttons -->
+{{< case_study_feedback "AlbertsonsCompanies" >}}
+
+</div>
+<div class="clear-nav"></div>
\ No newline at end of file
diff --git a/website/www/site/data/en/quotes.yaml 
b/website/www/site/data/en/quotes.yaml
index 4ae6cca442a..2c0dbef443f 100644
--- a/website/www/site/data/en/quotes.yaml
+++ b/website/www/site/data/en/quotes.yaml
@@ -41,6 +41,11 @@
   logoUrl: images/logos/powered-by/credit-karma.png
   linkUrl: case-studies/creditkarma/index.html
   linkText: Learn more
+- text: Apache Beam enabled Albertsons to standardize ingestion into a 
resilient and portable framework, delivering 99.9% reliability at enterprise 
scale across both real-time signals and core business data.
+  icon: icons/quote-icon.svg
+  logoUrl: images/logos/powered-by/albertsons.jpg
+  linkUrl: case-studies/creditkarma/index.html
+  linkText: Learn more
 - text: Apache Beam is a central component to Intuit's Stream Processing 
Platform, which has driven 3x faster time-to-production for authoring a stream 
processing pipeline.
   icon: icons/quote-icon.svg
   logoUrl: images/case-study/intuit/intuit-quote.png
diff --git a/website/www/site/static/images/case-study/albertsons/ankurraj.jpeg 
b/website/www/site/static/images/case-study/albertsons/ankurraj.jpeg
new file mode 100644
index 00000000000..47a5f83b75c
Binary files /dev/null and 
b/website/www/site/static/images/case-study/albertsons/ankurraj.jpeg differ
diff --git 
a/website/www/site/static/images/case-study/albertsons/mohammedjawedkhan.jpeg 
b/website/www/site/static/images/case-study/albertsons/mohammedjawedkhan.jpeg
new file mode 100644
index 00000000000..579bc885e63
Binary files /dev/null and 
b/website/www/site/static/images/case-study/albertsons/mohammedjawedkhan.jpeg 
differ
diff --git 
a/website/www/site/static/images/case-study/albertsons/utkarshparekh.png 
b/website/www/site/static/images/case-study/albertsons/utkarshparekh.png
new file mode 100644
index 00000000000..9383e0be40b
Binary files /dev/null and 
b/website/www/site/static/images/case-study/albertsons/utkarshparekh.png differ
diff --git 
a/website/www/site/static/images/case-study/albertsons/vinaydesai.jpeg 
b/website/www/site/static/images/case-study/albertsons/vinaydesai.jpeg
new file mode 100644
index 00000000000..6ff6938a2f7
Binary files /dev/null and 
b/website/www/site/static/images/case-study/albertsons/vinaydesai.jpeg differ
diff --git a/website/www/site/static/images/logos/powered-by/albertsons.jpg 
b/website/www/site/static/images/logos/powered-by/albertsons.jpg
new file mode 100644
index 00000000000..35d5e40f96e
Binary files /dev/null and 
b/website/www/site/static/images/logos/powered-by/albertsons.jpg differ

Reply via email to