(beam) branch test-inference updated: Test without vllm

vterentev Wed, 06 May 2026 12:03:13 -0700

This is an automated email from the ASF dual-hosted git repository.

Amar3tto pushed a commit to branch test-inference
in repository https://gitbox.apache.org/repos/asf/beam.git



The following commit(s) were added to refs/heads/test-inference by this push:
     new 0b7f5da06cd Test without vllm
0b7f5da06cd is described below

commit 0b7f5da06cdb3af30737dadbe8a99fa7722fbdef
Author: Vitaly Terentyev <[email protected]>
AuthorDate: Wed May 6 23:02:49 2026 +0400

    Test without vllm
---
 .../beam_Inference_Python_Benchmarks_Dataflow.yml      | 18 ------------------
 1 file changed, 18 deletions(-)

diff --git a/.github/workflows/beam_Inference_Python_Benchmarks_Dataflow.yml b/.github/workflows/beam_Inference_Python_Benchmarks_Dataflow.yml
index 61de2b54e73..b92df46e446 100644
--- a/.github/workflows/beam_Inference_Python_Benchmarks_Dataflow.yml
+++ b/.github/workflows/beam_Inference_Python_Benchmarks_Dataflow.yml
@@ -97,24 +97,6 @@ jobs:
       # The env variables are created and populated in the test-arguments-action as "<github.job>_test_arguments_<argument_file_paths_index>"
       - name: get current time
         run: echo "NOW_UTC=$(date '+%m%d%H%M%S' --utc)" >> $GITHUB_ENV
-      - name: Build VLLM Development Image
-        id: build_vllm_image
-        uses: ./.github/actions/build-push-docker-action
-        with:
-          dockerfile_path: 'sdks/python/apache_beam/ml/inference/test_resources/vllm.dockerfile'
-          image_name: 'us-docker.pkg.dev/apache-beam-testing/beam-temp/beam-vllm-gpu-base'
-          image_tag: ${{ github.sha }}
-      - name: Run VLLM Gemma Batch Test
-        uses: ./.github/actions/gradle-command-self-hosted-action
-        timeout-minutes: 180
-        with:
-          gradle-command: :sdks:python:apache_beam:testing:load_tests:run
-          arguments: |
-            -PloadTest.mainClass=apache_beam.testing.benchmarks.inference.vllm_gemma_benchmarks \
-            -Prunner=DataflowRunner \
-            -PsdkLocationOverride=false \
-            -PpythonVersion=3.10 \
-            -PloadTest.requirementsTxtFile=apache_beam/ml/inference/vllm_tests_requirements.txt '-PloadTest.args=${{ env.beam_Inference_Python_Benchmarks_Dataflow_test_arguments_8 }} --mode=batch --job_name=benchmark-tests-vllm-with-gemma-2b-it-batch-${{env.NOW_UTC}} --sdk_container_image=${{ steps.build_vllm_image.outputs.image_url }}'
       - name: run Pytorch Sentiment Streaming using Hugging Face distilbert-base-uncased model
         uses: ./.github/actions/gradle-command-self-hosted-action
         timeout-minutes: 180
