Re: [PR] image embeddings. [beam]

via GitHub Tue, 17 Feb 2026 13:30:03 -0800


gemini-code-assist[bot] commented on PR #37628:
URL: https://github.com/apache/beam/pull/37628#issuecomment-3917180473


   ## Summary of Changes
   
   Hello @claudevdm, I'm Gemini Code Assist[^1]! I'm currently reviewing this 
pull request and will post my feedback shortly. In the meantime, here's a 
summary to help you and other reviewers quickly get up to speed!
   
   This pull request significantly enhances Apache Beam's RAG capabilities by 
integrating robust support for image embeddings. It introduces the necessary 
infrastructure to process and embed image content using popular ML frameworks 
like HuggingFace and Vertex AI, alongside existing text embedding 
functionalities. The changes streamline the handling of multimodal data within 
the RAG pipeline, from data representation to ingestion into various vector 
databases, thereby broadening the scope of applications that can leverage Beam 
for advanced retrieval augmented generation.
   
   ### Highlights
   
   * **Image Embedding Support**: Introduced new classes and functionalities to 
support image embeddings for both HuggingFace and Vertex AI models within the 
RAG pipeline. This allows users to generate embeddings from image content, 
expanding the multimodal capabilities of Apache Beam's ML RAG components.
   * **Enhanced EmbeddableItem**: The `EmbeddableItem` data structure has been 
updated to include an optional `image` field in its `Content` attribute. A new 
`from_image` factory method simplifies the creation of image-based 
`EmbeddableItem` instances, and a `content_string` property was added to 
provide a unified string representation for ingestion, prioritizing text over 
image URI.
   * **Ingestion Pipeline Updates**: BigQuery, MySQL, PostgreSQL, and Spanner 
ingestion modules were modified to leverage the new `content_string` property 
of `EmbeddableItem`. This ensures that these pipelines can correctly process 
and store both text and image URI content, adapting to the expanded 
`EmbeddableItem` definition.
   
   🧠 **New Feature in Public Preview:** You can now enable **Memory** to help 
**Gemini Code Assist** learn from your team's feedback. This makes future code 
reviews more consistent and personalized to your project's style. **Click 
[here](https://codeassist.google/code-review/login) to enable Memory in your 
admin console.**
   
   <details>
   <summary><b>Changelog</b></summary>
   
   * **sdks/python/apache_beam/ml/rag/embeddings/base_test.py**
       * Imported `EmbeddableItem` for testing purposes.
       * Added `ImageEmbeddableItemTest` to validate the 
`EmbeddableItem.from_image` factory method.
       * Added `ContentStringTest` to verify the functionality of the new 
`content_string` property.
   * **sdks/python/apache_beam/ml/rag/embeddings/huggingface.py**
       * Added imports for `io`, `Sequence`, `_add_embedding_fn`, 
`EmbeddingTypeAdapter`, `_ImageEmbeddingHandler`, and `PIL.Image`.
       * Updated the docstring for `HuggingfaceTextEmbeddings` to provide more 
detailed argument descriptions.
       * Implemented `_extract_images` to convert `EmbeddableItem` image 
content into PIL Image objects.
       * Created `_create_hf_image_adapter` to generate an 
`EmbeddingTypeAdapter` specifically for HuggingFace image embeddings.
       * Introduced `HuggingfaceImageEmbeddings` class to manage image 
embedding using HuggingFace models.
   * **sdks/python/apache_beam/ml/rag/embeddings/huggingface_test.py**
       * Added imports for `os`, `HuggingfaceImageEmbeddings`, 
`_create_hf_image_adapter`, `EmbeddableItem`, and `PIL.Image`.
       * Included a `PIL_AVAILABLE` check to conditionally run tests based on 
Pillow library availability.
       * Added `HuggingfaceImageAdapterTest` to test the functionality of the 
HuggingFace image adapter.
       * Implemented `HuggingfaceImageEmbeddingsTest` to verify the end-to-end 
image embedding pipeline for HuggingFace models, including tests for both image 
bytes and file paths.
       * Added a helper function `_create_png_bytes` to generate sample PNG 
image data for tests.
   * **sdks/python/apache_beam/ml/rag/embeddings/vertex_ai.py**
       * Added imports for `Sequence`, `_add_embedding_fn`, 
`EmbeddingTypeAdapter`, `_ImageEmbeddingHandler`, `Chunk`, 
`_VertexAIImageEmbeddingHandler`, and `vertexai.vision_models.Image`.
       * Modified the `get_ptransform_for_processing` method in 
`VertexAITextEmbeddings` to return `beam.PCollection[Chunk]`.
       * Implemented `_extract_images` to convert `EmbeddableItem` image 
content into Vertex AI `Image` objects.
       * Created `_create_image_adapter` to generate an `EmbeddingTypeAdapter` 
for Vertex AI image embeddings.
       * Introduced `VertexAIImageEmbeddings` class to manage image embedding 
using Vertex AI models.
   * **sdks/python/apache_beam/ml/rag/embeddings/vertex_ai_test.py**
       * Added imports for `os`, `EmbeddableItem`, `VertexAIImageEmbeddings`, 
and `_create_image_adapter`.
       * Implemented `VertexAIImageAdapterTest` to test the functionality of 
the Vertex AI image adapter.
       * Added `VertexAIImageEmbeddingsTest` to verify the end-to-end image 
embedding pipeline for Vertex AI models, including tests for both image bytes 
and file paths.
       * Included a static helper method `_create_png_bytes` to generate sample 
PNG image data for tests.
   * **sdks/python/apache_beam/ml/rag/ingestion/bigquery.py**
       * Updated example `embeddable_to_dict_fn` to use `item` instead of 
`chunk` for consistency.
       * Modified `_default_embeddable_to_dict_fn` to utilize 
`item.content_string` and adjusted the `ValueError` message.
       * Updated the lambda variable name in the `expand` method for improved 
readability.
   * **sdks/python/apache_beam/ml/rag/ingestion/mysql_common.py**
       * Modified the `value_fn` within `with_content_spec` to use 
`chunk.content_string` and removed the explicit `ValueError` check for 
`content.text`.
   * **sdks/python/apache_beam/ml/rag/ingestion/postgres_common.py**
       * Modified the `value_fn` within `with_content_spec` to use 
`chunk.content_string` and removed the explicit `ValueError` check for 
`content.text`.
   * **sdks/python/apache_beam/ml/rag/ingestion/spanner.py**
       * Modified the `extract_fn` within `with_content_spec` to use 
`embeddable.content_string` and updated its return type to `Optional[str]`, 
removing the `ValueError` check.
   * **sdks/python/apache_beam/ml/rag/types.py**
       * Imported `Union` for type hinting flexibility.
       * Added an `image: Optional[Union[bytes, str]] = None` field to the 
`Content` dataclass to support image content.
       * Introduced a `from_image` class method to `EmbeddableItem` for 
convenient creation of items with image content.
       * Added a `content_string` property to `EmbeddableItem` that returns a 
storable string representation, prioritizing text content or image URI.
   </details>
   
   <details>
   <summary><b>Activity</b></summary>
   
   * The pull request was opened by claudevdm with the title 'image embeddings'.
   * The initial description is a placeholder, indicating that a more 
meaningful description is needed from the author.
   * No further activity, comments, or reviews have been recorded since the 
pull request's creation.
   </details>
   
   <details>
   <summary><b>Using Gemini Code Assist</b></summary>
   <br>
   
   The full guide for Gemini Code Assist can be found on our [documentation 
page](https://developers.google.com/gemini-code-assist/docs/review-github-code),
 here are some quick tips.
   
   <b>Invoking Gemini</b>
   
   You can request assistance from Gemini at any point by creating a comment 
using either `/gemini <command>` or `@gemini-code-assist <command>`. Below is a 
summary of the supported commands on the current page.
   
   Feature | Command | Description
   --- | --- | ---
   Code Review | `/gemini review` | Performs a code review for the current pull 
request in its current state.
   Pull Request Summary | `/gemini summary` | Provides a summary of the current 
pull request in its current state.
   Comment | @gemini-code-assist | Responds in comments when explicitly tagged, 
both in pull request comments and review comments.
   Help | `/gemini help` | Displays a list of available commands.
   
   <b>Customization</b>
   
   To customize Gemini Code Assist for GitHub experience, repository 
maintainers can create a configuration file and/or provide a custom code review 
style guide (such as PEP-8 for Python) by creating and adding files to a 
`.gemini/` folder in the base of the repository. Detailed instructions can be 
found 
[here](https://developers.google.com/gemini-code-assist/docs/customize-gemini-behavior-github).
   
   <b>Limitations & Feedback</b>
   
   Gemini Code Assist may make mistakes. Please leave feedback on any instances 
where its feedback is incorrect or counter productive. You can react with 
:thumbsup: and :thumbsdown: on @gemini-code-assist comments. If you're 
interested in giving your feedback about your experience with Gemini Code 
Assist for Github and other Google products, sign up 
[here](https://google.qualtrics.com/jfe/form/SV_2cyuGuTWsEw84yG).
   
   <b>You can also get AI-powered code generation, chat, as well as code 
reviews directly in the IDE at no cost with the [Gemini Code Assist IDE 
Extension](https://cloud.google.com/products/gemini/code-assist).</b>
   </details>
   
   
   [^1]: Review the [Privacy Notices](https://policies.google.com/privacy), 
[Generative AI Prohibited Use 
Policy](https://policies.google.com/terms/generative-ai/use-policy), [Terms of 
Service](https://policies.google.com/terms), and learn how to configure Gemini 
Code Assist in GitHub 
[here](https://developers.google.com/gemini-code-assist/docs/customize-gemini-behavior-github).
 Gemini can make mistakes, so double check it and [use code with 
caution](https://support.google.com/legal/answer/13505487).
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Re: [PR] image embeddings. [beam]

Reply via email to