[ https://issues.apache.org/jira/browse/GSOC-278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17933935#comment-17933935 ]

XQ Hu commented on GSOC-278:
----------------------------

Information to get started:

- Beam contribution guide - [https://github.com/apache/beam/blob/master/CONTRIBUTING.md]
- Beam YAML overview - [https://beam.apache.org/documentation/sdks/yaml/]
- Beam YAML docs - [https://beam.apache.org/releases/yamldoc/current/]
- Existing YAML examples - [https://github.com/apache/beam/tree/master/sdks/python/apache_beam/yaml/examples]
- Beam website source - [https://github.com/apache/beam/tree/master/website]

> Beam YAML: ML, Iceberg, and Kafka User Accessibility
> ----------------------------------------------------
>
>                 Key: GSOC-278
>                 URL: https://issues.apache.org/jira/browse/GSOC-278
>             Project: Comdev GSOC
>          Issue Type: Task
>            Reporter: XQ Hu
>            Priority: Major
>              Labels: Beam, gsoc, gsoc2025, mentor
>   Original Estimate: 672h
>  Remaining Estimate: 672h
>
> [Apache Beam's YAML DSL|https://beam.apache.org/releases/yamldoc/current/] 
> provides a powerful and declarative way to define data processing pipelines. 
> However, its adoption for complex use cases like Machine Learning (ML) and 
> Managed IO (specifically Apache Iceberg and Kafka) is hindered by a lack of 
> comprehensive documentation and practical examples. This project aims to 
> significantly improve the Beam YAML documentation and create illustrative 
> examples focused on ML workflows and Iceberg/Kafka integration, making these 
> advanced features more accessible to users.



