Polber commented on code in PR #30003: URL: https://github.com/apache/beam/pull/30003#discussion_r1501174577
##########
sdks/python/apache_beam/yaml/examples/README.md:
##########
@@ -0,0 +1,48 @@
+# Examples Catalog
+
+<!-- TOC -->
+* [Examples Catalog](#examples-catalog)
+  * [Wordcount](#wordcount)
+  * [Transforms](#transforms)
+    * [Element-wise](#element-wise)
+    * [Aggregation](#aggregation)
+<!-- TOC -->
+
+This module contains a series of Beam YAML code samples that can be run using
+the command:
+```
+python -m apache_beam.yaml.main --pipeline_spec_file=/path/to/example.yaml
+```
+
+## Wordcount
+A good starting place is the [Wordcount](wordcount_minimal.yaml) example under
+the root example directory.
+This example reads in a text file, splits the text on each word, groups by each
+word, and counts the occurrence of each word. This is a classic example used in
+the other SDK's and shows off many of the functionalities of Beam YAML.
+
+## Transforms
+
+Examples in this directory show off the various built-in transforms of the Beam
+YAML framework.
+
+### Element-wise
+These examples leverage the built-in mapping transforms including `MapToFields`,
+`Filter` and `Explode`. More information can be found about mapping transforms
+[here](../docs/yaml_mapping.md).

Review Comment:
   I pointed to UDF section since that is where MapToFields lives

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]
