kennknowles opened a new issue, #18392: URL: https://github.com/apache/beam/issues/18392
Few things I noticed while running mobile gaming example that could be improved: 1. When running on direct runner the default input is too large (20G), so it seems as though the pipeline is stuck without an progress updates or metrics. This could be improved by using a much smaller dataset by default. 2. Even when running on dataflow runner, with default worker settings and auto scaling, it still takes more than 30 minutes to run. We could use a much smaller dataset here too. Also the documentation of these examples could be improved in both the code docstring as well as beam quick start guide. Imported from Jira [BEAM-2286](https://issues.apache.org/jira/browse/BEAM-2286). Original Jira may contain additional context. Reported by: vikasrk. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
