[ https://issues.apache.org/jira/browse/BEAM-5428?focusedWorklogId=319033&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-319033 ]
ASF GitHub Bot logged work on BEAM-5428:
----------------------------------------

                Author: ASF GitHub Bot
            Created on: 26/Sep/19 15:14
            Start Date: 26/Sep/19 15:14
    Worklog Time Spent: 10m
      Work Description: mxm commented on pull request #9418: [BEAM-5428] Implement cross-bundle user state caching in the Python SDK
URL: https://github.com/apache/beam/pull/9418#discussion_r328673182

 ##########
 File path: sdks/python/apache_beam/runners/worker/sdk_worker_main.py
 ##########
 @@ -205,6 +206,28 @@ def _get_worker_count(pipeline_options):
    return 12
 
 
 +def _get_state_cache_size(pipeline_options):
 +  """Defines the upper number of state items to cache.
 +
 +  Note: state_cache_size is an experimental flag and might not be available in
 +  future releases.
 +
 +  Returns:
 +    an int indicating the maximum number of items to cache.
 +      Default is 0 (disabled)
 +  """
 +  experiments = pipeline_options.view_as(DebugOptions).experiments
 +  experiments = experiments if experiments else []
 +
 +  for experiment in experiments:
 +    # There should only be 1 match so returning from the loop
 +    if re.match(r'state_cache_size=', experiment):
 +      return int(
 +          re.match(r'state_cache_size=(?P<state_cache_size>.*)',
 +                   experiment).group('state_cache_size'))
 +  return 100

 Review comment:
   Good catch, this was actually pending in my branch.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Issue Time Tracking
-------------------

    Worklog Id:     (was: 319033)
    Time Spent: 23h 10m  (was: 23h)

> Implement cross-bundle state caching.
> -------------------------------------
>
>                 Key: BEAM-5428
>                 URL: https://issues.apache.org/jira/browse/BEAM-5428
>             Project: Beam
>          Issue Type: Improvement
>          Components: sdk-py-harness
>            Reporter: Robert Bradshaw
>            Assignee: Maximilian Michels
>            Priority: Major
>          Time Spent: 23h 10m
>  Remaining Estimate: 0h
>
> Tech spec:
> [https://docs.google.com/document/d/1BOozW0bzBuz4oHJEuZNDOHdzaV5Y56ix58Ozrqm2jFg/edit#heading=h.7ghoih5aig5m]
> Relevant document:
> [https://docs.google.com/document/d/1ltVqIW0XxUXI6grp17TgeyIybk3-nDF8a0-Nqw-s9mY/edit#|https://docs.google.com/document/d/1ltVqIW0XxUXI6grp17TgeyIybk3-nDF8a0-Nqw-s9mY/edit]
> Mailing list link:
> [https://lists.apache.org/thread.html/caa8d9bc6ca871d13de2c5e6ba07fdc76f85d26497d95d90893aa1f6@%3Cdev.beam.apache.org%3E]

--
This message was sent by Atlassian Jira
(v8.3.4#803005)
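
For readers following the review thread: the flag lookup in the quoted hunk can be tried outside of Beam with a small standalone sketch. This is not the code from the pull request. The name parse_state_cache_size, the plain-list argument (instead of pipeline_options.view_as(DebugOptions).experiments), and the digits-only pattern are assumptions made for illustration. The default of 0 follows the docstring in the hunk ("Default is 0 (disabled)"), not the "return 100" line shown there.

```python
import re

# Hypothetical standalone helper, not part of the Beam SDK.
# Assumption: a valid entry looks exactly like "state_cache_size=<digits>".
_STATE_CACHE_SIZE_RE = re.compile(r'state_cache_size=(?P<size>\d+)$')


def parse_state_cache_size(experiments, default=0):
  """Returns the size from the first 'state_cache_size=N' entry.

  Falls back to `default` (0, i.e. caching disabled, as the docstring in
  the quoted hunk describes) when no well-formed entry is present.
  """
  for experiment in experiments or []:
    match = _STATE_CACHE_SIZE_RE.match(experiment)
    if match:
      # Only one matching entry is expected, so return on the first hit.
      return int(match.group('size'))
  return default


if __name__ == '__main__':
  print(parse_state_cache_size(['beam_fn_api', 'state_cache_size=50']))
  print(parse_state_cache_size(None))
```

Matching once with a digits-only group avoids the second re.match call and keeps a malformed value such as "state_cache_size=abc" from raising ValueError in int(); whether the real worker should fall back silently or fail loudly in that case is a design choice this sketch does not settle.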