ryanthompson591 commented on a change in pull request #16589:
URL: https://github.com/apache/beam/pull/16589#discussion_r790937340
##########
File path: sdks/python/apache_beam/runners/worker/sdk_worker_main.py
##########
@@ -76,8 +76,9 @@ def create_harness(environment, dry_run=False):
# These are used for dataflow templates.
RuntimeValueProvider.set_runtime_options(pipeline_options_dict)
sdk_pipeline_options = PipelineOptions.from_dictionary(pipeline_options_dict)
+ pickle_library = sdk_pipeline_options.view_as(SetupOptions).pickle_library
Review comment:
move the line that declares this variable right above where it is used.
##########
File path: sdks/python/apache_beam/runners/worker/sdk_worker_main.py
##########
@@ -87,17 +88,18 @@ def create_harness(environment, dry_run=False):
_LOGGER.info('semi_persistent_directory: %s', semi_persistent_directory)
_worker_id = environment.get('WORKER_ID', None)
- try:
- _load_main_session(semi_persistent_directory)
- except CorruptMainSessionException:
- exception_details = traceback.format_exc()
- _LOGGER.error(
- 'Could not load main session: %s', exception_details, exc_info=True)
- raise
- except Exception: # pylint: disable=broad-except
- exception_details = traceback.format_exc()
- _LOGGER.error(
- 'Could not load main session: %s', exception_details, exc_info=True)
+ if pickle_library != pickler.USE_CLOUDPICKLE:
Review comment:
later on when we change the default to cloudpickle, won't this break.
##########
File path: sdks/python/apache_beam/runners/portability/stager.py
##########
@@ -341,9 +343,11 @@ def create_job_resources(options, # type: PipelineOptions
pickled_session_file = os.path.join(
temp_dir, names.PICKLED_MAIN_SESSION_FILE)
pickler.dump_session(pickled_session_file)
- resources.append(
- Stager._create_file_stage_to_artifact(
- pickled_session_file, names.PICKLED_MAIN_SESSION_FILE))
+ # for pickle_library: cloudpickle, dump_session is no op
+ if os.path.exists(pickled_session_file):
Review comment:
I like the way you did this. Is it possible to add a unit test for this
behavior?
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]