[
https://issues.apache.org/jira/browse/BEAM-6067?focusedWorklogId=169311&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-169311
]
ASF GitHub Bot logged work on BEAM-6067:
----------------------------------------
Author: ASF GitHub Bot
Created on: 26/Nov/18 09:58
Start Date: 26/Nov/18 09:58
Worklog Time Spent: 10m
Work Description: robertwb commented on a change in pull request #7081:
[BEAM-6067] In Python SDK, specify pipeline_proto_coder_id property in
non-Beam-standard CloudObject coders
URL: https://github.com/apache/beam/pull/7081#discussion_r236185948
##########
File path: sdks/python/apache_beam/runners/dataflow/dataflow_runner.py
##########
@@ -441,22 +443,25 @@ def _get_side_input_encoding(self, input_encoding):
def _get_encoded_output_coder(self, transform_node, window_value=True):
"""Returns the cloud encoding of the coder for the output of a
transform."""
+ from apache_beam.runners.dataflow.internal import apiclient
if (len(transform_node.outputs) == 1
and transform_node.outputs[None].element_type is not None):
# TODO(robertwb): Handle type hints for multi-output transforms.
element_type = transform_node.outputs[None].element_type
+ use_fnapi =
apiclient._use_fnapi(transform_node.outputs[None].pipeline._options)
else:
# TODO(silviuc): Remove this branch (and assert) when typehints are
# propagated everywhere. Returning an 'Any' as type hint will trigger
# usage of the fallback coder (i.e., cPickler).
element_type = typehints.Any
+ use_fnapi = False # TODO(chambers): XXX do the right thing for this
Review comment:
I'd like to understand why setting the `pipeline_proto_coder_id` attribute
unconditionally breaks things. If that's not workable, I'd rather name this
something other than use_fnapi if we don't need the coder id in this case.
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]
Issue Time Tracking
-------------------
Worklog Id: (was: 169311)
Time Spent: 5h 20m (was: 5h 10m)
Remaining Estimate: 162h 40m (was: 162h 50m)
> Dataflow runner should include portable pipeline coder id in CloudObject
> coder representation
> ---------------------------------------------------------------------------------------------
>
> Key: BEAM-6067
> URL: https://issues.apache.org/jira/browse/BEAM-6067
> Project: Beam
> Issue Type: Improvement
> Components: beam-model
> Reporter: Craig Chambers
> Assignee: Craig Chambers
> Priority: Major
> Original Estimate: 168h
> Time Spent: 5h 20m
> Remaining Estimate: 162h 40m
>
> When translating a BeamJava Coder into the DataflowRunner's CloudObject
> property map, include a property that specifies the id in the Beam model
> Pipeline coders map corresponding to that Coder. This will allow the
> DataflowRunner to reference the corresponding Beam coder in the FnAPI
> processing bundle.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)