[
https://issues.apache.org/jira/browse/BEAM-1556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15886971#comment-15886971
]
Davor Bonaci edited comment on BEAM-1556 at 2/28/17 1:05 AM:
-------------------------------------------------------------
It certainly is an issue in the other runners as well.
I wouldn't do in the context of a {{FileBasedSource}}. Users should be able to
call the {{FileSystem}} API from, say, {{@ProcessElement}} method of a
{{DoFn}}. So, I think the registration should be done before any "user code" is
invoked.
Doing it in worker startup might not be ideal -- the constructor takes
{{PipelineOptions}} as an argument. Since jobs could have different options, it
probably needs to happen on a per-task basis, likely at the point the worker
receives the task from the master and deserializes {{PipelineOptions}}.
was (Author: davor):
It certainly is an issue in other runners as well.
I wouldn't do in the context of a {{FileBasedSource}}. Users should be able to
call the {{FileSystem}} API from, say, {{@ProcessElement}} method of a
{{DoFn}}. So, I think the registration should be done before any "user code" is
invoked.
Doing it in worker startup might not be ideal -- the constructor takes
{{PipelineOptions}} as an argument. Since jobs could have different options, it
probably needs to happen on a per-task basis, likely at the point the worker
receives the task from the master and deserializes {{PipelineOptions}}.
> Spark executors need to register IO factories
> ---------------------------------------------
>
> Key: BEAM-1556
> URL: https://issues.apache.org/jira/browse/BEAM-1556
> Project: Beam
> Issue Type: Bug
> Components: runner-spark
> Reporter: Frances Perry
> Assignee: Jean-Baptiste Onofré
>
> The Spark executors need to call IOChannelUtils.registerIOFactories(options)
> in order to support GCS file and make the default WordCount example work.
> Context in this thread:
> https://lists.apache.org/thread.html/469a139c9eb07e64e514cdea42ab8000678ab743794a090c365205d7@%3Cuser.beam.apache.org%3E
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)