Sourabh Bajaj created BEAM-1731:
-----------------------------------
Summary: RuntimeError when running wordcount with ValueProviders
Key: BEAM-1731
URL: https://issues.apache.org/jira/browse/BEAM-1731
Project: Beam
Issue Type: Bug
Components: sdk-py
Reporter: Sourabh Bajaj
Assignee: María GH
Running: python -m apache_beam.examples.wordcount
INFO:root:Job 2017-03-15_13_39_59-3092873759767386 is in state JOB_STATE_FAILED
Traceback (most recent call last):
  File "/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/runpy.py", line 162, in _run_module_as_main
    "__main__", fname, loader, pkg_name)
  File "/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/runpy.py", line 72, in _run_code
    exec code in run_globals
  File "/Users/sourabhbajaj/Projects/incubator-beam/sdks/python/apache_beam/examples/wordcount.py", line 119, in <module>
    run()
  File "/Users/sourabhbajaj/Projects/incubator-beam/sdks/python/apache_beam/examples/wordcount.py", line 109, in run
    result.wait_until_finish()
  File "apache_beam/runners/dataflow/dataflow_runner.py", line 711, in wait_until_finish
    (self.state, getattr(self._runner, 'last_error_msg', None)), self)
apache_beam.runners.dataflow.dataflow_runner.DataflowRuntimeException: Dataflow pipeline failed. State: FAILED, Error:
(e22fabbb61bfae00): Traceback (most recent call last):
  File "/usr/local/lib/python2.7/dist-packages/dataflow_worker/batchworker.py", line 544, in do_work
    work_executor.execute()
  File "dataflow_worker/executor.py", line 1013, in dataflow_worker.executor.CustomSourceSplitExecutor.execute (dataflow_worker/executor.c:31501)
    self.response = self._perform_source_split_considering_api_limits(
  File "dataflow_worker/executor.py", line 1021, in dataflow_worker.executor.CustomSourceSplitExecutor._perform_source_split_considering_api_limits (dataflow_worker/executor.c:31703)
    split_response = self._perform_source_split(source_operation_split_task,
  File "dataflow_worker/executor.py", line 1059, in dataflow_worker.executor.CustomSourceSplitExecutor._perform_source_split (dataflow_worker/executor.c:32341)
    for split in source.split(desired_bundle_size):
  File "/usr/local/lib/python2.7/dist-packages/apache_beam/io/filebasedsource.py", line 192, in split
    return self._get_concat_source().split(
  File "/usr/local/lib/python2.7/dist-packages/apache_beam/utils/value_provider.py", line 105, in _f
    raise RuntimeError('%s not accessible' % obj)
RuntimeError: RuntimeValueProvider(option: input, type: str, default_value: 'gs://dataflow-samples/shakespeare/kinglear.txt') not accessible
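
For readers unfamiliar with the failure mode, the last two frames suggest that a method reached from split() is wrapped by a guard in value_provider.py that refuses to run while a provider has no value yet. The following is a minimal, self-contained sketch of that pattern, not the actual Beam code: the class bodies, the set_runtime_value helper, the FileSource class and the check_accessible decorator as written here are simplified stand-ins assumed for illustration. Only the error message format is taken from the traceback above.

```python
from functools import wraps


class RuntimeValueProvider(object):
    """Simplified stand-in: an option whose value is only known at run time."""

    def __init__(self, option, value_type, default_value):
        self.option = option
        self.value_type = value_type
        self.default_value = default_value
        self._value = None
        self._resolved = False

    def set_runtime_value(self, value):
        # Hypothetical helper: models the runner supplying the option value.
        self._value = value
        self._resolved = True

    def is_accessible(self):
        return self._resolved

    def get(self):
        if not self._resolved:
            raise RuntimeError('%s not accessible' % self)
        return self._value

    def __str__(self):
        return 'RuntimeValueProvider(option: %s, type: %s, default_value: %r)' % (
            self.option, self.value_type.__name__, self.default_value)


def check_accessible(attr_names):
    """Guard a method so it fails early if any named provider is unresolved."""
    def decorator(fn):
        @wraps(fn)
        def _f(self, *args, **kwargs):
            for obj in [getattr(self, name) for name in attr_names]:
                if not obj.is_accessible():
                    raise RuntimeError('%s not accessible' % obj)
            return fn(self, *args, **kwargs)
        return _f
    return decorator


class FileSource(object):
    """Hypothetical source whose file pattern is a value provider."""

    def __init__(self, pattern):
        self._pattern = pattern

    @check_accessible(['_pattern'])
    def split(self, desired_bundle_size):
        # Reading the pattern is only safe once the provider is resolved.
        return [(self._pattern.get(), desired_bundle_size)]
```

In this sketch, calling split() before set_runtime_value() raises a RuntimeError with the same "not accessible" wording as the report, and the call succeeds once the value is set. That would be consistent with the worker splitting the source before the runtime option value was made available to it, although the traceback alone does not confirm the cause.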