[
https://issues.apache.org/jira/browse/BEAM-1545?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15883154#comment-15883154
]
Ahmet Altay commented on BEAM-1545:
-----------------------------------
Installing with .[gcp] pulls in the Google Cloud Platform specific components,
which are required for reading from gs:// URIs. We are in the process of
separating out the Google-specific pieces, and the quickstart guide does not
reflect this yet.
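
The AttributeError in the quoted traceback ('NoneType' object has no attribute
'GcsIO') is what you would expect when an optional module is set to None after
a failed import. Below is a minimal sketch of that pattern with a clearer
failure message. The names _optional_gcs_backend and glob_path are hypothetical
and used only for illustration; this is not the actual Beam code.

```python
import glob as local_glob

try:
    # Hypothetical optional dependency standing in for the GCP-specific module.
    import _optional_gcs_backend as gcs_backend
except ImportError:
    # The extras are not installed, so the name is bound to None instead.
    gcs_backend = None


def glob_path(pattern):
    """Return paths matching pattern; fail clearly if GCS support is missing."""
    if pattern.startswith('gs://'):
        if gcs_backend is None:
            # Without this check, the call below would raise AttributeError
            # on None, as in the traceback quoted in this issue.
            raise RuntimeError(
                'Reading gs:// paths requires the GCP components; '
                'install the .[gcp] extras first.')
        return gcs_backend.glob(pattern)
    # Local paths do not need the optional dependency.
    return local_glob.glob(pattern)
```

With a guard like this, a gs:// input on an install without the extras would
report the missing components directly instead of failing on None.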
You have Cython in your pip list. The failing test assumes that Cython is not
installed. We need to update that test to skip it when Cython is installed. If
you start from a clean virtual environment without Cython, I believe you would
not see that error.
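
One way to skip such a test when Cython is present is unittest.skipIf. This is
only a sketch: the test class and method names are hypothetical, and the body
is a placeholder, since the actual failing test is not shown in this thread.

```python
import unittest

try:
    import Cython  # noqa: F401  (imported only to detect availability)
    CYTHON_INSTALLED = True
except ImportError:
    CYTHON_INSTALLED = False


class PurePythonOnlyTest(unittest.TestCase):  # hypothetical test case

    @unittest.skipIf(CYTHON_INSTALLED, 'test assumes Cython is not installed')
    def test_pure_python_path(self):
        # Placeholder assertion; runs only when Cython is absent.
        self.assertFalse(CYTHON_INSTALLED)
```

In an environment with Cython the test is reported as skipped rather than
failed; in a clean virtual environment it runs normally.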
> Python sdk example run failed
> -----------------------------
>
> Key: BEAM-1545
> URL: https://issues.apache.org/jira/browse/BEAM-1545
> Project: Beam
> Issue Type: Bug
> Components: sdk-py
> Reporter: Haoxiang
> Assignee: Sourabh Bajaj
>
> When I run the Python SDK example as shown in
> https://beam.apache.org/get-started/quickstart-py/ with the command:
> python -m apache_beam.examples.wordcount --input
> gs://dataflow-samples/shakespeare/kinglear.txt --output output.txt
> it fails with the following log:
> INFO:root:Missing pipeline option (runner). Executing pipeline using the
> default runner: DirectRunner.
> Traceback (most recent call last):
> File
> "/usr/local/Cellar/python/2.7.11/Frameworks/Python.framework/Versions/2.7/lib/python2.7/runpy.py",
> line 162, in _run_module_as_main
> "__main__", fname, loader, pkg_name)
> File
> "/usr/local/Cellar/python/2.7.11/Frameworks/Python.framework/Versions/2.7/lib/python2.7/runpy.py",
> line 72, in _run_code
> exec code in run_globals
> File
> "/Users/haoxiang/InterestingGitProject/beam/sdks/python/apache_beam/examples/wordcount.py",
> line 107, in <module>
> run()
> File
> "/Users/haoxiang/InterestingGitProject/beam/sdks/python/apache_beam/examples/wordcount.py",
> line 83, in run
> lines = p | 'read' >> ReadFromText(known_args.input)
> File "apache_beam/io/textio.py", line 378, in __init__
> skip_header_lines=skip_header_lines)
> File "apache_beam/io/textio.py", line 87, in __init__
> validate=validate)
> File "apache_beam/io/filebasedsource.py", line 97, in __init__
> self._validate()
> File "apache_beam/io/filebasedsource.py", line 171, in _validate
> if len(fileio.ChannelFactory.glob(self._pattern, limit=1)) <= 0:
> File "apache_beam/io/fileio.py", line 281, in glob
> return gcsio.GcsIO().glob(path, limit)
> AttributeError: 'NoneType' object has no attribute 'GcsIO'