[ https://issues.apache.org/jira/browse/BEAM-11037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17361134#comment-17361134 ]
Martin Funkquist commented on BEAM-11037:
-----------------------------------------
I also have this problem.
My setup: Windows 10, Python 3.8.8, Beam 2.30.0, DirectRunner.
Here is where the error occurs (if it helps):
{code:java}
File "e:\Documents\NLP\tapas\venv\lib\site-packages\apache_beam\runners\worker\bundle_processor.py", line 226, in process_encoded
    decoded_value = self.windowed_coder_impl.decode_from_stream(
File "apache_beam\coders\coder_impl.py", line 1251, in apache_beam.coders.coder_impl.WindowedValueCoderImpl.decode_from_stream
File "apache_beam\coders\coder_impl.py", line 1270, in apache_beam.coders.coder_impl.WindowedValueCoderImpl.decode_from_stream
File "apache_beam\coders\coder_impl.py", line 862, in apache_beam.coders.coder_impl.AbstractComponentCoderImpl.decode_from_stream
File "apache_beam\coders\coder_impl.py", line 1056, in apache_beam.coders.coder_impl.SequenceCoderImpl.decode_from_stream
File "apache_beam\coders\coder_impl.py", line 1363, in apache_beam.coders.coder_impl.LengthPrefixCoderImpl.decode_from_stream
File "apache_beam\coders\coder_impl.py", line 228, in apache_beam.coders.coder_impl.StreamCoderImpl.decode
File "apache_beam\coders\coder_impl.py", line 463, in apache_beam.coders.coder_impl.FastPrimitivesCoderImpl.decode_from_stream
File "apache_beam\coders\coder_impl.py", line 474, in apache_beam.coders.coder_impl.FastPrimitivesCoderImpl.decode_from_stream
File "apache_beam\coders\coder_impl.py", line 454, in apache_beam.coders.coder_impl.FastPrimitivesCoderImpl.decode_from_stream
File "apache_beam\coders\stream.pyx", line 193, in apache_beam.coders.stream.InputStream.read_var_int64
RuntimeError: VarLong too long.
{code}
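For readers following the trace: the error is raised while decoding a variable-length integer (`read_var_int64`). The sketch below is a plain-Python illustration of base-128 varint coding, not Beam's actual implementation. The function names and the `max_bits` parameter are hypothetical. It shows that a negative 64-bit value such as -1 occupies the full 10 bytes, so a decoder whose length check allows fewer than 64 bits would reject it while still accepting small positive numbers. Whether that is what happens here is an assumption, not something this thread confirms; it is only suggested by the fact that a C `long` is 32 bits wide on 64-bit Windows.

```python
MASK64 = (1 << 64) - 1


def encode_var_int64(value: int) -> bytes:
    """Encode a signed 64-bit integer as a base-128 varint (7 bits per byte)."""
    bits = value & MASK64  # view the value as unsigned two's complement
    out = bytearray()
    while True:
        byte = bits & 0x7F
        bits >>= 7
        if bits:
            out.append(byte | 0x80)  # high bit set: more bytes follow
        else:
            out.append(byte)
            return bytes(out)


def decode_var_int64(data: bytes, max_bits: int = 64) -> int:
    """Decode a varint; max_bits is a hypothetical knob for the length check."""
    shift = 0
    result = 0
    for byte in data:
        if shift >= max_bits:
            raise RuntimeError("VarLong too long.")
        result |= (byte & 0x7F) << shift
        shift += 7
        if not byte & 0x80:
            break
    else:
        raise RuntimeError("VarInt not terminated.")
    result &= MASK64
    if result >= 1 << 63:  # convert back to a signed value
        result -= 1 << 64
    return result


if __name__ == "__main__":
    encoded = encode_var_int64(-1)
    print(len(encoded))               # number of bytes used for -1
    print(decode_var_int64(encoded))  # round-trips with a 64-bit limit
    try:
        decode_var_int64(encoded, max_bits=32)
    except RuntimeError as exc:
        print("32-bit limit:", exc)
```

With the default 64-bit limit the value round-trips. With the hypothetical 32-bit limit the same bytes raise the error seen in the trace, which would be consistent with the failure appearing only for negative numbers.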
> Python DirectRunner InputStream Issues (RuntimeError: VarLong too long)
> -----------------------------------------------------------------------
>
> Key: BEAM-11037
> URL: https://issues.apache.org/jira/browse/BEAM-11037
> Project: Beam
> Issue Type: Bug
> Components: runner-direct, sdk-py-core
> Affects Versions: 2.24.0
> Environment: Windows 10 64bit, Python 3.7.9
> Reporter: Michael Stauffer
> Priority: P2
>
> I have a Beam workflow that runs fine on Dataflow. However, when I run it on
> my local direct runner (Windows 10, Python SDK), I run into some strange
> behaviour if I have a negative number in my PCollection (see below for an
> example).
> Minimal code example to reproduce the exception:
> {code:java}
> from apache_beam.options.pipeline_options import PipelineOptions
> from apache_beam.pipeline import Pipeline
> import apache_beam
>
> if __name__ == '__main__':
>     pipeline_options = PipelineOptions()
>     with Pipeline(options=pipeline_options) as p:
>         lines = (
>             p
>             | apache_beam.Create([
>                 {"Test": -1}
>             ]))
>         lines | apache_beam.io.WriteToText('./lines')
> {code}
> Exception:
> {code:java}
> File "apache_beam\coders\coder_impl.py", line 222, in
> apache_beam.coders.coder_impl.StreamCoderImpl.decode
> File "apache_beam\coders\coder_impl.py", line 224, in
> apache_beam.coders.coder_impl.StreamCoderImpl.decode
> File "apache_beam\coders\coder_impl.py", line 470, in
> apache_beam.coders.coder_impl.FastPrimitivesCoderImpl.decode_from_stream
> File "apache_beam\coders\coder_impl.py", line 450, in
> apache_beam.coders.coder_impl.FastPrimitivesCoderImpl.decode_from_stream
> File "apache_beam\coders\stream.pyx", line 193, in
> apache_beam.coders.stream.InputStream.read_var_int64
> RuntimeError: VarLong too long. [while running 'Create/Map(decode)']
> {code}
> Any help is highly appreciated.
>