niklas Hansson created BEAM-7885:
------------------------------------
Summary: DoFn.setup() doesn't run for streaming jobs.
Key: BEAM-7885
URL: https://issues.apache.org/jira/browse/BEAM-7885
Project: Beam
Issue Type: Bug
Components: sdk-py-core
Affects Versions: 2.14.0
Environment: Python
Reporter: niklas Hansson
As of version 2.14.0, the Python SDK has setup and teardown methods for DoFn.
The documentation describes setup as: "Called to prepare an instance for
processing bundles of elements. This is a good place to initialize transient
in-memory resources, such as network connections."
However, when I try to use it for an unbounded job (Pub/Sub source), it seems
that DoFn.setup() is never called and the resources are never initialized.
Instead I get:
"""
AttributeError: 'NoneType' object has no attribute 'predict' [while running
'transform the data']
"""
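
A minimal sketch of the pattern described above, for illustration only. It is
not the code from the linked repository: PredictDoFn, load_model and _StubModel
are hypothetical names, and the stub stands in for a real scikit-learn model.
The lazy fallback in process() is one possible workaround for a runner that
does not call setup(), not a confirmed fix.

```python
try:
    import apache_beam as beam
    _DoFnBase = beam.DoFn
except ImportError:
    # Fallback so the sketch can be exercised without Beam installed.
    _DoFnBase = object


class _StubModel:
    """Hypothetical stand-in for a trained scikit-learn model."""

    def predict(self, rows):
        # Return one dummy "prediction" per row: the row's length.
        return [len(row) for row in rows]


def load_model():
    """Hypothetical loader; a real job might unpickle a model from storage."""
    return _StubModel()


class PredictDoFn(_DoFnBase):
    def __init__(self):
        super().__init__()
        # Left as None here; the model is created on the worker, not pickled.
        self._model = None

    def setup(self):
        # Intended place to initialize transient in-memory resources.
        self._model = load_model()

    def process(self, element):
        if self._model is None:
            # Assumed workaround: initialize lazily if setup() was never
            # called, which avoids the AttributeError on NoneType.
            self.setup()
        yield self._model.predict([element])[0]
```

With this guard, process() still works when setup() is skipped, at the cost of
one None check per element.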
My source code: [https://github.com/NikeNano/DataflowSklearnStreaming]
I am happy to contribute example code showing how to use setup as soon as I
get it running :)