[ 
https://issues.apache.org/jira/browse/BEAM-7885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16904185#comment-16904185
 ] 

niklas Hansson edited comment on BEAM-7885 at 8/9/19 9:09 PM:
--------------------------------------------------------------

I got it working on Dataflow, but actually still have problems to run it with 
the direct runner which i used initially Is it expected to work on the direct 
runner?

However it seams like the setup function where run multiple times during start 
up of the pipeline but then never again. Is this expected behaviour? 


was (Author: nikenano):
I got it working on Dataflow, but actually still have problems to run it with 
the direct runner which i used initially Is it expected to work on the direct 
runner?

> DoFn.setup() don't run for streaming jobs. 
> -------------------------------------------
>
>                 Key: BEAM-7885
>                 URL: https://issues.apache.org/jira/browse/BEAM-7885
>             Project: Beam
>          Issue Type: Bug
>          Components: sdk-py-core
>    Affects Versions: 2.14.0
>         Environment: Python
>            Reporter: niklas Hansson
>            Priority: Minor
>
> From version 2.14.0 Python have introduced setup and teardown for DoFn in 
> order to "Called to prepare an instance for processing bundles of 
> elements.This is a good place to initialize transient in-memory resources, 
> such as network connections."
> However when trying to use it for a unbounded job (pubsub source) it seams 
> like the DoFn.setup() is never called and the resources are never  
> initialize. Instead I get:
> """" 
> AttributeError: 'NoneType' object has no attribute 'predict' [while running 
> 'transform the data']
> """
> My source code: [https://github.com/NikeNano/DataflowSklearnStreaming]
>  
> I am happy to contribute with example code for how to use setup as soon as I 
> get it running :)  
>  



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

Reply via email to