[ 
https://issues.apache.org/jira/browse/BEAM-5640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17137917#comment-17137917
 ] 

Thomas Weise edited comment on BEAM-5640 at 6/16/20, 8:46 PM:
--------------------------------------------------------------

No longer an issue for us after moving to py3. If anyone else using the package 
runs into this issue, please re-open.


was (Author: thw):
No longer an issue after moving to py3.

> Portable python sdk worker leaks memory when PyOpenSSL package is present
> -------------------------------------------------------------------------
>
>                 Key: BEAM-5640
>                 URL: https://issues.apache.org/jira/browse/BEAM-5640
>             Project: Beam
>          Issue Type: Bug
>          Components: sdk-py-harness
>            Reporter: Micah Wylde
>            Priority: P3
>             Fix For: Not applicable
>
>
> When the PyOpenSSL package is installed on a system (e.g., in a virtualenv), the 
> Python sdk_worker process leaks memory. I've validated this when using the 
> Flink portable runner in streaming mode, but it may occur in other 
> configurations as well. The leak is significant, amounting to tens of 
> MB/s.
> I've put together a reproduction for the issue 
> [here|https://github.com/mwylde/beam/tree/micah_memory_leak]. That branch 
> includes a Flink streaming data source that generates data, as well as a 
> Python pipeline that demonstrates the issue.
> To reproduce:
> {code:java}
> # check out the branch:
> $ git clone git@github.com:mwylde/beam.git
> $ cd beam
> $ git checkout micah_memory_leak
> # build the python docker container with pyopenssl installed:
> $ ./gradlew :beam-sdks-python-container:docker
> # start the job server with embedded flink cluster:
> $ ./gradlew runShadow
> # run the pipeline:
> $ ./gradlew :beam-sdks-python:streamingLeak{code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
