Dear Stephan,

I also suspect the S3 library.
I've tried both s3n and s3a.
Which library versions are suitable? I'm using aws-java-sdk-1.7.4 and hadoop-aws-2.7.2.

Best regards.
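
For reference, the memory debugging logger mentioned in the quoted reply below is switched on through flink-conf.yaml. A minimal sketch, assuming the key names from the linked "Memory and Performance Debugging" section as they apply to Flink 1.0.x; the interval value is only illustrative and is not taken from this thread:

```yaml
# Sketch only: key names assumed from the linked "Memory and Performance Debugging" docs.
# Make each TaskManager periodically log memory and garbage-collection statistics.
taskmanager.debug.memory.startLogThread: true
# Logging interval in milliseconds (illustrative value; only used when the log thread is enabled).
taskmanager.debug.memory.logIntervalMs: 10000
```

With these set, the statistics should appear in the TaskManager log after a restart, which is where one would look to see which part of the memory is growing.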

> On Jul 21, 2016, at 5:54 PM, Stephan Ewen <se...@apache.org> wrote:
> 
> Hi!
> 
> There is a memory debugging logger; you can activate it as described here:
> https://ci.apache.org/projects/flink/flink-docs-master/setup/config.html#memory-and-performance-debugging
> 
> It will print which parts of the memory are growing.
> 
> What you can also try is to deactivate checkpointing for one run and see if 
> that solves it. If yes, then I suspect there is a memory leak in the s3 
> library (are you using s3, s3a, or s3n?).
> 
> Can you also check what libraries you are using? We have seen cases of memory 
> leaks in the libraries people used.
> 
> Greetings,
> Stephan
> 
> 
> 
> On Thu, Jul 21, 2016 at 5:13 AM, 김동일 <kim.s...@gmail.com> wrote:
> Hi Stephan,
> 
> - Did you submit any job to the cluster, or is the memory growing even 
> on an idle TaskManager?
> 
> Yes, I have a streaming job running.
> 
> - If you are running a job, do you use the RocksDB state backend or the 
> FileSystem state backend?
> 
> The FileSystem state backend, with S3.
> 
> - Does it grow infinitely, or only up to a certain point before going down 
> again?
> 
> I think it grows infinitely. The kernel eventually kills the process (OOM).
> 
> 
> 
> On Thu, Jul 21, 2016 at 3:52 AM Stephan Ewen <se...@apache.org> wrote:
> Hi!
> 
> In order to answer this, we need a bit more information. Here are some 
> follow-up questions:
> 
>   - Did you submit any job to the cluster, or is the memory growing even 
>     on an idle TaskManager?
>   - If you are running a job, do you use the RocksDB state backend or the 
>     FileSystem state backend?
>   - Does it grow infinitely, or only up to a certain point before going down 
>     again?
> 
> Greetings,
> Stephan
> 
> 
> On Wed, Jul 20, 2016 at 5:58 PM, 김동일 <kim.s...@gmail.com> wrote:
> Oh, and my Flink version is 1.0.3.
> 
> 
> ---------- Forwarded message ----------
> From: 김동일 <kim.s...@gmail.com>
> Date: Thu, Jul 21, 2016 at 12:52 AM
> Subject: taskmanager memory leak
> To: user@flink.apache.org
> 
> 
> I've set up a standalone cluster.
> The TaskManager consumes more memory than the Xmx setting allows, and its 
> usage keeps growing.
> I saw this link:
> http://mail-archives.apache.org/mod_mbox/flink-dev/201606.mbox/%3ccak2vtervsw4muboc4swix0mr6y9bijznjuypf6_f9f0g9-_...@mail.gmail.com%3E
> So I set taskmanager.memory.preallocate to true, but that did not solve the 
> problem.
> 
> My Java version is:
> java version "1.8.0_20"
> Java(TM) SE Runtime Environment (build 1.8.0_20-b26)
> Java HotSpot(TM) 64-Bit Server VM (build 25.20-b23, mixed mode)
> 
> And my flink-conf.yaml is:
> 
> env.java.home: /usr/java/default
> jobmanager.rpc.address: internal.stream01.denma.ggportal.net
> jobmanager.rpc.port: 6123
> jobmanager.heap.mb: 2048
> taskmanager.heap.mb: 8192
> taskmanager.memory.off-heap: true
> taskmanager.numberOfTaskSlots: 4
> taskmanager.memory.preallocate: false
> parallelism.default: 2
> jobmanager.web.port: 8081
> jobmanager.web.submit.enable: true
> state.backend: filesystem
> state.backend.fs.checkpointdir: s3a://denma.live/flink/datum/checkpoints
> taskmanager.network.numberOfBuffers: 8192
> taskmanager.tmp.dirs: /opt/flink/var/tmp
> fs.hdfs.hadoopconf: /opt/flink/conf/
> recovery.mode: zookeeper
> recovery.zookeeper.quorum: ....
> recovery.zookeeper.storageDir: s3a://denma.live/flink/datum/recovery
> recovery.jobmanager.port: 50000-50100
> recovery.zookeeper.path.root: /flink
> blob.server.port: 50100-50200
> blob.storage.directory: /opt/flink/var/tmp/flink-blob
> taskmanager.rpc.port: 6122
> taskmanager.data.port: 6121
> 
> I need help. What should I do?
> Thanks in advance.
> 
> 
> 
> 
> 
