[ https://issues.apache.org/jira/browse/FLINK-25028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17448347#comment-17448347 ]
Zhilong Hong commented on FLINK-25028: -------------------------------------- It seems that an OOM happens on the TaskExecutor. There are many reasons that could cause an OOM. You could run the {{jmap}} command in the pod/container to get the snapshot of the heap memory of that TaskExecutor. Then you could use the {{jhat}} command to analysis the snapshot and find out what is the largest part that occupies the heap memory. For more information, please see [the official doc of jmap|https://docs.oracle.com/javase/8/docs/technotes/guides/troubleshoot/tooldescr014.html]. > java.lang.OutOfMemoryError: Java heap space > ------------------------------------------- > > Key: FLINK-25028 > URL: https://issues.apache.org/jira/browse/FLINK-25028 > Project: Flink > Issue Type: Bug > Components: Runtime / Task > Affects Versions: 1.14.0 > Reporter: wangbaohua > Priority: Blocker > Attachments: error.txt > > > java.lang.OutOfMemoryError: Java heap space > at java.util.HashMap.resize(HashMap.java:703) ~[?:1.8.0_131] > at java.util.HashMap.putVal(HashMap.java:628) ~[?:1.8.0_131] > at java.util.HashMap.put(HashMap.java:611) ~[?:1.8.0_131] > at java.util.HashSet.add(HashSet.java:219) ~[?:1.8.0_131] > at > java.io.ObjectStreamClass$FieldReflector.<init>(ObjectStreamClass.java:1945) > ~[?:1.8.0_131] > at java.io.ObjectStreamClass.getReflector(ObjectStreamClass.java:2193) > ~[?:1.8.0_131] > at java.io.ObjectStreamClass.<init>(ObjectStreamClass.java:521) > ~[?:1.8.0_131] > at java.io.ObjectStreamClass.lookup(ObjectStreamClass.java:369) > ~[?:1.8.0_131] > at java.io.ObjectStreamClass.<init>(ObjectStreamClass.java:468) > ~[?:1.8.0_131] > at java.io.ObjectStreamClass.lookup(ObjectStreamClass.java:369) > ~[?:1.8.0_131] > at > java.io.ObjectOutputStream.writeObject0(ObjectOutputStream.java:1134) > ~[?:1.8.0_131] > at java.io.ObjectOutputStream.writeArray(ObjectOutputStream.java:1378) > ~[?:1.8.0_131] > at > java.io.ObjectOutputStream.writeObject0(ObjectOutputStream.java:1174) > ~[?:1.8.0_131] > at java.io.ObjectOutputStream.access$300(ObjectOutputStream.java:162) > ~[?:1.8.0_131] > at > java.io.ObjectOutputStream$PutFieldImpl.writeFields(ObjectOutputStream.java:1707) > ~[?:1.8.0_131] > at java.io.ObjectOutputStream.writeFields(ObjectOutputStream.java:482) > ~[?:1.8.0_131] > at > java.util.concurrent.ConcurrentHashMap.writeObject(ConcurrentHashMap.java:1406) > ~[?:1.8.0_131] > at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) > ~[?:1.8.0_131] > at > sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62) > ~[?:1.8.0_131] > at > sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) > ~[?:1.8.0_131] > at java.lang.reflect.Method.invoke(Method.java:498) ~[?:1.8.0_131] > at > java.io.ObjectStreamClass.invokeWriteObject(ObjectStreamClass.java:1028) > ~[?:1.8.0_131] > at > java.io.ObjectOutputStream.writeSerialData(ObjectOutputStream.java:1496) > ~[?:1.8.0_131] > at > java.io.ObjectOutputStream.writeOrdinaryObject(ObjectOutputStream.java:1432) > ~[?:1.8.0_131] > at > java.io.ObjectOutputStream.writeObject0(ObjectOutputStream.java:1178) > ~[?:1.8.0_131] > at java.io.ObjectOutputStream.writeObject(ObjectOutputStream.java:348) > ~[?:1.8.0_131] > at > org.apache.flink.util.InstantiationUtil.serializeObject(InstantiationUtil.java:632) > ~[flink-dist_2.11-1.14.0.jar:1.14.0] > at > org.apache.flink.util.SerializedValue.<init>(SerializedValue.java:62) > ~[flink-dist_2.11-1.14.0.jar:1.14.0] > at > org.apache.flink.runtime.accumulators.AccumulatorSnapshot.<init>(AccumulatorSnapshot.java:51) > ~[flink-dist_2.11-1.14.0.jar:1.14.0] > at > org.apache.flink.runtime.accumulators.AccumulatorRegistry.getSnapshot(AccumulatorRegistry.java:54) > ~[flink-dist_2.11-1.14.0.jar:1.14.0] > at > org.apache.flink.runtime.taskexecutor.TaskExecutor$JobManagerHeartbeatListener.lambda$retrievePayload$3(TaskExecutor.java:2425) > ~[flink-dist_2.11-1.14.0.jar:1.14.0] > at > org.apache.flink.runtime.taskexecutor.TaskExecutor$JobManagerHeartbeatListener$$Lambda$1020/78782846.apply(Unknown > Source) ~[?:?] -- This message was sent by Atlassian Jira (v8.20.1#820001)