[
https://issues.apache.org/jira/browse/FLINK-25028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17448347#comment-17448347
]
Zhilong Hong commented on FLINK-25028:
--------------------------------------
It seems that an OOM happens on the TaskExecutor. There are many reasons that
could cause an OOM. You could run the {{jmap}} command in the pod/container to
get the snapshot of the heap memory of that TaskExecutor. Then you could use
the {{jhat}} command to analysis the snapshot and find out what is the largest
part that occupies the heap memory. For more information, please see [the
official doc of
jmap|https://docs.oracle.com/javase/8/docs/technotes/guides/troubleshoot/tooldescr014.html].
> java.lang.OutOfMemoryError: Java heap space
> -------------------------------------------
>
> Key: FLINK-25028
> URL: https://issues.apache.org/jira/browse/FLINK-25028
> Project: Flink
> Issue Type: Bug
> Components: Runtime / Task
> Affects Versions: 1.14.0
> Reporter: wangbaohua
> Priority: Blocker
> Attachments: error.txt
>
>
> java.lang.OutOfMemoryError: Java heap space
> at java.util.HashMap.resize(HashMap.java:703) ~[?:1.8.0_131]
> at java.util.HashMap.putVal(HashMap.java:628) ~[?:1.8.0_131]
> at java.util.HashMap.put(HashMap.java:611) ~[?:1.8.0_131]
> at java.util.HashSet.add(HashSet.java:219) ~[?:1.8.0_131]
> at
> java.io.ObjectStreamClass$FieldReflector.<init>(ObjectStreamClass.java:1945)
> ~[?:1.8.0_131]
> at java.io.ObjectStreamClass.getReflector(ObjectStreamClass.java:2193)
> ~[?:1.8.0_131]
> at java.io.ObjectStreamClass.<init>(ObjectStreamClass.java:521)
> ~[?:1.8.0_131]
> at java.io.ObjectStreamClass.lookup(ObjectStreamClass.java:369)
> ~[?:1.8.0_131]
> at java.io.ObjectStreamClass.<init>(ObjectStreamClass.java:468)
> ~[?:1.8.0_131]
> at java.io.ObjectStreamClass.lookup(ObjectStreamClass.java:369)
> ~[?:1.8.0_131]
> at
> java.io.ObjectOutputStream.writeObject0(ObjectOutputStream.java:1134)
> ~[?:1.8.0_131]
> at java.io.ObjectOutputStream.writeArray(ObjectOutputStream.java:1378)
> ~[?:1.8.0_131]
> at
> java.io.ObjectOutputStream.writeObject0(ObjectOutputStream.java:1174)
> ~[?:1.8.0_131]
> at java.io.ObjectOutputStream.access$300(ObjectOutputStream.java:162)
> ~[?:1.8.0_131]
> at
> java.io.ObjectOutputStream$PutFieldImpl.writeFields(ObjectOutputStream.java:1707)
> ~[?:1.8.0_131]
> at java.io.ObjectOutputStream.writeFields(ObjectOutputStream.java:482)
> ~[?:1.8.0_131]
> at
> java.util.concurrent.ConcurrentHashMap.writeObject(ConcurrentHashMap.java:1406)
> ~[?:1.8.0_131]
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> ~[?:1.8.0_131]
> at
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
> ~[?:1.8.0_131]
> at
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> ~[?:1.8.0_131]
> at java.lang.reflect.Method.invoke(Method.java:498) ~[?:1.8.0_131]
> at
> java.io.ObjectStreamClass.invokeWriteObject(ObjectStreamClass.java:1028)
> ~[?:1.8.0_131]
> at
> java.io.ObjectOutputStream.writeSerialData(ObjectOutputStream.java:1496)
> ~[?:1.8.0_131]
> at
> java.io.ObjectOutputStream.writeOrdinaryObject(ObjectOutputStream.java:1432)
> ~[?:1.8.0_131]
> at
> java.io.ObjectOutputStream.writeObject0(ObjectOutputStream.java:1178)
> ~[?:1.8.0_131]
> at java.io.ObjectOutputStream.writeObject(ObjectOutputStream.java:348)
> ~[?:1.8.0_131]
> at
> org.apache.flink.util.InstantiationUtil.serializeObject(InstantiationUtil.java:632)
> ~[flink-dist_2.11-1.14.0.jar:1.14.0]
> at
> org.apache.flink.util.SerializedValue.<init>(SerializedValue.java:62)
> ~[flink-dist_2.11-1.14.0.jar:1.14.0]
> at
> org.apache.flink.runtime.accumulators.AccumulatorSnapshot.<init>(AccumulatorSnapshot.java:51)
> ~[flink-dist_2.11-1.14.0.jar:1.14.0]
> at
> org.apache.flink.runtime.accumulators.AccumulatorRegistry.getSnapshot(AccumulatorRegistry.java:54)
> ~[flink-dist_2.11-1.14.0.jar:1.14.0]
> at
> org.apache.flink.runtime.taskexecutor.TaskExecutor$JobManagerHeartbeatListener.lambda$retrievePayload$3(TaskExecutor.java:2425)
> ~[flink-dist_2.11-1.14.0.jar:1.14.0]
> at
> org.apache.flink.runtime.taskexecutor.TaskExecutor$JobManagerHeartbeatListener$$Lambda$1020/78782846.apply(Unknown
> Source) ~[?:?]
--
This message was sent by Atlassian Jira
(v8.20.1#820001)