[ 
https://issues.apache.org/jira/browse/FLINK-25028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17448347#comment-17448347
 ] 

Zhilong Hong commented on FLINK-25028:
--------------------------------------

It seems that an OOM happens on the TaskExecutor. There are many reasons that 
could cause an OOM. You could run the {{jmap}} command in the pod/container to 
get the snapshot of the heap memory of that TaskExecutor. Then you could use 
the {{jhat}} command to analysis the snapshot and find out what is the largest 
part that occupies the heap memory. For more information, please see [the 
official doc of 
jmap|https://docs.oracle.com/javase/8/docs/technotes/guides/troubleshoot/tooldescr014.html].

> java.lang.OutOfMemoryError: Java heap space
> -------------------------------------------
>
>                 Key: FLINK-25028
>                 URL: https://issues.apache.org/jira/browse/FLINK-25028
>             Project: Flink
>          Issue Type: Bug
>          Components: Runtime / Task
>    Affects Versions: 1.14.0
>            Reporter: wangbaohua
>            Priority: Blocker
>         Attachments: error.txt
>
>
> java.lang.OutOfMemoryError: Java heap space
>       at java.util.HashMap.resize(HashMap.java:703) ~[?:1.8.0_131]
>       at java.util.HashMap.putVal(HashMap.java:628) ~[?:1.8.0_131]
>       at java.util.HashMap.put(HashMap.java:611) ~[?:1.8.0_131]
>       at java.util.HashSet.add(HashSet.java:219) ~[?:1.8.0_131]
>       at 
> java.io.ObjectStreamClass$FieldReflector.<init>(ObjectStreamClass.java:1945) 
> ~[?:1.8.0_131]
>       at java.io.ObjectStreamClass.getReflector(ObjectStreamClass.java:2193) 
> ~[?:1.8.0_131]
>       at java.io.ObjectStreamClass.<init>(ObjectStreamClass.java:521) 
> ~[?:1.8.0_131]
>       at java.io.ObjectStreamClass.lookup(ObjectStreamClass.java:369) 
> ~[?:1.8.0_131]
>       at java.io.ObjectStreamClass.<init>(ObjectStreamClass.java:468) 
> ~[?:1.8.0_131]
>       at java.io.ObjectStreamClass.lookup(ObjectStreamClass.java:369) 
> ~[?:1.8.0_131]
>       at 
> java.io.ObjectOutputStream.writeObject0(ObjectOutputStream.java:1134) 
> ~[?:1.8.0_131]
>       at java.io.ObjectOutputStream.writeArray(ObjectOutputStream.java:1378) 
> ~[?:1.8.0_131]
>       at 
> java.io.ObjectOutputStream.writeObject0(ObjectOutputStream.java:1174) 
> ~[?:1.8.0_131]
>       at java.io.ObjectOutputStream.access$300(ObjectOutputStream.java:162) 
> ~[?:1.8.0_131]
>       at 
> java.io.ObjectOutputStream$PutFieldImpl.writeFields(ObjectOutputStream.java:1707)
>  ~[?:1.8.0_131]
>       at java.io.ObjectOutputStream.writeFields(ObjectOutputStream.java:482) 
> ~[?:1.8.0_131]
>       at 
> java.util.concurrent.ConcurrentHashMap.writeObject(ConcurrentHashMap.java:1406)
>  ~[?:1.8.0_131]
>       at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) 
> ~[?:1.8.0_131]
>       at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62) 
> ~[?:1.8.0_131]
>       at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>  ~[?:1.8.0_131]
>       at java.lang.reflect.Method.invoke(Method.java:498) ~[?:1.8.0_131]
>       at 
> java.io.ObjectStreamClass.invokeWriteObject(ObjectStreamClass.java:1028) 
> ~[?:1.8.0_131]
>       at 
> java.io.ObjectOutputStream.writeSerialData(ObjectOutputStream.java:1496) 
> ~[?:1.8.0_131]
>       at 
> java.io.ObjectOutputStream.writeOrdinaryObject(ObjectOutputStream.java:1432) 
> ~[?:1.8.0_131]
>       at 
> java.io.ObjectOutputStream.writeObject0(ObjectOutputStream.java:1178) 
> ~[?:1.8.0_131]
>       at java.io.ObjectOutputStream.writeObject(ObjectOutputStream.java:348) 
> ~[?:1.8.0_131]
>       at 
> org.apache.flink.util.InstantiationUtil.serializeObject(InstantiationUtil.java:632)
>  ~[flink-dist_2.11-1.14.0.jar:1.14.0]
>       at 
> org.apache.flink.util.SerializedValue.<init>(SerializedValue.java:62) 
> ~[flink-dist_2.11-1.14.0.jar:1.14.0]
>       at 
> org.apache.flink.runtime.accumulators.AccumulatorSnapshot.<init>(AccumulatorSnapshot.java:51)
>  ~[flink-dist_2.11-1.14.0.jar:1.14.0]
>       at 
> org.apache.flink.runtime.accumulators.AccumulatorRegistry.getSnapshot(AccumulatorRegistry.java:54)
>  ~[flink-dist_2.11-1.14.0.jar:1.14.0]
>       at 
> org.apache.flink.runtime.taskexecutor.TaskExecutor$JobManagerHeartbeatListener.lambda$retrievePayload$3(TaskExecutor.java:2425)
>  ~[flink-dist_2.11-1.14.0.jar:1.14.0]
>       at 
> org.apache.flink.runtime.taskexecutor.TaskExecutor$JobManagerHeartbeatListener$$Lambda$1020/78782846.apply(Unknown
>  Source) ~[?:?]



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to