Hello,

Can you say a bit what's the actual problem? Your smys times in those GCs look 
large, but it's only a 4s Pause for a 160gb heap. I am sure you saw that pause 
time before? I think it's pretty hard to tell after the fact, but with such a 
large system I would lean towards problems outside of the JVM.

Gruss
Bernd


--
http://bernd.eckenfels.net
________________________________
Von: hotspot-gc-use <hotspot-gc-use-boun...@openjdk.java.net> im Auftrag von 
Roy Zhang <roy.sunny.zhang...@gmail.com>
Gesendet: Monday, January 20, 2020 11:22:43 AM
An: hotspot-gc-use@openjdk.java.net <hotspot-gc-use@openjdk.java.net>
Betreff: Abnormal high sys time in G1 GC

Dear JVM experts,

Recently we found GC spike (long STW minor GC), and sys time is high when we GC 
time is high. Normally sys time is near 0 seconds and minor GC is less than 
500ms.

>From 
>http://mail.openjdk.java.net/pipermail/hotspot-gc-dev/2017-October/020630.html 
>and https://blog.gceasy.io/2016/12/11/sys-time-greater-than-user-time/, high 
>sys time could be caused by operation system problem/VM related problem/memory 
>constraint/disk IO pressure/Transparent Huge Pages.

I checked them one by one, don't find any clue, could u please kindly provide 
suggestion? Thanks in advance!

1.operation system problem
--We have enough CPU/memory/disk (48 cpu cores + 373 RAM with 160G heap, disk 
is enough), and there is no error in /var/log/dmesg
2. memory constraint
-- We have enough available memory. available memory (free -m) is 263G
3. disk IO pressure
-- Not find issue from disk info from prometheus node exporter.  Granularity is 
15s, and I can't find counterpart of avgqu-sz & util metrics (disk IO util and 
saturation metrics)  which is part of iostat. It could be caused by big 
Granularity???
4. VM related problem
-- We are using physical machine
5. Transparent Huge Pages.
It is madvise. It could be a problem, but we don't have this issue previously. 
It has been running for nearly 20 weeks.
cat /sys/kernel/mm/transparent_hugepage/enabled
always [madvise] never

JDK version:
OpenJDK Runtime Environment, 1.8.0_222-b10

Java Opts:
-javaagent:/server/jmx_prometheus_javaagent-0.12.0.jar=xxxx:/server/config.yaml
-server
-Dcom.sun.management.jmxremote
-Dcom.sun.management.jmxremote.port=xxxx
-Dcom.sun.management.jmxremote.rmi.port=xxxx
-Dcom.sun.management.jmxremote.local.only=false
-Dcom.sun.management.jmxremote.authenticate=false
-Dcom.sun.management.jmxremote.ssl=false
-Xloggc:/server/xxxx.log
-XX:+PrintGCDateStamps
-XX:AutoBoxCacheMax=1000000
-XX:+UseG1GC
-XX:MaxGCPauseMillis=500
-XX:+UnlockExperimentalVMOptions
-XX:G1NewSizePercent=50
-XX:InitiatingHeapOccupancyPercent=70
-XX:+ParallelRefProcEnabled
-XX:+ExplicitGCInvokesConcurrent
-XX:+UseStringDeduplication
-XX:+PrintGCDetails
-XX:+PrintGCTimeStamps
-Xms160g
-Xmx160g
-XX:+HeapDumpOnOutOfMemoryError

Snippet of GC log:

2020-01-20T07:27:03.166+0000: 2756.665: [GC pause (G1 Evacuation Pause) 
(young), 6.2899024 secs]
   [Parallel Time: 6255.0 ms, GC Workers: 33]
      [GC Worker Start (ms): Min: 2756664.9, Avg: 2756665.5, Max: 2756666.1, 
Diff: 1.2]
      [Ext Root Scanning (ms): Min: 0.0, Avg: 0.5, Max: 5.3, Diff: 5.3, Sum: 
16.8]
      [Update RS (ms): Min: 0.0, Avg: 0.8, Max: 1.1, Diff: 1.1, Sum: 25.6]
         [Processed Buffers: Min: 0, Avg: 1.6, Max: 4, Diff: 4, Sum: 53]
      [Scan RS (ms): Min: 142.0, Avg: 145.3, Max: 146.4, Diff: 4.4, Sum: 4794.1]
      [Code Root Scanning (ms): Min: 0.0, Avg: 0.3, Max: 3.5, Diff: 3.5, Sum: 
8.8]
      [Object Copy (ms): Min: 6100.1, Avg: 6101.8, Max: 6106.5, Diff: 6.4, Sum: 
201358.4]
      [Termination (ms): Min: 0.1, Avg: 5.2, Max: 6.7, Diff: 6.6, Sum: 172.9]
         [Termination Attempts: Min: 1, Avg: 1353.0, Max: 1476, Diff: 1475, 
Sum: 44650]
      [GC Worker Other (ms): Min: 0.0, Avg: 0.2, Max: 0.4, Diff: 0.4, Sum: 7.0]
      [GC Worker Total (ms): Min: 6253.4, Avg: 6254.1, Max: 6254.7, Diff: 1.2, 
Sum: 206383.7]
      [GC Worker End (ms): Min: 2762919.4, Avg: 2762919.6, Max: 2762919.8, 
Diff: 0.4]
   [Code Root Fixup: 0.6 ms]
   [Code Root Purge: 0.0 ms]
   [String Dedup Fixup: 0.7 ms, GC Workers: 33]
      [Queue Fixup (ms): Min: 0.0, Avg: 0.0, Max: 0.1, Diff: 0.1, Sum: 0.4]
      [Table Fixup (ms): Min: 0.0, Avg: 0.1, Max: 0.6, Diff: 0.6, Sum: 2.0]
   [Clear CT: 4.0 ms]
   [Other: 29.6 ms]
      [Choose CSet: 0.1 ms]
      [Ref Proc: 10.3 ms]
      [Ref Enq: 0.6 ms]
      [Redirty Cards: 11.3 ms]
      [Humongous Register: 0.2 ms]
      [Humongous Reclaim: 0.0 ms]
      [Free CSet: 6.5 ms]
   [Eden: 72576.0M(72576.0M)->0.0B(80896.0M) Survivors: 9344.0M->1024.0M Heap: 
83520.0M(160.0G)->11046.9M(160.0G)]
 [Times: user=27.19 sys=162.28, real=6.30 secs]

2020-01-20T06:59:23.382+0000: 1096.881: [GC pause (G1 Evacuation Pause) (young) 
(initial-mark), 4.1248088 secs]
   [Parallel Time: 4098.0 ms, GC Workers: 33]
      [GC Worker Start (ms): Min: 1096882.1, Avg: 1096882.8, Max: 1096883.2, 
Diff: 1.2]
      [Ext Root Scanning (ms): Min: 4.0, Avg: 4.8, Max: 6.1, Diff: 2.0, Sum: 
159.7]
      [Update RS (ms): Min: 0.0, Avg: 0.3, Max: 1.1, Diff: 1.1, Sum: 9.5]
         [Processed Buffers: Min: 0, Avg: 1.3, Max: 6, Diff: 6, Sum: 43]
      [Scan RS (ms): Min: 2001.2, Avg: 2012.2, Max: 2013.4, Diff: 12.2, Sum: 
66401.0]
      [Code Root Scanning (ms): Min: 0.0, Avg: 0.6, Max: 10.7, Diff: 10.7, Sum: 
18.5]
      [Object Copy (ms): Min: 2039.3, Avg: 2049.2, Max: 2079.5, Diff: 40.2, 
Sum: 67623.1]
      [Termination (ms): Min: 0.0, Avg: 29.6, Max: 39.7, Diff: 39.7, Sum: 978.0]
         [Termination Attempts: Min: 1, Avg: 6587.0, Max: 8068, Diff: 8067, 
Sum: 217372]
      [GC Worker Other (ms): Min: 0.0, Avg: 0.2, Max: 0.5, Diff: 0.4, Sum: 7.9]
      [GC Worker Total (ms): Min: 4096.3, Avg: 4096.9, Max: 4097.7, Diff: 1.4, 
Sum: 135197.8]
      [GC Worker End (ms): Min: 1100979.5, Avg: 1100979.7, Max: 1100979.9, 
Diff: 0.4]
   [Code Root Fixup: 0.6 ms]
   [Code Root Purge: 0.2 ms]
   [String Dedup Fixup: 1.0 ms, GC Workers: 33]
      [Queue Fixup (ms): Min: 0.0, Avg: 0.0, Max: 0.0, Diff: 0.0, Sum: 0.0]
      [Table Fixup (ms): Min: 0.0, Avg: 0.0, Max: 0.7, Diff: 0.7, Sum: 1.4]
   [Clear CT: 3.4 ms]
   [Other: 21.7 ms]
      [Choose CSet: 0.0 ms]
      [Ref Proc: 9.1 ms]
      [Ref Enq: 0.9 ms]
      [Redirty Cards: 4.3 ms]
      [Humongous Register: 0.2 ms]
      [Humongous Reclaim: 0.0 ms]
      [Free CSet: 5.3 ms]
   [Eden: 81184.0M(81184.0M)->0.0B(72576.0M) Survivors: 736.0M->9344.0M Heap: 
83508.0M(160.0G)->10944.0M(160.0G)]
 [Times: user=68.40 sys=9.11, real=4.13 secs]

Thanks,
Roy
_______________________________________________
hotspot-gc-use mailing list
hotspot-gc-use@openjdk.java.net
https://mail.openjdk.java.net/mailman/listinfo/hotspot-gc-use

Reply via email to