Can you try Flight Recorder with allocation profile on?

If Full GC can not clean those objects, the application probably is holding those. Might be a memory leak.

Thanks

Jenny


On 11/09/2016 11:38 PM, Amit Mishra wrote:

Hello Charlie/Poonam/team,

Need your help/suggestions on how to troubleshoot memory leak without taking any heap dump.

We are facing random Promotion failure followed by Continuous concurrent mode failures/Full GC events that impacts our Standalone application for long time until restart.

Application GC remain stable for more than a week with smooth saw tooth pattern and suddenly something happened within 1 hour or so that results in severe GC failure and ultimately application failure.

We have verified traffic pattern/application logs and other dependent application logs but there is no indication on why suddenly at one point of time heap usage kept on increasing which results in CMS failures.(Traffic pattern is fairly stable and there are no scheduled or cron jobs during time of issue)

We cannot take heap dump as this is standalone application having big heap size.(32G)

We have collected histogram during issue time and of non- issue time and found that instances of 2-3 classes have been suddenly increased from 200-300 MB to 5G+ but not sure how we can dig into code to find out what cause those classes instances to surge.

Please guide me how to troubleshoot this issue in terms of any light weight tool that would exactly pin point methods or calls that can lead to this memory leak as we can’t take heap dump which is very heavy impacting tool.

One more question is why Full GC not able to clean generations even after multiple attempts and a continuous loop of GC failures being created which got resolved only after application restart, does it indicates that no new objects was creating & it was only GC algorithm which started failing and increased heap usage.

Many thanks in advance for your kind support and guidance.

This is GC graph and attached is GC file.

cid:image002.jpg@01D23948.747997C0

Histogram snapshots:

java.util.HashMap$Entry was only 400 MB before issue and then 5.5G during issue same thing true for AcctSessionInfo and java.lang.String class instances.

Non issue time:

  num     #instances         #bytes  class name

----------------------------------------------

   1:      13613915     2219936904 [Ljava.lang.Object;

  2:      10065566     1569906056 [Ljava.util.HashMap$Entry;

3: 2671564 1175488160 com.redknee.product.s5600.ipc.xgen.PdpContextID

   4:      17247420      903565648  [C

   5:      10055084      723966048 java.util.HashMap

   6:      17208464      688338560 java.lang.String

   7:       7843562      439239472 java.util.HashMap$Entry

   8:      10065566      402622640 java.util.HashMap$FrontCache

Issue time :Heap usage around 28G

  num     #instances         #bytes  class name

----------------------------------------------

   1:     118037170     6600874168  [C

   2:     103071116     5771982496 java.util.HashMap$Entry

3: 101560457 5687385592 com.redknee.product.s5600.ipc.xgen.AcctSessionInfo

   4:     118042761     4721710440 java.lang.String

   5:       9942863     3020272632 [Ljava.lang.Object;

   6:       7537560     2737186632 [Ljava.util.HashMap$Entry;

7: 1453865 639700600 com.redknee.product.s5600.ipc.xgen.PdpContextID

   8:       7537148      542674656 java.util.HashMap

Thanks,

Amit Mishra



_______________________________________________
hotspot-gc-use mailing list
hotspot-gc-use@openjdk.java.net
http://mail.openjdk.java.net/mailman/listinfo/hotspot-gc-use

_______________________________________________
hotspot-gc-use mailing list
hotspot-gc-use@openjdk.java.net
http://mail.openjdk.java.net/mailman/listinfo/hotspot-gc-use

Reply via email to