[
https://issues.apache.org/jira/browse/IGNITE-9305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16591363#comment-16591363
]
Pavel Pereslegin edited comment on IGNITE-9305 at 8/24/18 2:15 PM:
-------------------------------------------------------------------
[~dmagda], I changed the format, but the off-heap metrics are now not very
readable:
{noformat}
^-- H/N/C [hosts=1, nodes=2, CPUs=8]
^-- CPU [cur=2.73%, avg=1.6%, GC=0%]
^-- PageMemory [pages=34]
^-- Heap [used=71MB, free=97.99%, comm=204MB]
^-- Off-heap [used=0MB, free=99.94%, comm=220MB]
^-- Off-heap sysMemPlc region [used=0MB, free=99.98%, comm=100MB]
^-- Off-heap default region [used=0MB, free=99.62%, comm=20MB]
^-- Off-heap metastoreMemPlc region [used=0MB, free=99.96%, comm=100MB]
^-- Ignite persistence default region [used=0MB]
^-- Outbound messages queue [size=0]
^-- Public thread pool [active=0, idle=6, qSize=0]
^-- System thread pool [active=0, idle=7, qSize=0]
^-- Custom executor 0 [active=0, idle=0, qSize=0]
^-- Custom executor 1 [active=0, idle=0, qSize=0]
{noformat}
Maybe we could change the format to something like this?
{noformat}
Data region: {name}, off-heap [{params}]
Ignite persistence region: {name}, disk [{params}]
{noformat}
Example of such output:
{noformat}
^-- H/N/C [hosts=1, nodes=2, CPUs=8]
^-- CPU [cur=15.73%, avg=0%, GC=0%]
^-- PageMemory [pages=25]
^-- Heap [used=71MB, free=97.99%, comm=203MB]
^-- Off-heap [used=0MB, free=99.96%, comm=220MB]
^-- Data region: sysMemPlc, off-heap [used=0MB, free=99.98%, comm=100MB]
^-- Data region: default, off-heap [used=0MB, free=99.8%, comm=20MB]
^-- Data region: metastoreMemPlc, off-heap [used=0MB, free=99.96%, comm=100MB]
^-- Ignite persistence region: default, disk [used=0MB]
^-- Outbound messages queue [size=0]
^-- Public thread pool [active=0, idle=6, qSize=0]
^-- System thread pool [active=0, idle=6, qSize=0]
^-- Custom executor 0 [active=0, idle=0, qSize=0]
^-- Custom executor 1 [active=0, idle=0, qSize=0]
{noformat}
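A minimal sketch of how a line in the proposed layout could be assembled. The class and method names (RegionLineFormat, line) are hypothetical and are not taken from the Ignite code base; the parameters are passed as preformatted strings purely for illustration.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.StringJoiner;

/** Hypothetical helper (not part of Ignite): renders one metrics line in the proposed layout. */
final class RegionLineFormat {
    /** Builds "^-- {kind}: {name}, {storage} [k1=v1, k2=v2]" keeping the parameter order. */
    static String line(String kind, String name, String storage, Map<String, String> params) {
        StringJoiner joined = new StringJoiner(", ", "[", "]");

        for (Map.Entry<String, String> e : params.entrySet())
            joined.add(e.getKey() + "=" + e.getValue());

        return "^-- " + kind + ": " + name + ", " + storage + " " + joined;
    }

    public static void main(String[] args) {
        // LinkedHashMap preserves insertion order, so used/free/comm stay in that order.
        Map<String, String> mem = new LinkedHashMap<>();
        mem.put("used", "0MB");
        mem.put("free", "99.98%");
        mem.put("comm", "100MB");
        System.out.println(line("Data region", "sysMemPlc", "off-heap", mem));

        Map<String, String> disk = new LinkedHashMap<>();
        disk.put("used", "0MB");
        System.out.println(line("Ignite persistence region", "default", "disk", disk));
    }
}
```

Putting the region name right after the line type keeps lines of the same type aligned on a common prefix, which is the readability point of the proposal.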
> Wrong off-heap size is reported for a node
> ------------------------------------------
>
> Key: IGNITE-9305
> URL: https://issues.apache.org/jira/browse/IGNITE-9305
> Project: Ignite
> Issue Type: Task
> Affects Versions: 2.6
> Reporter: Denis Magda
> Assignee: Pavel Pereslegin
> Priority: Blocker
> Fix For: 2.7
>
>
> Was troubleshooting an Ignite deployment today and couldn't find out from the
> logs how much off-heap space was actually used.
> These were the given memory resources (Ignite 2.6):
> {code}
> [2018-08-16 15:07:49,961][INFO ][main][GridDiscoveryManager] Topology
> snapshot [ver=1, servers=1, clients=0, CPUs=64, offheap=30.0GB, heap=24.0GB]
> {code}
> And that weird stuff was reported by the node (pay attention to the last
> line):
> {code}
> [2018-08-16 15:45:50,211][INFO
> ][grid-timeout-worker-#135%cluster_31-Dec-2017%][IgniteKernal%cluster_31-Dec-2017]
>
> Metrics for local node (to disable set 'metricsLogFrequency' to 0)
> ^-- Node [id=c033026e, name=cluster_31-Dec-2017, uptime=00:38:00.257]
> ^-- H/N/C [hosts=1, nodes=1, CPUs=64]
> ^-- CPU [cur=0.03%, avg=5.54%, GC=0%]
> ^-- PageMemory [pages=6997377]
> ^-- Heap [used=9706MB, free=61.18%, comm=22384MB]
> ^-- Non heap [used=144MB, free=-1%, comm=148MB] - this line is always the
> same!
> {code}
> Had to change the code to use
> {code}dataRegion.getPhysicalMemoryPages(){code} to find out that the actual
> off-heap usage was
> {code}
> >>> Physical Memory Size: 28651614208 => 27324 MB, 26 GB
> {code}
> The logs have to report the following instead:
> {code}
> ^-- Off-heap {Data Region 1} [used={dataRegion1.getPhysicalMemorySize()},
> free=X%, comm={dataRegion1.maxSize()}]
> ^-- Off-heap {Data Region 2} [used={dataRegion2.getPhysicalMemorySize()},
> free=X%, comm={dataRegion2.maxSize()}]
> {code}
> If Ignite persistence is enabled then the following extra lines have to be
> added to see the disk used space:
> {code}
> ^-- Ignite persistence {Data Region 1}
> [used={dataRegion1.getTotalAllocatedSize() -
> dataRegion1.getPhysicalMemorySize()}]
> ^-- Ignite persistence {Data Region 2}
> [used={dataRegion2.getTotalAllocatedSize() -
> dataRegion2.getPhysicalMemorySize()}]
> {code}
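The arithmetic behind the requested lines can be sketched as follows. This is only an illustration under stated assumptions: the class RegionUsageReport and its methods are hypothetical, the byte counts are passed in as plain long values rather than read from Ignite metrics, the percentage is printed with two decimals, and a negative disk size is clamped to zero.

```java
import java.util.Locale;

/** Hypothetical sketch (not Ignite code): derives the requested log lines from raw byte counts. */
final class RegionUsageReport {
    private static final long MB = 1024L * 1024L;

    /** Whole megabytes, rounded down. */
    static long toMb(long bytes) {
        return bytes / MB;
    }

    /** Share of the region's maximum size that is still unused, in percent. */
    static double freePercent(long usedBytes, long maxBytes) {
        if (maxBytes <= 0)
            throw new IllegalArgumentException("maxBytes must be positive: " + maxBytes);

        return 100.0 - 100.0 * usedBytes / maxBytes;
    }

    /** Off-heap line: used = physical memory size, comm = maximum region size. */
    static String offHeapLine(String region, long physicalBytes, long maxBytes) {
        return String.format(Locale.ROOT, "^-- Off-heap %s [used=%dMB, free=%.2f%%, comm=%dMB]",
            region, toMb(physicalBytes), freePercent(physicalBytes, maxBytes), toMb(maxBytes));
    }

    /** Persistence line: used = total allocated size minus physical memory size. */
    static String persistenceLine(String region, long totalAllocatedBytes, long physicalBytes) {
        // Assumption: clamp to zero so the line never shows a negative size.
        long diskBytes = Math.max(0L, totalAllocatedBytes - physicalBytes);

        return String.format(Locale.ROOT, "^-- Ignite persistence %s [used=%dMB]", region, toMb(diskBytes));
    }

    public static void main(String[] args) {
        long physical = 28_651_614_208L; // physical memory size quoted in the issue
        long max = 30L * 1024L * MB;     // offheap=30.0GB from the topology snapshot

        System.out.println(offHeapLine("default", physical, max));
        System.out.println(persistenceLine("default", physical + 512L * MB, physical));
    }
}
```

With the physical memory size quoted in the issue, toMb yields 27324, which matches the value reported there.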