Hello! Does it happen to work on 2.9.1, or will fail too? I recommend checking it since I vaguely remember some discussions about late affinity assignments fix.
Regards, -- Ilya Kasnacheev сб, 9 янв. 2021 г. в 03:11, tschauenberg <tschauenb...@invidi.com>: > Here's my attempt to demonstrate and also provide logs > > Standup 3 node cluster and load with data > > > Using a thick client, 250k devices are loaded into the device cache. The > thick client then leaves. There's one other thick client connected the > whole time for serving requests but I think that's irrelevant for the test > but want to point it out in case someone notices there's still a client > connected. > > Show topology from logs of the client leaving: > > > > [2021-01-08T23:08:05.012Z][INFO][disco-event-worker-#40][GridDiscoveryManager] > Node left topology: TcpDiscoveryNode > [id=611e30ee-b7c6-4ead-a746-f609b206cfb4, > consistentId=611e30ee-b7c6-4ead-a746-f609b206cfb4, addrs=ArrayList > [127.0.0.1, 172.17.0.3], sockAddrs=HashSet [/127.0.0.1:0, /172.17.0.3:0], > discPort=0, order=7, intOrder=6, lastExchangeTime=1610146373751, loc=false, > ver=2.8.1#20200521-sha1:86422096, isClient=true] > > [2021-01-08T23:08:05.013Z][INFO][disco-event-worker-#40][GridDiscoveryManager] > Topology snapshot [ver=8, locNode=75e4ddea, servers=3, clients=1, > state=ACTIVE, CPUs=7, offheap=3.0GB, heap=3.1GB] > > Start visor on one of the nodes > > > Show topology from logs > > > [2021-01-08T23:30:33.461Z][INFO][tcp-disco-msg-worker-[4ea8efe1 > 10.12.3.76:47500]-#2][TcpDiscoverySpi] New next node > [newNext=TcpDiscoveryNode [id=1cca94e3-f15f-4a8b-9f65-d9b9055a5fa7, > consistentId=10.12.2.110:47501, addrs=ArrayList [10.12.2.110], > sockAddrs=HashSet [/10.12.2.110:47501], discPort=47501, order=0, > intOrder=7, > lastExchangeTime=1610148633458, loc=false, > ver=2.8.1#20200521-sha1:86422096, > isClient=false]] > > [2021-01-08T23:30:34.045Z][INFO][sys-#1011][GridDhtPartitionsExchangeFuture] > Completed partition exchange > [localNode=75e4ddea-1927-4e93-82e9-fdfbb7b58d1c, > exchange=GridDhtPartitionsExchangeFuture [topVer=AffinityTopologyVersion > [topVer=9, minorTopVer=0], evt=NODE_JOINED, evtNode=TcpDiscoveryNode > [id=1cca94e3-f15f-4a8b-9f65-d9b9055a5fa7, consistentId=10.12.2.110:47501, > addrs=ArrayList [10.12.2.110], sockAddrs=HashSet [/10.12.2.110:47501], > discPort=47501, order=9, intOrder=7, lastExchangeTime=1610148633458, > loc=false, ver=2.8.1#20200521-sha1:86422096, isClient=false], done=true, > newCrdFut=null], topVer=AffinityTopologyVersion [topVer=9, minorTopVer=0]] > > Show data balanced in visor > > > > +-----------------------+-------------+-------+---------------------------------+-----------------------------------+-----------+-----------+-----------+----------------+ > | Devices(@c2) | PARTITIONED | 3 | 250000 (0 / 250000) > > | min: 80315 (0 / 80315) | min: 0 | min: 0 | min: 0 | > min: 250000 | > | | | | > > | avg: 83333.33 (0.00 / 83333.33) | avg: 0.00 | avg: 0.00 | avg: 0.00 | > avg: 250000.00 | > | | | | > > | max: 86968 (0 / 86968) | max: 0 | max: 0 | max: 0 | > max: 250000 | > > +-----------------------+-------------+-------+---------------------------------+-----------------------------------+-----------+-----------+-----------+----------------+ > > At this point the data is all relatively balanced and the topology > increased > when visor connected. > > Stop ignite on one node > > > Show topology and PME from logs (from a different ignite node as the ignite > process was stopped) > > > > [2021-01-08T23:35:39.333Z][INFO][disco-event-worker-#40][GridDiscoveryManager] > Node left topology: TcpDiscoveryNode > [id=75e4ddea-1927-4e93-82e9-fdfbb7b58d1c, > consistentId=3a4a497f-5a89-4f2c-8531-b2b05f2ede22, addrs=ArrayList > [10.12.2.110], sockAddrs=HashSet [/10.12.2.110:47500], discPort=47500, > order=3, intOrder=3, lastExchangeTime=1610139164908, loc=false, > ver=2.8.1#20200521-sha1:86422096, isClient=false] > > [2021-01-08T23:35:39.333Z][INFO][disco-event-worker-#40][GridDiscoveryManager] > Topology snapshot [ver=10, locNode=4ea8efe1, servers=2, clients=1, > state=ACTIVE, CPUs=5, offheap=2.0GB, heap=2.1GB] > [2021-01-08T23:35:39.333Z][INFO][disco-event-worker-#40][GridDiscoveryManager] > > ^-- Baseline [id=0, size=3, online=2, offline=1] > [2021-01-08T23:35:39.335Z][INFO][exchange-worker-#41][time] Started > exchange > init [topVer=AffinityTopologyVersion [topVer=10, minorTopVer=0], crd=true, > evt=NODE_LEFT, evtNode=75e4ddea-1927-4e93-82e9-fdfbb7b58d1c, > customEvt=null, > allowMerge=false, exchangeFreeSwitch=true] > [2021-01-08T23:35:39.338Z][INFO][sys-#1031][GridAffinityAssignmentCache] > Local node affinity assignment distribution is not ideal [cache=Households, > expectedPrimary=512.00, actualPrimary=548, expectedBackups=1024.00, > actualBackups=476, warningThreshold=50.00%] > [2021-01-08T23:35:39.340Z][INFO][sys-#1032][GridAffinityAssignmentCache] > Local node affinity assignment distribution is not ideal [cache=Devices, > expectedPrimary=512.00, actualPrimary=548, expectedBackups=1024.00, > actualBackups=476, warningThreshold=50.00%] > > [2021-01-08T23:35:39.354Z][INFO][exchange-worker-#41][GridDhtPartitionsExchangeFuture] > Finished waiting for partition release future > [topVer=AffinityTopologyVersion [topVer=10, minorTopVer=0], waitTime=0ms, > futInfo=NA, mode=DISTRIBUTED] > > [2021-01-08T23:35:39.357Z][INFO][exchange-worker-#41][GridDhtPartitionsExchangeFuture] > Finished waiting for partitions release latch: ServerLatch [permits=0, > pendingAcks=HashSet [], super=CompletableLatch [id=CompletableLatchUid > [id=exchange, topVer=AffinityTopologyVersion [topVer=10, minorTopVer=0]]]] > > [2021-01-08T23:35:39.365Z][INFO][exchange-worker-#41][GridDhtPartitionsExchangeFuture] > Finish exchange future [startVer=AffinityTopologyVersion [topVer=10, > minorTopVer=0], resVer=AffinityTopologyVersion [topVer=10, minorTopVer=0], > err=null, rebalanced=true, wasRebalanced=true] > > [2021-01-08T23:35:39.370Z][INFO][exchange-worker-#41][GridDhtPartitionsExchangeFuture] > Completed partition exchange > [localNode=4ea8efe1-0f92-4232-bbab-2cb232ed04f9, > exchange=GridDhtPartitionsExchangeFuture [topVer=AffinityTopologyVersion > [topVer=10, minorTopVer=0], evt=NODE_LEFT, evtNode=TcpDiscoveryNode > [id=75e4ddea-1927-4e93-82e9-fdfbb7b58d1c, > consistentId=3a4a497f-5a89-4f2c-8531-b2b05f2ede22, addrs=ArrayList > [10.12.2.110], sockAddrs=HashSet [/10.12.2.110:47500], discPort=47500, > order=3, intOrder=3, lastExchangeTime=1610139164908, loc=false, > ver=2.8.1#20200521-sha1:86422096, isClient=false], done=true, > newCrdFut=null], topVer=AffinityTopologyVersion [topVer=10, minorTopVer=0]] > > [2021-01-08T23:35:39.370Z][INFO][exchange-worker-#41][GridDhtPartitionsExchangeFuture] > Exchange timings [startVer=AffinityTopologyVersion [topVer=10, > minorTopVer=0], resVer=AffinityTopologyVersion [topVer=10, minorTopVer=0], > stage="Waiting in exchange queue" (0 ms), stage="Exchange parameters > initialization" (0 ms), stage="Determine exchange type" (15 ms), > stage="Preloading notification" (0 ms), stage="WAL history reservation" (2 > ms), stage="Wait partitions release" (0 ms), stage="Wait partitions release > latch" (2 ms), stage="After states restored callback" (5 ms), > stage="Finalize update counters" (2 ms), stage="Detect lost partitions" (1 > ms), stage="Exchange done" (3 ms), stage="Total time" (30 ms)] > > [2021-01-08T23:35:39.370Z][INFO][exchange-worker-#41][GridDhtPartitionsExchangeFuture] > Exchange longest local stages [startVer=AffinityTopologyVersion [topVer=10, > minorTopVer=0], resVer=AffinityTopologyVersion [topVer=10, minorTopVer=0], > stage="Affinity initialization (exchange-free switch on fully-rebalanced > topology) [grp=Households]" (15 ms) (parent=Determine exchange type), > stage="Affinity initialization (exchange-free switch on fully-rebalanced > topology) [grp=CorpHeadend]" (14 ms) (parent=Determine exchange type), > stage="Affinity initialization (exchange-free switch on fully-rebalanced > topology) [grp=AttributeOptions]" (13 ms) (parent=Determine exchange type)] > [2021-01-08T23:35:39.370Z][INFO][exchange-worker-#41][time] Finished > exchange init [topVer=AffinityTopologyVersion [topVer=10, minorTopVer=0], > crd=true] > > [2021-01-08T23:35:39.380Z][INFO][exchange-worker-#41][GridCachePartitionExchangeManager] > Skipping rebalancing (nothing scheduled) [top=AffinityTopologyVersion > [topVer=10, minorTopVer=0], force=false, evt=NODE_LEFT, > node=75e4ddea-1927-4e93-82e9-fdfbb7b58d1c] > > Show data in visor with data on the two nodes > > > > +-----------------------+-------------+-------+---------------------------------+-----------------------------------+-----------+-----------+-----------+----------------+ > | Devices(@c2) | PARTITIONED | 2 | 250000 (0 / 250000) > > | min: 116261 (0 / 116261) | min: 0 | min: 0 | min: 0 | > min: 250000 | > | | | | > > | avg: 125000.00 (0.00 / 125000.00) | avg: 0.00 | avg: 0.00 | avg: 0.00 | > avg: 250000.00 | > | | | | > > | max: 133739 (0 / 133739) | max: 0 | max: 0 | max: 0 | > max: 250000 | > > +-----------------------+-------------+-------+---------------------------------+-----------------------------------+-----------+-----------+-----------+----------------+ > > At this point there's only 2 nodes with data for this cache and it got > nicely balanced when the ignite process was stopped. > > Start the stopped ignite > > > Show topology and PME from logs (on node where ignite started) > > > > [2021-01-08T23:39:42.817Z][INFO][exchange-worker-#42][GridDhtPartitionsExchangeFuture] > Skipped waiting for partitions release future (local node is joining) > [topVer=AffinityTopologyVersion [topVer=11, minorTopVer=0]] > > [2021-01-08T23:39:43.006Z][INFO][db-checkpoint-thread-#54][GridCacheDatabaseSharedManager] > Checkpoint started [checkpointId=07c7fa75-d1dd-4955-964e-7d61bddbf05b, > startPtr=FileWALPointer [idx=7, fileOff=39245141, len=118735], > checkpointBeforeLockTime=91ms, checkpointLockWait=0ms, > checkpointListenersExecuteTime=29ms, checkpointLockHoldTime=51ms, > walCpRecordFsyncDuration=8ms, writeCheckpointEntryDuration=5ms, > splitAndSortCpPagesDuration=14ms, pages=2079, reason='node started'] > > [2021-01-08T23:39:43.163Z][INFO][grid-nio-worker-tcp-comm-0-#24][TcpCommunicationSpi] > Established outgoing communication connection [locAddr=/10.12.2.110:34662, > rmtAddr=/10.12.3.76:47100] > [2021-01-08T23:39:43.167Z][INFO][exchange-worker-#42][time] Finished > exchange init [topVer=AffinityTopologyVersion [topVer=11, minorTopVer=0], > crd=false] > [2021-01-08T23:39:43.368Z][INFO][sys-#48][GridDhtPartitionsExchangeFuture] > Received full message, will finish exchange > [node=4ea8efe1-0f92-4232-bbab-2cb232ed04f9, resVer=AffinityTopologyVersion > [topVer=11, minorTopVer=0]] > [2021-01-08T23:39:43.494Z][INFO][sys-#48][GridDhtPartitionsExchangeFuture] > Finish exchange future [startVer=AffinityTopologyVersion [topVer=11, > minorTopVer=0], resVer=AffinityTopologyVersion [topVer=11, minorTopVer=0], > err=null, rebalanced=false, wasRebalanced=false] > [2021-01-08T23:39:43.554Z][INFO][sys-#48][GridDhtPartitionsExchangeFuture] > Completed partition exchange > [localNode=b909d450-11cd-471f-b49e-85c6f300f704, > exchange=GridDhtPartitionsExchangeFuture [topVer=AffinityTopologyVersion > [topVer=11, minorTopVer=0], evt=NODE_JOINED, evtNode=TcpDiscoveryNode > [id=b909d450-11cd-471f-b49e-85c6f300f704, > consistentId=3a4a497f-5a89-4f2c-8531-b2b05f2ede22, addrs=ArrayList > [10.12.2.110], sockAddrs=HashSet [/10.12.2.110:47500], discPort=47500, > order=11, intOrder=8, lastExchangeTime=1610149183525, loc=true, > ver=2.8.1#20200521-sha1:86422096, isClient=false], done=true, > newCrdFut=null], topVer=AffinityTopologyVersion [topVer=11, minorTopVer=0]] > [2021-01-08T23:39:43.556Z][INFO][sys-#48][GridDhtPartitionsExchangeFuture] > Exchange timings [startVer=AffinityTopologyVersion [topVer=11, > minorTopVer=0], resVer=AffinityTopologyVersion [topVer=11, minorTopVer=0], > stage="Waiting in exchange queue" (53 ms), stage="Exchange parameters > initialization" (3 ms), stage="Components activation" (22 ms), > stage="Determine exchange type" (50 ms), stage="Preloading notification" (0 > ms), stage="WAL history reservation" (5 ms), stage="Restore partition > states" (11 ms), stage="After states restored callback" (147 ms), > stage="Waiting for Full message" (392 ms), stage="Affinity recalculation" > (89 ms), stage="Full map updating" (36 ms), stage="Detect lost partitions" > (4 ms), stage="Exchange done" (57 ms), stage="Total time" (869 ms)] > [2021-01-08T23:39:43.558Z][INFO][sys-#48][GridDhtPartitionsExchangeFuture] > Exchange longest local stages [startVer=AffinityTopologyVersion [topVer=11, > minorTopVer=0], resVer=AffinityTopologyVersion [topVer=11, minorTopVer=0], > stage="Affinity fetch" (48 ms) (parent=Determine exchange type), > stage="Restore partition states [grp=ignite-sys-cache]" (11 ms) > (parent=Restore partition states), stage="Restore partition states > [grp=MasterVersion]" (11 ms) (parent=Restore partition states), > stage="Restore partition states [grp=Devices]" (11 ms) (parent=Restore > partition states), stage="Affinity initialization (local join) > [grp=Devices]" (89 ms) (parent=Affinity recalculation), stage="Affinity > initialization (local join) [grp=Households]" (88 ms) (parent=Affinity > recalculation), stage="Affinity initialization (local join) > [grp=MasterVersion]" (67 ms) (parent=Affinity recalculation)] > > [2021-01-08T23:39:43.588Z][INFO][exchange-worker-#42][GridCachePartitionExchangeManager] > Skipping rebalancing (nothing scheduled) [top=AffinityTopologyVersion > [topVer=11, minorTopVer=0], force=false, evt=NODE_JOINED, > node=b909d450-11cd-471f-b49e-85c6f300f704] > [2021-01-08T23:39:43.642Z][INFO][main][GridDiscoveryManager] Topology > snapshot [ver=11, locNode=b909d450, servers=3, clients=1, state=ACTIVE, > CPUs=7, offheap=3.0GB, heap=3.1GB] > [2021-01-08T23:39:43.643Z][INFO][main][GridDiscoveryManager] ^-- Baseline > [id=0, size=3, online=3, offline=0] > [2021-01-08T23:39:43.645Z][INFO][main][G] Node started : [stage="Configure > system pool" (65 ms),stage="Start managers" (1113 ms),stage="Configure > binary metadata" (102 ms),stage="Start processors" (744 ms),stage="Init > metastore" (899 ms),stage="Init and start regions" (10 ms),stage="Restore > binary memory" (989 ms),stage="Restore logical state" (2450 > ms),stage="Finish recovery" (112 ms),stage="Join topology" (1498 > ms),stage="Await transition" (19 ms),stage="Await exchange" (951 > ms),stage="Total time" (8952 ms)] > > [2021-01-08T23:39:44.550Z][INFO][db-checkpoint-thread-#54][GridCacheDatabaseSharedManager] > Checkpoint finished [cpId=07c7fa75-d1dd-4955-964e-7d61bddbf05b, pages=2079, > markPos=FileWALPointer [idx=7, fileOff=39245141, len=118735], > walSegmentsCleared=0, walSegmentsCovered=[], markDuration=83ms, > pagesWrite=184ms, fsync=1359ms, total=1717ms] > > > Show data with a min of zero primaries in visor > > > > +-----------------------+-------------+-------+---------------------------------+-----------------------------------+-----------+-----------+-----------+----------------+ > | Devices(@c2) | PARTITIONED | 3 | 250000 (0 / 250000) > > | min: 0 (0 / 0) | min: 0 | min: 0 | min: 0 | > min: 0 | > | | | | > > | avg: 83333.33 (0.00 / 83333.33) | avg: 0.00 | avg: 0.00 | avg: 0.00 | > avg: 166666.67 | > | | | | > > | max: 133739 (0 / 133739) | max: 0 | max: 0 | max: 0 | > max: 250000 | > > +-----------------------+-------------+-------+---------------------------------+-----------------------------------+-----------+-----------+-----------+----------------+ > > At this point the stopped ignite process was restarted but notice the > primaries were not balanced to it upon joining. > > Stop visor > > > Show topology and PME from logs > > > > [2021-01-08T23:44:07.824Z][INFO][grid-nio-worker-tcp-comm-1-#25][TcpCommunicationSpi] > Accepted incoming communication connection [locAddr=/10.12.2.110:47100, > rmtAddr=/10.12.2.110:37510] > > [2021-01-08T23:44:35.597Z][INFO][tcp-disco-srvr-[:47500]-#3][TcpDiscoverySpi] > TCP discovery accepted incoming connection [rmtAddr=/10.12.4.242, > rmtPort=42615] > > [2021-01-08T23:44:35.598Z][INFO][tcp-disco-srvr-[:47500]-#3][TcpDiscoverySpi] > TCP discovery spawning a new thread for connection [rmtAddr=/10.12.4.242, > rmtPort=42615] > > [2021-01-08T23:44:35.598Z][INFO][tcp-disco-sock-reader-[]-#7][TcpDiscoverySpi] > Started serving remote node connection [rmtAddr=/10.12.4.242:42615, > rmtPort=42615] > [2021-01-08T23:44:35.599Z][INFO][tcp-disco-sock-reader-[63d7cf0d > 10.12.4.242:42615]-#7][TcpDiscoverySpi] Initialized connection with remote > server node [nodeId=63d7cf0d-f8b6-4fe2-ba99-4bbd26731451, > rmtAddr=/10.12.4.242:42615] > [2021-01-08T23:44:35.621Z][INFO][exchange-worker-#42][time] Started > exchange > init [topVer=AffinityTopologyVersion [topVer=12, minorTopVer=0], crd=false, > evt=NODE_LEFT, evtNode=1cca94e3-f15f-4a8b-9f65-d9b9055a5fa7, > customEvt=null, > allowMerge=true, exchangeFreeSwitch=false] > > [2021-01-08T23:44:35.632Z][INFO][exchange-worker-#42][GridDhtPartitionsExchangeFuture] > Finished waiting for partition release future > [topVer=AffinityTopologyVersion [topVer=12, minorTopVer=0], waitTime=0ms, > futInfo=NA, mode=DISTRIBUTED] > > [2021-01-08T23:44:35.644Z][INFO][exchange-worker-#42][GridDhtPartitionsExchangeFuture] > Finished waiting for partitions release latch: ClientLatch > [coordinator=TcpDiscoveryNode [id=4ea8efe1-0f92-4232-bbab-2cb232ed04f9, > consistentId=3b21ae44-c9cc-4e04-9e79-9807d22a6d50, addrs=ArrayList > [10.12.3.76], sockAddrs=HashSet [/10.12.3.76:47500], discPort=47500, > order=1, intOrder=1, lastExchangeTime=1610149182437, loc=false, > ver=2.8.1#20200521-sha1:86422096, isClient=false], ackSent=true, > super=CompletableLatch [id=CompletableLatchUid [id=exchange, > topVer=AffinityTopologyVersion [topVer=12, minorTopVer=0]]]] > > [2021-01-08T23:44:35.644Z][INFO][exchange-worker-#42][GridDhtPartitionsExchangeFuture] > Finished waiting for partition release future > [topVer=AffinityTopologyVersion [topVer=12, minorTopVer=0], waitTime=0ms, > futInfo=NA, mode=LOCAL] > [2021-01-08T23:44:35.688Z][INFO][exchange-worker-#42][time] Finished > exchange init [topVer=AffinityTopologyVersion [topVer=12, minorTopVer=0], > crd=false] > [2021-01-08T23:44:35.704Z][INFO][tcp-disco-sock-reader-[1cca94e3 > 10.12.2.110:58161]-#5][TcpDiscoverySpi] Finished serving remote node > connection [rmtAddr=/10.12.2.110:58161, rmtPort=58161 > [2021-01-08T23:44:35.761Z][INFO][sys-#80][GridDhtPartitionsExchangeFuture] > Received full message, will finish exchange > [node=4ea8efe1-0f92-4232-bbab-2cb232ed04f9, resVer=AffinityTopologyVersion > [topVer=12, minorTopVer=0]] > [2021-01-08T23:44:35.802Z][INFO][sys-#80][GridDhtPartitionsExchangeFuture] > Finish exchange future [startVer=AffinityTopologyVersion [topVer=12, > minorTopVer=0], resVer=AffinityTopologyVersion [topVer=12, minorTopVer=0], > err=null, rebalanced=true, wasRebalanced=false] > [2021-01-08T23:44:35.820Z][INFO][sys-#80][GridDhtPartitionsExchangeFuture] > Completed partition exchange > [localNode=b909d450-11cd-471f-b49e-85c6f300f704, > exchange=GridDhtPartitionsExchangeFuture [topVer=AffinityTopologyVersion > [topVer=12, minorTopVer=0], evt=NODE_LEFT, evtNode=TcpDiscoveryNode > [id=1cca94e3-f15f-4a8b-9f65-d9b9055a5fa7, consistentId=10.12.2.110:47501, > addrs=ArrayList [10.12.2.110], sockAddrs=HashSet [/10.12.2.110:47501], > discPort=47501, order=9, intOrder=7, lastExchangeTime=1610149182447, > loc=false, ver=2.8.1#20200521-sha1:86422096, isClient=false], done=true, > newCrdFut=null], topVer=AffinityTopologyVersion [topVer=12, minorTopVer=0]] > [2021-01-08T23:44:35.820Z][INFO][sys-#80][GridDhtPartitionsExchangeFuture] > Exchange timings [startVer=AffinityTopologyVersion [topVer=12, > minorTopVer=0], resVer=AffinityTopologyVersion [topVer=12, minorTopVer=0], > stage="Waiting in exchange queue" (0 ms), stage="Exchange parameters > initialization" (0 ms), stage="Determine exchange type" (0 ms), > stage="Preloading notification" (0 ms), stage="WAL history reservation" (2 > ms), stage="Wait partitions release" (13 ms), stage="Wait partitions > release > latch" (7 ms), stage="Wait partitions release" (0 ms), stage="After states > restored callback" (0 ms), stage="Finalize update counters" (19 ms), > stage="Waiting for Full message" (96 ms), stage="Affinity recalculation" > (31 > ms), stage="Full map updating" (9 ms), stage="Detect lost partitions" (2 > ms), stage="Exchange done" (15 ms), stage="Total time" (194 ms)] > [2021-01-08T23:44:35.820Z][INFO][sys-#80][GridDhtPartitionsExchangeFuture] > Exchange longest local stages [startVer=AffinityTopologyVersion [topVer=12, > minorTopVer=0], resVer=AffinityTopologyVersion [topVer=12, minorTopVer=0], > stage="Affinity applying from full message [grp=AttributeOptions]" (9 ms) > (parent=Affinity recalculation), stage="Affinity applying from full message > [grp=CorpHeadend]" (8 ms) (parent=Affinity recalculation), stage="Affinity > applying from full message [grp=Devices]" (6 ms) (parent=Affinity > recalculation)] > > [2021-01-08T23:44:35.824Z][INFO][exchange-worker-#42][GridCachePartitionExchangeManager] > Skipping rebalancing (nothing scheduled) [top=AffinityTopologyVersion > [topVer=12, minorTopVer=0], force=false, evt=NODE_LEFT, > node=1cca94e3-f15f-4a8b-9f65-d9b9055a5fa7] > > With visor leaving the cluster now, this triggers the PME for the primary > keys. Will be starting visor again to show that. > > Start visor again > > > Show topology and PME from logs > > > > [2021-01-08T23:46:52.977Z][INFO][tcp-disco-srvr-[:47500]-#3][TcpDiscoverySpi] > TCP discovery accepted incoming connection [rmtAddr=/10.12.2.110, > rmtPort=42455] > > [2021-01-08T23:46:52.977Z][INFO][tcp-disco-srvr-[:47500]-#3][TcpDiscoverySpi] > TCP discovery spawning a new thread for connection [rmtAddr=/10.12.2.110, > rmtPort=42455] > > [2021-01-08T23:46:52.982Z][INFO][tcp-disco-sock-reader-[]-#8][TcpDiscoverySpi] > Started serving remote node connection [rmtAddr=/10.12.2.110:42455, > rmtPort=42455] > [2021-01-08T23:46:52.984Z][INFO][tcp-disco-sock-reader-[6215e66a > 10.12.2.110:42455]-#8][TcpDiscoverySpi] Initialized connection with remote > server node [nodeId=6215e66a-b021-410f-8e3a-aaa317f19609, > rmtAddr=/10.12.2.110:42455] > rmtPort=424553:46:53.041Z][INFO][tcp-disco-sock-reader-[6215e66a > 10.12.2.110:42455]-#8][TcpDiscoverySpi] Finished serving remote node > connection [rmtAddr=/10.12.2.110:42455, > [2021-01-08T23:46:53.049Z][INFO][tcp-disco-msg-worker-[4ea8efe1 > 10.12.3.76:47500]-#2][TcpDiscoverySpi] New next node > [newNext=TcpDiscoveryNode [id=6215e66a-b021-410f-8e3a-aaa317f19609, > consistentId=10.12.2.110:47501, addrs=ArrayList [10.12.2.110], > sockAddrs=HashSet [/10.12.2.110:47501], discPort=47501, order=0, > intOrder=9, > lastExchangeTime=1610149613043, loc=false, > ver=2.8.1#20200521-sha1:86422096, > isClient=false]] > [2021-01-08T23:46:53.405Z][INFO][exchange-worker-#42][time] Started > exchange > init [topVer=AffinityTopologyVersion [topVer=13, minorTopVer=0], crd=false, > evt=NODE_JOINED, evtNode=6215e66a-b021-410f-8e3a-aaa317f19609, > customEvt=null, allowMerge=true, exchangeFreeSwitch=false] > > [2021-01-08T23:46:53.410Z][INFO][exchange-worker-#42][GridDhtPartitionsExchangeFuture] > Finished waiting for partition release future > [topVer=AffinityTopologyVersion [topVer=13, minorTopVer=0], waitTime=0ms, > futInfo=NA, mode=DISTRIBUTED] > e [id=4ea8efe1-0f92-4232-bbab-2cb232ed04f9, > consistentId=3b21ae44-c9cc-4e04-9e79-9807d22a6d50, addrs=ArrayList > [10.12.3.76], sockAddrs=HashSet [/10.12.3.76:47500], discPort=47500, > order=1, intOrder=1, lastExchangeTime=1610149182437, loc=false, > ver=2.8.1#20200521-sha1:86422096, isClient=false], ackSent=true, > super=CompletableLatch [id=CompletableLatchUid [id=exchange, > topVer=AffinityTopologyVersion [topVer=13, minorTopVer=0]]]] > > [2021-01-08T23:46:53.412Z][INFO][exchange-worker-#42][GridDhtPartitionsExchangeFuture] > Finished waiting for partition release future > [topVer=AffinityTopologyVersion [topVer=13, minorTopVer=0], waitTime=0ms, > futInfo=NA, mode=LOCAL] > [2021-01-08T23:46:53.437Z][INFO][exchange-worker-#42][time] Finished > exchange init [topVer=AffinityTopologyVersion [topVer=13, minorTopVer=0], > crd=false] > nityTopologyVersion [topVer=13, minorTopVer=0]]htPartitionsExchangeFuture] > Received full message, will finish exchange > [node=4ea8efe1-0f92-4232-bbab-2cb232ed04f9, > > resVer=Affi[2021-01-08T23:46:53.587Z][INFO][sys-#98][GridDhtPartitionsExchangeFuture] > Finish exchange future [startVer=AffinityTopologyVersion [topVer=13, > minorTopVer=0], resVer=AffinityTopologyVersion [topVer=13, minorTopVer=0], > err=null, rebalanced=true, wasRebalanced=true] > [2021-01-08T23:46:53.598Z][INFO][sys-#98][GridDhtPartitionsExchangeFuture] > Completed partition exchange > [localNode=b909d450-11cd-471f-b49e-85c6f300f704, > exchange=GridDhtPartitionsExchangeFuture [topVer=AffinityTopologyVersion > [topVer=13, minorTopVer=0], evt=NODE_JOINED, evtNode=TcpDiscoveryNode > [id=6215e66a-b021-410f-8e3a-aaa317f19609, consistentId=10.12.2.110:47501, > addrs=ArrayList [10.12.2.110], sockAddrs=HashSet [/10.12.2.110:47501], > discPort=47501, order=13, intOrder=9, lastExchangeTime=1610149613043, > loc=false, ver=2.8.1#20200521-sha1:86422096, isClient=false], done=true, > newCrdFut=null], topVer=AffinityTopologyVersion [topVer=13, minorTopVer=0]] > [2021-01-08T23:46:53.602Z][INFO][sys-#98][GridDhtPartitionsExchangeFuture] > Exchange timings [startVer=AffinityTopologyVersion [topVer=13, > minorTopVer=0], resVer=AffinityTopologyVersion [topVer=13, minorTopVer=0], > stage="Waiting in exchange queue" (0 ms), stage="Exchange parameters > initialization" (0 ms), stage="Determine exchange type" (1 ms), > stage="Preloading notification" (0 ms), stage="WAL history reservation" (2 > ms), stage="Wait partitions release" (0 ms), stage="Wait partitions release > latch" (1 ms), stage="Wait partitions release" (0 ms), stage="After states > restored callback" (0 ms), stage="Waiting for Full message" (102 ms), > stage="Affinity recalculation" (60 ms), stage="Full map updating" (11 ms), > stage="Detect lost partitions" (2 ms), stage="Exchange done" (9 ms), > stage="Total time" (188 ms)] > AffinityTopologyVersion [topVer=13, minorTopVer=0], stage="Affinity > initialization (node join) [grp=CorpHeadend, crd=false]" (33 ms) > (parent=Affinity recalculation), stage="Affinity initialization (node join) > [grp=Devices, crd=false]" (27 ms) (parent=Affinity recalculation), > stage="Affinity initialization (node join) [grp=MasterVersion, crd=false]" > (22 ms) (parent=Affinity recalculation)] > > [2021-01-08T23:46:53.614Z][INFO][exchange-worker-#42][GridCachePartitionExchangeManager] > Skipping rebalancing (nothing scheduled) [top=AffinityTopologyVersion > [topVer=13, minorTopVer=0], force=false, evt=NODE_JOINED, > node=6215e66a-b021-410f-8e3a-aaa317f19609] > > > Show data with primaries balanced in visor > > > > +-----------------------+-------------+-------+---------------------------------+-----------------------------------+-----------+-----------+-----------+----------------+ > | Devices(@c2) | PARTITIONED | 3 | 250000 (0 / 250000) > > | min: 80315 (0 / 80315) | min: 0 | min: 0 | min: 0 | > min: 0 | > | | | | > > | avg: 83333.33 (0.00 / 83333.33) | avg: 0.00 | avg: 0.00 | avg: 0.00 | > avg: 166666.67 | > | | | | > > | max: 86968 (0 / 86968) | max: 0 | max: 0 | max: 0 | > max: 250000 | > > +-----------------------+-------------+-------+---------------------------------+-----------------------------------+-----------+-----------+-----------+----------------+ > > Starting visor shows the keys have all been rebalanced again. > > > > > -- > Sent from: http://apache-ignite-users.70518.x6.nabble.com/ >