[ 
https://issues.apache.org/jira/browse/IGNITE-12089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16923860#comment-16923860
 ] 

temp2 edited comment on IGNITE-12089 at 9/6/19 1:58 AM:
--------------------------------------------------------

Hi Stanilovsky Evgeny,
I don't understand what "cluster configuration inconsistency" means, or why 
isClient=false is needed.

I will describe the whole test environment and the steps:
1. Prepare 4 physical machines: 3 servers and 1 client. Install Oracle 
jdk1.8.0_101 on all 4 machines and create the necessary directories on each 
machine.

2. On one of the three server machines, unpack "apache-ignite-2.7.5-bin.zip". 
In the unpacked directory, edit "bin\ignite.sh" to increase the -Xmx value to 
10g, and modify "config\default-config.xml" (see the attachments above).

3. Copy the entire unpacked and modified apache-ignite directory to the two 
other server machines.

4. On two of the server machines, run "bin\ignite.sh 
config\default-config.xml" to start two server nodes.

5. On one of the server machines, run "bin\control.sh --activate" to activate 
the cluster. This step can also be skipped.

6. Put the IgniteTest2 test code and its dependency jars 
(apache-ignite-2.7.5-bin.zip is not required) on the fourth machine and run it 
directly with the java command. The arguments are "com.test.IgniteTest2 5 0 
test 2000000000 192.168.20.12:49500 192.168.20.13:49500 192.168.20.14:49500".

7. 20 minutes later, run "bin\ignite.sh config\default-config.xml" on the 
third server machine.

8. A few minutes after the third server starts, one or both of the first two 
servers log "Partitions cache sizes are inconsistent", and a moment later the 
JVM on one or both of them exits with the errors.
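
The commands in steps 4 to 7 could be scripted roughly as follows. This is 
only a dry-run sketch, not the exact scripts used in the test: IGNITE_HOME, 
its default path, the DRY_RUN switch and the "libs/*" classpath are 
assumptions that do not come from the report, and paths are written with 
forward slashes as a Unix shell expects. Each function is meant to be run on 
the machine named in its comment.

```shell
#!/bin/sh
# Dry-run sketch of steps 4-7. IGNITE_HOME and the classpath are assumptions.
# Commands are only printed unless DRY_RUN=0 is set.
IGNITE_HOME="${IGNITE_HOME:-/opt/apache-ignite-2.7.5-bin}"  # hypothetical path
DRY_RUN="${DRY_RUN:-1}"

run() {
    # Print the command; execute it only when DRY_RUN=0.
    echo "+ $*"
    if [ "$DRY_RUN" = "0" ]; then "$@"; fi
}

start_server() {
    # Steps 4 and 7: run on a server machine (step 7: about 20 minutes later).
    run "$IGNITE_HOME/bin/ignite.sh" "$IGNITE_HOME/config/default-config.xml"
}

activate_cluster() {
    # Step 5: run on one of the server machines.
    run "$IGNITE_HOME/bin/control.sh" --activate
}

start_client() {
    # Step 6: run on the client machine. "libs/*" is a hypothetical classpath
    # holding the IgniteTest2 class and its dependency jars.
    run java -cp "libs/*" com.test.IgniteTest2 5 0 test 2000000000 \
        192.168.20.12:49500 192.168.20.13:49500 192.168.20.14:49500
}
```

With the default DRY_RUN=1, calling for example activate_cluster only prints 
the command line, which makes it easy to check the sequence before running it 
on real machines.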




> JVM is halted after this error during rolling restart of a cluster
> ------------------------------------------------------------------
>
>                 Key: IGNITE-12089
>                 URL: https://issues.apache.org/jira/browse/IGNITE-12089
>             Project: Ignite
>          Issue Type: Bug
>    Affects Versions: 2.6
>            Reporter: temp2
>            Priority: Critical
>         Attachments: IgniteTest2.java, default-config.xml, ignite27.log, 
> ignite42.log
>
>
> JVM is halted after this error during rolling restart of a cluster.
> The exception is:
>
> 528-a852-c65782e337f0][2019-08-20 17:22:10,901][ERROR][ttl-cleanup-worker-#155][] Critical system error detected. Will be handled accordingly to configured handler [hnd=class o.a.i.failure.StopNodeOrHaltFailureHandler, failureCtx=FailureContext [type=SYSTEM_WORKER_TERMINATION, err=class o.a.i.IgniteException: Runtime failure on bounds: [lower=PendingRow [], upper=PendingRow []]]]
> org.apache.ignite.IgniteException: Runtime failure on bounds: [lower=PendingRow [], upper=PendingRow []]
>     at org.apache.ignite.internal.processors.cache.persistence.tree.BPlusTree.find(BPlusTree.java:971) ~[ignite-core-2.6.0.jar:2.6.0]
>     at org.apache.ignite.internal.processors.cache.persistence.tree.BPlusTree.find(BPlusTree.java:950) ~[ignite-core-2.6.0.jar:2.6.0]
>     at org.apache.ignite.internal.processors.cache.IgniteCacheOffheapManagerImpl.expire(IgniteCacheOffheapManagerImpl.java:1022) ~[ignite-core-2.6.0.jar:2.6.0]
>     at org.apache.ignite.internal.processors.cache.GridCacheTtlManager.expire(GridCacheTtlManager.java:197) ~[ignite-core-2.6.0.jar:2.6.0]
>     at org.apache.ignite.internal.processors.cache.GridCacheSharedTtlCleanupManager$CleanupWorker.body(GridCacheSharedTtlCleanupManager.java:137) [ignite-core-2.6.0.jar:2.6.0]
>     at org.apache.ignite.internal.util.worker.GridWorker.run(GridWorker.java:110) [ignite-core-2.6.0.jar:2.6.0]
>     at java.lang.Thread.run(Thread.java:745) [?:1.8.0_101]
> Caused by: java.lang.IllegalStateException: Failed to get page IO instance (page content is corrupted)
>     at org.apache.ignite.internal.processors.cache.persistence.tree.io.IOVersions.forVersion(IOVersions.java:83) ~[ignite-core-2.6.0.jar:2.6.0]
>     at org.apache.ignite.internal.processors.cache.persistence.tree.io.IOVersions.forPage(IOVersions.java:95) ~[ignite-core-2.6.0.jar:2.6.0]
>     at org.apache.ignite.internal.processors.cache.persistence.CacheDataRowAdapter.initFromLink(CacheDataRowAdapter.java:148) ~[ignite-core-2.6.0.jar:2.6.0]
>     at org.apache.ignite.internal.processors.cache.persistence.CacheDataRowAdapter.initFromLink(CacheDataRowAdapter.java:102) ~[ignite-core-2.6.0.jar:2.6.0]
>     at org.apache.ignite.internal.processors.cache.tree.PendingRow.initKey(PendingRow.java:72) ~[ignite-core-2.6.0.jar:2.6.0]
>     at org.apache.ignite.internal.processors.cache.tree.PendingEntriesTree.getRow(PendingEntriesTree.java:118) ~[ignite-core-2.6.0.jar:2.6.0]
>     at org.apache.ignite.internal.processors.cache.tree.PendingEntriesTree.getRow(PendingEntriesTree.java:31) ~[ignite-core-2.6.0.jar:2.6.0]
>     at org.apache.ignite.internal.processors.cache.persistence.tree.BPlusTree$ForwardCursor.fillFromBuffer(BPlusTree.java:4660) ~[ignite-core-2.6.0.jar:2.6.0]
>     at org.apache.ignite.internal.processors.cache.persistence.tree.BPlusTree$ForwardCursor.init(BPlusTree.java:4562) ~[ignite-core-2.6.0.jar:2.6.0]
>     at org.apache.ignite.internal.processors.cache.persistence.tree.BPlusTree$ForwardCursor.access$5300(BPlusTree.java:4501) ~[ignite-core-2.6.0.jar:2.6.0]
>     at org.apache.ignite.internal.processors.cache.persistence.tree.BPlusTree$GetCursor.notFound(BPlusTree.java:2633) ~[ignite-core-2.6.0.jar:2.6.0]
>     at org.apache.ignite.internal.processors.cache.persistence.tree.BPlusTree$Search.run0(BPlusTree.java:293) ~[ignite-core-2.6.0.jar:2.6.0]
>     at org.apache.ignite.internal.processors.cache.persistence.tree.BPlusTree$GetPageHandler.run(BPlusTree.java:4816) ~[ignite-core-2.6.0.jar:2.6.0]
>     at org.apache.ignite.internal.processors.cache.persistence.tree.BPlusTree$GetPageHandler.run(BPlusTree.java:4801) ~[ignite-core-2.6.0.jar:2.6.0]
>     at org.apache.ignite.internal.processors.cache.persistence.tree.util.PageHandler.readPage(PageHandler.java:158) ~[ignite-core-2.6.0.jar:2.6.0]
>     at org.apache.ignite.internal.processors.cache.persistence.DataStructure.read(DataStructure.java:332) ~[ignite-core-2.6.0.jar:2.6.0]
>     at org.apache.ignite.internal.processors.cache.persistence.tree.BPlusTree.findDown(BPlusTree.java:1140) ~[ignite-core-2.6.0.jar:2.6.0]
>     at org.apache.ignite.internal.processors.cache.persistence.tree.BPlusTree.findDown(BPlusTree.java:1149) ~[ignite-core-2.6.0.jar:2.6.0]
>     at org.apache.ignite.internal.processors.cache.persistence.tree.BPlusTree.doFind(BPlusTree.java:1107) ~[ignite-core-2.6.0.jar:2.6.0]
>     at org.apache.ignite.internal.processors.cache.persistence.tree.BPlusTree.access$15800(BPlusTree.java:83) ~[ignite-core-2.6.0.jar:2.6.0]
>     at org.apache.ignite.internal.processors.cache.persistence.tree.BPlusTree$ForwardCursor.find(BPlusTree.java:4716) ~[ignite-core-2.6.0.jar:2.6.0]
>     at org.apache.ignite.internal.processors.cache.persistence.tree.BPlusTree$ForwardCursor.access$5400(BPlusTree.java:4501) ~[ignite-core-2.6.0.jar:2.6.0]
>     at org.apache.ignite.internal.processors.cache.persistence.tree.BPlusTree.find(BPlusTree.java:963) ~[ignite-core-2.6.0.jar:2.6.0]
>     ... 6 more
> [2019-08-20 17:22:10,909][ERROR][ttl-cleanup-worker-#155][] JVM will be halted immediately due to the failure: [failureCtx=FailureContext [type=SYSTEM_WORKER_TERMINATION, err=class o.a.i.IgniteException: Runtime failure on bounds: [lower=PendingRow [], upper=PendingRow []]]]
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.2#803003)
