[jira] [Commented] (IGNITE-11298) TcpCommunicationSpi does not support TLSv1.3

2019-05-28 Thread Vitaliy Biryukov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-11298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16850001#comment-16850001
 ] 

Vitaliy Biryukov commented on IGNITE-11298:
---

[~ilyak], unfortunately I slill have fluky issue on java12. I hope that it will 
disappear if all the sockets are closed after the streams like in pr. But I 
need to check it. 

> TcpCommunicationSpi does not support TLSv1.3
> 
>
> Key: IGNITE-11298
> URL: https://issues.apache.org/jira/browse/IGNITE-11298
> Project: Ignite
>  Issue Type: Bug
>  Components: general
>Affects Versions: 2.7
>Reporter: Ilya Kasnacheev
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: Java11
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> When started on Java 11 we cannot form a secure cluster - Discovery will 
> happily use the default TLSv1.3 but Communication will fail with its custom 
> SSLEngine-using code.
> Need to fix that.
> Until that, nodes may be salvaged by setProtocol("TLSv1.2") on 
> SslContextFactory, or by system property -Djdk.tls.client.protocols="TLSv1.2"



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (IGNITE-11298) TcpCommunicationSpi does not support TLSv1.3

2019-05-27 Thread Vitaliy Biryukov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-11298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16848825#comment-16848825
 ] 

Vitaliy Biryukov commented on IGNITE-11298:
---

[~ilyak]

I've checked on my Ubuntu pc.

It seems like we are dealing with a [known 
bug|https://bugs.openjdk.java.net/browse/JDK-8219658] related to Half-Close 
Policy. It's affects only TcpDiscoverySpi (checked only on ubuntu).
 This bag cause deadlock between SslSocket.close and inputStream.read. And it 
locks so hard so I have to restart my PC to return my network back to normal.

Thread dump:
{noformat}
"test-runner-#1%tcp.TcpCommunicationSpiFaultyClientTest%" #15 prio=5 os_prio=0 
cpu=2139,72ms elapsed=75,55s tid=0x7f8ac0815800 nid=0x6422 waiting for 
monitor entry  [0x7f8a8cee4000]
   java.lang.Thread.State: BLOCKED (on object monitor)
at 
sun.security.ssl.SSLSocketImpl$AppInputStream.deplete(java.base@12.0.1/SSLSocketImpl.java:921)
- waiting to lock <0x00070791c5e8> (a 
sun.security.ssl.SSLSocketImpl$AppInputStream)
at 
sun.security.ssl.SSLSocketImpl.bruteForceCloseInput(java.base@12.0.1/SSLSocketImpl.java:615)
at 
sun.security.ssl.SSLSocketImpl.duplexCloseOutput(java.base@12.0.1/SSLSocketImpl.java:566)
at sun.security.ssl.SSLSocketImpl.close(java.base@12.0.1/SSLSocketImpl.java:479)
at org.apache.ignite.internal.util.IgniteUtils.closeQuiet(IgniteUtils.java:4135)
at 
org.apache.ignite.spi.discovery.tcp.ServerImpl$SocketReader.interrupt(ServerImpl.java:7084)
at org.apache.ignite.internal.util.IgniteUtils.interrupt(IgniteUtils.java:4724)
at org.apache.ignite.spi.discovery.tcp.ServerImpl.spiStop0(ServerImpl.java:510)
at org.apache.ignite.spi.discovery.tcp.ServerImpl.spiStop(ServerImpl.java:442)
at 
org.apache.ignite.spi.discovery.tcp.TcpDiscoverySpi.spiStop(TcpDiscoverySpi.java:2217)
at 
org.apache.ignite.internal.managers.GridManagerAdapter.stopSpi(GridManagerAdapter.java:330)
at 
org.apache.ignite.internal.managers.discovery.GridDiscoveryManager.stop(GridDiscoveryManager.java:1743)
at org.apache.ignite.internal.IgniteKernal.stop0(IgniteKernal.java:2413)
at org.apache.ignite.internal.IgniteKernal.stop(IgniteKernal.java:2285)
at 
org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.stop0(IgnitionEx.java:2574)
- locked <0x000707b608a8> (a 
org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance)
at 
org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.stop(IgnitionEx.java:2537)
at org.apache.ignite.internal.IgnitionEx.stop(IgnitionEx.java:330)
at org.apache.ignite.Ignition.stop(Ignition.java:223)
at 
org.apache.ignite.testframework.junits.GridAbstractTest.stopGrid(GridAbstractTest.java:1253)
at 
org.apache.ignite.testframework.junits.GridAbstractTest.stopAllGrids(GridAbstractTest.java:1296)
at 
org.apache.ignite.testframework.junits.GridAbstractTest.stopAllGrids(GridAbstractTest.java:1274)
at 
org.apache.ignite.spi.communication.tcp.TcpCommunicationSpiFaultyClientTest.testFailClient(TcpCommunicationSpiFaultyClientTest.java:284)
at 
org.apache.ignite.spi.communication.tcp.TcpCommunicationSpiFaultyClientTest.testNoServerOnHost(TcpCommunicationSpiFaultyClientTest.java:154)
at 
jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(java.base@12.0.1/Native 
Method)
at 
jdk.internal.reflect.NativeMethodAccessorImpl.invoke(java.base@12.0.1/NativeMethodAccessorImpl.java:62)
at 
jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(java.base@12.0.1/DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(java.base@12.0.1/Method.java:567)
at 
org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
at 
org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
at 
org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
at 
org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
at org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
at org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
at 
org.apache.ignite.testframework.junits.GridAbstractTest$6.run(GridAbstractTest.java:2145)
at java.lang.Thread.run(java.base@12.0.1/Thread.java:835)


"tcp-disco-sock-reader-[31894d88 
127.0.0.1:56645]-#12%tcp.TcpCommunicationSpiFaultyClientTest1%" #114 prio=10 
os_prio=0 cpu=123,49ms elapsed=72,51s tid=0x7f8a84138800 nid=0x6494 
runnable  [0x7f89939de000]
   java.lang.Thread.State: RUNNABLE
at java.net.SocketInputStream.socketRead0(java.base@12.0.1/Native Method)
at 
java.net.SocketInputStream.socketRead(java.base@12.0.1/SocketInputStream.java:115)
at java.net.SocketInputStream.read(java.base@12.0.1/SocketInputStream.java:168)
at java.net.SocketInputStream.read(java.base@12.0.1/SocketInputStream.java:140)
at 
sun.security.ssl.SSLSocketInputRecord.read(java.base@12.0.1/SSLSocketInputRecord.java:448)
at 

[jira] [Commented] (IGNITE-11298) TcpCommunicationSpi does not support TLSv1.3

2019-05-23 Thread Vitaliy Biryukov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-11298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16847046#comment-16847046
 ] 

Vitaliy Biryukov commented on IGNITE-11298:
---

[~ilyak], Yes, I'll try to fix it by the end of this week. 

> TcpCommunicationSpi does not support TLSv1.3
> 
>
> Key: IGNITE-11298
> URL: https://issues.apache.org/jira/browse/IGNITE-11298
> Project: Ignite
>  Issue Type: Bug
>  Components: general
>Affects Versions: 2.7
>Reporter: Ilya Kasnacheev
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: Java11
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> When started on Java 11 we cannot form a secure cluster - Discovery will 
> happily use the default TLSv1.3 but Communication will fail with its custom 
> SSLEngine-using code.
> Need to fix that.
> Until that, nodes may be salvaged by setProtocol("TLSv1.2") on 
> SslContextFactory, or by system property -Djdk.tls.client.protocols="TLSv1.2"



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Assigned] (IGNITE-11298) TcpCommunicationSpi does not support TLSv1.3

2019-03-26 Thread Vitaliy Biryukov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-11298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov reassigned IGNITE-11298:
-

Assignee: Vitaliy Biryukov

> TcpCommunicationSpi does not support TLSv1.3
> 
>
> Key: IGNITE-11298
> URL: https://issues.apache.org/jira/browse/IGNITE-11298
> Project: Ignite
>  Issue Type: Bug
>  Components: general
>Affects Versions: 2.7
>Reporter: Ilya Kasnacheev
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: Java11
>
> When started on Java 11 we cannot form a secure cluster - Discovery will 
> happily use the default TLSv1.3 but Communication will fail with its custom 
> SSLEngine-using code.
> Need to fix that.
> Until that, nodes may be salvaged by setProtocol("TLSv1.2") on 
> SslContextFactory, or by system property -Djdk.tls.client.protocols="TLSv1.2"



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (IGNITE-9520) Investigate fuzzy free lists

2019-03-21 Thread Vitaliy Biryukov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-9520?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16798380#comment-16798380
 ] 

Vitaliy Biryukov commented on IGNITE-9520:
--

Hi, [~agoncharuk].
Is this task still actual after optimizations from 
[IGNITE-10720|https://issues.apache.org/jira/browse/IGNITE-10720] and 
[IGNITE-9612|https://issues.apache.org/jira/browse/IGNITE-9612]?

> Investigate fuzzy free lists
> 
>
> Key: IGNITE-9520
> URL: https://issues.apache.org/jira/browse/IGNITE-9520
> Project: Ignite
>  Issue Type: Task
>Reporter: Alexey Goncharuk
>Assignee: Vitaliy Biryukov
>Priority: Major
>
> We have several data structures (free list, reuse list) associated with each 
> partition. For these structures a major part of their state is maintained 
> on-heap and persisted during checkpoints.
> This yields a lot of random disk accesses during checkpoints which 
> significantly increases checkpoint mark phase (done under checkpoint write 
> lock and essentially blocks all tx ops on the node).
> Need to investigate if we can implement some sort of a data structure which 
> is updated lazily and may be out-of date, then we can update these data 
> structures outside of checkpoint mark phases.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Assigned] (IGNITE-9520) Investigate fuzzy free lists

2019-03-12 Thread Vitaliy Biryukov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-9520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov reassigned IGNITE-9520:


Assignee: Vitaliy Biryukov

> Investigate fuzzy free lists
> 
>
> Key: IGNITE-9520
> URL: https://issues.apache.org/jira/browse/IGNITE-9520
> Project: Ignite
>  Issue Type: Task
>Reporter: Alexey Goncharuk
>Assignee: Vitaliy Biryukov
>Priority: Major
>
> We have several data structures (free list, reuse list) associated with each 
> partition. For these structures a major part of their state is maintained 
> on-heap and persisted during checkpoints.
> This yields a lot of random disk accesses during checkpoints which 
> significantly increases checkpoint mark phase (done under checkpoint write 
> lock and essentially blocks all tx ops on the node).
> Need to investigate if we can implement some sort of a data structure which 
> is updated lazily and may be out-of date, then we can update these data 
> structures outside of checkpoint mark phases.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (IGNITE-11086) IGNITE_REST_SECURITY_TOKEN_TIMEOUT parameter is set in deciseconds instead of seconds.

2019-01-25 Thread Vitaliy Biryukov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-11086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov updated IGNITE-11086:
--
Description: 
According to javaDoc IGNITE_REST_SECURITY_TOKEN_TIMEOUT should be set in 
seconds.

Can be fixed by multiplying this parameter by 1000 instead of 100 in 
GridRestProcessor's constructor:
{code:java}
sesTokTtl = IgniteSystemProperties.getLong(IGNITE_REST_SECURITY_TOKEN_TIMEOUT, 
DFLT_SES_TOKEN_INVALIDATE_INTERVAL) * 100;
{code}

  was:
According to javaDoc IGNITE_REST_SECURITY_TOKEN_TIMEOUT should sit in seconds.

Can be fixed by multiplying this parameter by 1000 instead of 100 in 
GridRestProcessor's constructor:
{code:java}
sesTokTtl = IgniteSystemProperties.getLong(IGNITE_REST_SECURITY_TOKEN_TIMEOUT, 
DFLT_SES_TOKEN_INVALIDATE_INTERVAL) * 100;
{code}



> IGNITE_REST_SECURITY_TOKEN_TIMEOUT parameter is set in deciseconds instead of 
> seconds.
> --
>
> Key: IGNITE-11086
> URL: https://issues.apache.org/jira/browse/IGNITE-11086
> Project: Ignite
>  Issue Type: Bug
>  Components: rest
>Affects Versions: 2.7
>Reporter: Vitaliy Biryukov
>Priority: Major
>
> According to javaDoc IGNITE_REST_SECURITY_TOKEN_TIMEOUT should be set in 
> seconds.
> Can be fixed by multiplying this parameter by 1000 instead of 100 in 
> GridRestProcessor's constructor:
> {code:java}
> sesTokTtl = 
> IgniteSystemProperties.getLong(IGNITE_REST_SECURITY_TOKEN_TIMEOUT, 
> DFLT_SES_TOKEN_INVALIDATE_INTERVAL) * 100;
> {code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (IGNITE-11086) IGNITE_REST_SECURITY_TOKEN_TIMEOUT parameter is set in deciseconds instead of seconds.

2019-01-25 Thread Vitaliy Biryukov (JIRA)
Vitaliy Biryukov created IGNITE-11086:
-

 Summary: IGNITE_REST_SECURITY_TOKEN_TIMEOUT parameter is set in 
deciseconds instead of seconds.
 Key: IGNITE-11086
 URL: https://issues.apache.org/jira/browse/IGNITE-11086
 Project: Ignite
  Issue Type: Bug
  Components: rest
Affects Versions: 2.7
Reporter: Vitaliy Biryukov


According to javaDoc IGNITE_REST_SECURITY_TOKEN_TIMEOUT should sit in seconds.

Can be fixed by multiplying this parameter by 1000 instead of 100 in 
GridRestProcessor's constructor:
{code:java}
sesTokTtl = IgniteSystemProperties.getLong(IGNITE_REST_SECURITY_TOKEN_TIMEOUT, 
DFLT_SES_TOKEN_INVALIDATE_INTERVAL) * 100;
{code}




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (IGNITE-10731) ZookeeperDiscoverySpiTestSuite4: IgniteCacheReplicatedQuerySelfTest.testNodeLeft fails

2018-12-19 Thread Vitaliy Biryukov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-10731?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16724837#comment-16724837
 ] 

Vitaliy Biryukov commented on IGNITE-10731:
---

Hi, [~dpavlov].
Please take a look.
This test did not pass for two reasons.
1) "localSize" method was called on a server node instead of a client node.
2) Zk discovery SPI generates EVT_NODE_FAILED instead of EVT_NODE_LEFT.

> ZookeeperDiscoverySpiTestSuite4: 
> IgniteCacheReplicatedQuerySelfTest.testNodeLeft fails
> --
>
> Key: IGNITE-10731
> URL: https://issues.apache.org/jira/browse/IGNITE-10731
> Project: Ignite
>  Issue Type: Improvement
>Affects Versions: 2.7
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
> Fix For: 2.8
>
>
> {noformat}
> junit.framework.AssertionFailedError: 
> Expected :0
> Actual   :312
>  
>   at junit.framework.Assert.fail(Assert.java:57)
>   at junit.framework.Assert.failNotEquals(Assert.java:329)
>   at junit.framework.Assert.assertEquals(Assert.java:78)
>   at junit.framework.Assert.assertEquals(Assert.java:234)
>   at junit.framework.Assert.assertEquals(Assert.java:241)
>   at junit.framework.TestCase.assertEquals(TestCase.java:409)
>   at 
> org.apache.ignite.internal.processors.cache.distributed.replicated.IgniteCacheReplicatedQuerySelfTest.testNodeLeft(IgniteCacheReplicatedQuerySelfTest.java:348)
>   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>   at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>   at java.lang.reflect.Method.invoke(Method.java:498)
>   at junit.framework.TestCase.runTest(TestCase.java:176)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest.access$001(GridAbstractTest.java:151)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest$6.evaluate(GridAbstractTest.java:2102)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest$7.run(GridAbstractTest.java:2117)
>   at java.lang.Thread.run(Thread.java:745)
> {noformat}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (IGNITE-10731) ZookeeperDiscoverySpiTestSuite4: IgniteCacheReplicatedQuerySelfTest.testNodeLeft fails

2018-12-18 Thread Vitaliy Biryukov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-10731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov updated IGNITE-10731:
--
Description: 

{noformat}
junit.framework.AssertionFailedError: 
Expected :0
Actual   :312
 


at junit.framework.Assert.fail(Assert.java:57)
at junit.framework.Assert.failNotEquals(Assert.java:329)
at junit.framework.Assert.assertEquals(Assert.java:78)
at junit.framework.Assert.assertEquals(Assert.java:234)
at junit.framework.Assert.assertEquals(Assert.java:241)
at junit.framework.TestCase.assertEquals(TestCase.java:409)
at 
org.apache.ignite.internal.processors.cache.distributed.replicated.IgniteCacheReplicatedQuerySelfTest.testNodeLeft(IgniteCacheReplicatedQuerySelfTest.java:348)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:498)
at junit.framework.TestCase.runTest(TestCase.java:176)
at 
org.apache.ignite.testframework.junits.GridAbstractTest.access$001(GridAbstractTest.java:151)
at 
org.apache.ignite.testframework.junits.GridAbstractTest$6.evaluate(GridAbstractTest.java:2102)
at 
org.apache.ignite.testframework.junits.GridAbstractTest$7.run(GridAbstractTest.java:2117)
at java.lang.Thread.run(Thread.java:745)

{noformat}


  was:junit.framework.AssertionFailedError: Expected :0 Actual :312  at junit.framework.Assert.fail(Assert.java:57) at 
junit.framework.Assert.failNotEquals(Assert.java:329) at 
junit.framework.Assert.assertEquals(Assert.java:78) at 
junit.framework.Assert.assertEquals(Assert.java:234) at 
junit.framework.Assert.assertEquals(Assert.java:241) at 
junit.framework.TestCase.assertEquals(TestCase.java:409) at 
org.apache.ignite.internal.processors.cache.distributed.replicated.IgniteCacheReplicatedQuerySelfTest.testNodeLeft(IgniteCacheReplicatedQuerySelfTest.java:348)
 at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62) 
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
 at java.lang.reflect.Method.invoke(Method.java:498) at 
junit.framework.TestCase.runTest(TestCase.java:176) at 
org.apache.ignite.testframework.junits.GridAbstractTest.access$001(GridAbstractTest.java:151)
 at 
org.apache.ignite.testframework.junits.GridAbstractTest$6.evaluate(GridAbstractTest.java:2102)
 at 
org.apache.ignite.testframework.junits.GridAbstractTest$7.run(GridAbstractTest.java:2117)
 at java.lang.Thread.run(Thread.java:745)


> ZookeeperDiscoverySpiTestSuite4: 
> IgniteCacheReplicatedQuerySelfTest.testNodeLeft fails
> --
>
> Key: IGNITE-10731
> URL: https://issues.apache.org/jira/browse/IGNITE-10731
> Project: Ignite
>  Issue Type: Improvement
>Affects Versions: 2.7
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
> Fix For: 2.8
>
>
> {noformat}
> junit.framework.AssertionFailedError: 
> Expected :0
> Actual   :312
>  
>   at junit.framework.Assert.fail(Assert.java:57)
>   at junit.framework.Assert.failNotEquals(Assert.java:329)
>   at junit.framework.Assert.assertEquals(Assert.java:78)
>   at junit.framework.Assert.assertEquals(Assert.java:234)
>   at junit.framework.Assert.assertEquals(Assert.java:241)
>   at junit.framework.TestCase.assertEquals(TestCase.java:409)
>   at 
> org.apache.ignite.internal.processors.cache.distributed.replicated.IgniteCacheReplicatedQuerySelfTest.testNodeLeft(IgniteCacheReplicatedQuerySelfTest.java:348)
>   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>   at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>   at java.lang.reflect.Method.invoke(Method.java:498)
>   at junit.framework.TestCase.runTest(TestCase.java:176)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest.access$001(GridAbstractTest.java:151)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest$6.evaluate(GridAbstractTest.java:2102)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest$7.run(GridAbstractTest.java:2117)
>   at java.lang.Thread.run(Thread.java:745)
> {noformat}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (IGNITE-10731) ZookeeperDiscoverySpiTestSuite4: IgniteCacheReplicatedQuerySelfTest.testNodeLeft fails

2018-12-18 Thread Vitaliy Biryukov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-10731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov updated IGNITE-10731:
--
Environment: (was: {noformat}
junit.framework.AssertionFailedError: 
Expected :0
Actual   :312
 


at junit.framework.Assert.fail(Assert.java:57)
at junit.framework.Assert.failNotEquals(Assert.java:329)
at junit.framework.Assert.assertEquals(Assert.java:78)
at junit.framework.Assert.assertEquals(Assert.java:234)
at junit.framework.Assert.assertEquals(Assert.java:241)
at junit.framework.TestCase.assertEquals(TestCase.java:409)
at 
org.apache.ignite.internal.processors.cache.distributed.replicated.IgniteCacheReplicatedQuerySelfTest.testNodeLeft(IgniteCacheReplicatedQuerySelfTest.java:348)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:498)
at junit.framework.TestCase.runTest(TestCase.java:176)
at 
org.apache.ignite.testframework.junits.GridAbstractTest.access$001(GridAbstractTest.java:151)
at 
org.apache.ignite.testframework.junits.GridAbstractTest$6.evaluate(GridAbstractTest.java:2102)
at 
org.apache.ignite.testframework.junits.GridAbstractTest$7.run(GridAbstractTest.java:2117)
at java.lang.Thread.run(Thread.java:745)

{noformat})

> ZookeeperDiscoverySpiTestSuite4: 
> IgniteCacheReplicatedQuerySelfTest.testNodeLeft fails
> --
>
> Key: IGNITE-10731
> URL: https://issues.apache.org/jira/browse/IGNITE-10731
> Project: Ignite
>  Issue Type: Improvement
>Affects Versions: 2.7
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
> Fix For: 2.8
>
>
> junit.framework.AssertionFailedError: Expected :0 Actual :312  difference> at junit.framework.Assert.fail(Assert.java:57) at 
> junit.framework.Assert.failNotEquals(Assert.java:329) at 
> junit.framework.Assert.assertEquals(Assert.java:78) at 
> junit.framework.Assert.assertEquals(Assert.java:234) at 
> junit.framework.Assert.assertEquals(Assert.java:241) at 
> junit.framework.TestCase.assertEquals(TestCase.java:409) at 
> org.apache.ignite.internal.processors.cache.distributed.replicated.IgniteCacheReplicatedQuerySelfTest.testNodeLeft(IgniteCacheReplicatedQuerySelfTest.java:348)
>  at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62) 
> at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>  at java.lang.reflect.Method.invoke(Method.java:498) at 
> junit.framework.TestCase.runTest(TestCase.java:176) at 
> org.apache.ignite.testframework.junits.GridAbstractTest.access$001(GridAbstractTest.java:151)
>  at 
> org.apache.ignite.testframework.junits.GridAbstractTest$6.evaluate(GridAbstractTest.java:2102)
>  at 
> org.apache.ignite.testframework.junits.GridAbstractTest$7.run(GridAbstractTest.java:2117)
>  at java.lang.Thread.run(Thread.java:745)



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (IGNITE-10731) ZookeeperDiscoverySpiTestSuite4: IgniteCacheReplicatedQuerySelfTest.testNodeLeft fails

2018-12-18 Thread Vitaliy Biryukov (JIRA)
Vitaliy Biryukov created IGNITE-10731:
-

 Summary: ZookeeperDiscoverySpiTestSuite4: 
IgniteCacheReplicatedQuerySelfTest.testNodeLeft fails
 Key: IGNITE-10731
 URL: https://issues.apache.org/jira/browse/IGNITE-10731
 Project: Ignite
  Issue Type: Improvement
Affects Versions: 2.7
 Environment: {noformat}
junit.framework.AssertionFailedError: 
Expected :0
Actual   :312
 


at junit.framework.Assert.fail(Assert.java:57)
at junit.framework.Assert.failNotEquals(Assert.java:329)
at junit.framework.Assert.assertEquals(Assert.java:78)
at junit.framework.Assert.assertEquals(Assert.java:234)
at junit.framework.Assert.assertEquals(Assert.java:241)
at junit.framework.TestCase.assertEquals(TestCase.java:409)
at 
org.apache.ignite.internal.processors.cache.distributed.replicated.IgniteCacheReplicatedQuerySelfTest.testNodeLeft(IgniteCacheReplicatedQuerySelfTest.java:348)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:498)
at junit.framework.TestCase.runTest(TestCase.java:176)
at 
org.apache.ignite.testframework.junits.GridAbstractTest.access$001(GridAbstractTest.java:151)
at 
org.apache.ignite.testframework.junits.GridAbstractTest$6.evaluate(GridAbstractTest.java:2102)
at 
org.apache.ignite.testframework.junits.GridAbstractTest$7.run(GridAbstractTest.java:2117)
at java.lang.Thread.run(Thread.java:745)

{noformat}
Reporter: Vitaliy Biryukov
Assignee: Vitaliy Biryukov
 Fix For: 2.8






--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (IGNITE-10731) ZookeeperDiscoverySpiTestSuite4: IgniteCacheReplicatedQuerySelfTest.testNodeLeft fails

2018-12-18 Thread Vitaliy Biryukov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-10731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov updated IGNITE-10731:
--
Description: junit.framework.AssertionFailedError: Expected :0 Actual :312 
 at junit.framework.Assert.fail(Assert.java:57) at 
junit.framework.Assert.failNotEquals(Assert.java:329) at 
junit.framework.Assert.assertEquals(Assert.java:78) at 
junit.framework.Assert.assertEquals(Assert.java:234) at 
junit.framework.Assert.assertEquals(Assert.java:241) at 
junit.framework.TestCase.assertEquals(TestCase.java:409) at 
org.apache.ignite.internal.processors.cache.distributed.replicated.IgniteCacheReplicatedQuerySelfTest.testNodeLeft(IgniteCacheReplicatedQuerySelfTest.java:348)
 at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62) 
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
 at java.lang.reflect.Method.invoke(Method.java:498) at 
junit.framework.TestCase.runTest(TestCase.java:176) at 
org.apache.ignite.testframework.junits.GridAbstractTest.access$001(GridAbstractTest.java:151)
 at 
org.apache.ignite.testframework.junits.GridAbstractTest$6.evaluate(GridAbstractTest.java:2102)
 at 
org.apache.ignite.testframework.junits.GridAbstractTest$7.run(GridAbstractTest.java:2117)
 at java.lang.Thread.run(Thread.java:745)

> ZookeeperDiscoverySpiTestSuite4: 
> IgniteCacheReplicatedQuerySelfTest.testNodeLeft fails
> --
>
> Key: IGNITE-10731
> URL: https://issues.apache.org/jira/browse/IGNITE-10731
> Project: Ignite
>  Issue Type: Improvement
>Affects Versions: 2.7
> Environment: {noformat}
> junit.framework.AssertionFailedError: 
> Expected :0
> Actual   :312
>  
>   at junit.framework.Assert.fail(Assert.java:57)
>   at junit.framework.Assert.failNotEquals(Assert.java:329)
>   at junit.framework.Assert.assertEquals(Assert.java:78)
>   at junit.framework.Assert.assertEquals(Assert.java:234)
>   at junit.framework.Assert.assertEquals(Assert.java:241)
>   at junit.framework.TestCase.assertEquals(TestCase.java:409)
>   at 
> org.apache.ignite.internal.processors.cache.distributed.replicated.IgniteCacheReplicatedQuerySelfTest.testNodeLeft(IgniteCacheReplicatedQuerySelfTest.java:348)
>   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>   at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>   at java.lang.reflect.Method.invoke(Method.java:498)
>   at junit.framework.TestCase.runTest(TestCase.java:176)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest.access$001(GridAbstractTest.java:151)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest$6.evaluate(GridAbstractTest.java:2102)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest$7.run(GridAbstractTest.java:2117)
>   at java.lang.Thread.run(Thread.java:745)
> {noformat}
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
> Fix For: 2.8
>
>
> junit.framework.AssertionFailedError: Expected :0 Actual :312  difference> at junit.framework.Assert.fail(Assert.java:57) at 
> junit.framework.Assert.failNotEquals(Assert.java:329) at 
> junit.framework.Assert.assertEquals(Assert.java:78) at 
> junit.framework.Assert.assertEquals(Assert.java:234) at 
> junit.framework.Assert.assertEquals(Assert.java:241) at 
> junit.framework.TestCase.assertEquals(TestCase.java:409) at 
> org.apache.ignite.internal.processors.cache.distributed.replicated.IgniteCacheReplicatedQuerySelfTest.testNodeLeft(IgniteCacheReplicatedQuerySelfTest.java:348)
>  at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62) 
> at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>  at java.lang.reflect.Method.invoke(Method.java:498) at 
> junit.framework.TestCase.runTest(TestCase.java:176) at 
> org.apache.ignite.testframework.junits.GridAbstractTest.access$001(GridAbstractTest.java:151)
>  at 
> org.apache.ignite.testframework.junits.GridAbstractTest$6.evaluate(GridAbstractTest.java:2102)
>  at 
> org.apache.ignite.testframework.junits.GridAbstractTest$7.run(GridAbstractTest.java:2117)
>  at java.lang.Thread.run(Thread.java:745)



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (IGNITE-7832) Ignite.resetLostPartitions() resets state under race.

2018-11-20 Thread Vitaliy Biryukov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-7832?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16692997#comment-16692997
 ] 

Vitaliy Biryukov commented on IGNITE-7832:
--

Hi, [~amashenkov].
 In my opinion, we can't avoid such race with current API.
 I think we should add an extra parameter to *resetLostPartitions*() method, 
such as topology version or array of partitions to reset.

Also at the moment, events are recorded for each lost partition. And in the 
case described in the description, users are forced to find out that the last 
event has arrived before call *resetLostPartitions*(). 
 Perhaps it would be more useful if there was one event for all partitions lost 
on a particular topology version.


 Thoughts?

> Ignite.resetLostPartitions() resets state under race.
> -
>
> Key: IGNITE-7832
> URL: https://issues.apache.org/jira/browse/IGNITE-7832
> Project: Ignite
>  Issue Type: Task
>  Components: cache
>Reporter: Andrew Mashenkov
>Assignee: Vitaliy Biryukov
>Priority: Critical
> Fix For: 2.8
>
>
> Assume, we have event listener that detects partition loss events and apply 
> some actions to recover lost data.
> After recovery process finished an Ignite.resetLostPartitions() method should 
> be called to mark all lost cache partitions as healthy.
> It is possible Ignite.resetLostPartitions() will be called during exchange, 
> but right before a new partition loss event will be fired.
> E.g. exchange thread own GridDhtPartitionTopologyImpl write lock in 
> detectLostPartitions() method, while user thread will wait for the lock 
> inside Ignite.resetLostPartitions().
> So, after a new partition loss will be detected, is will be not possible to 
> abort user action and state of just lost partition will be reset.
> For that case, we should either abort resetLostPartitions() or reset 
> partitions state regarding topology version provided by user some how.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (IGNITE-9861) Authorization object names must always be non-null.

2018-10-11 Thread Vitaliy Biryukov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-9861?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov updated IGNITE-9861:
-
Component/s: security

> Authorization object names must always be non-null.
> ---
>
> Key: IGNITE-9861
> URL: https://issues.apache.org/jira/browse/IGNITE-9861
> Project: Ignite
>  Issue Type: Improvement
>  Components: security
>Affects Versions: 2.6
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
> Fix For: 2.8
>
>
> Currently, sometimes *null* name parameter passing to a method 
> *GridSecurityProcessor:authorize*. This leads to the fact that it is 
> impossible to determine the authorization object by authorization event.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (IGNITE-9861) Authorization object names must always be non-null.

2018-10-11 Thread Vitaliy Biryukov (JIRA)
Vitaliy Biryukov created IGNITE-9861:


 Summary: Authorization object names must always be non-null.
 Key: IGNITE-9861
 URL: https://issues.apache.org/jira/browse/IGNITE-9861
 Project: Ignite
  Issue Type: Improvement
Affects Versions: 2.6
Reporter: Vitaliy Biryukov
Assignee: Vitaliy Biryukov
 Fix For: 2.8


Currently, sometimes *null* name parameter passing to a method 
*GridSecurityProcessor:authorize*. This leads to the fact that it is impossible 
to determine the authorization object by authorization event.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (IGNITE-6445) IgniteTxManager.txLocksInfo method misses locks

2018-09-19 Thread Vitaliy Biryukov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-6445?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov updated IGNITE-6445:
-
Fix Version/s: (was: 2.7)
   2.8

> IgniteTxManager.txLocksInfo method misses locks
> ---
>
> Key: IGNITE-6445
> URL: https://issues.apache.org/jira/browse/IGNITE-6445
> Project: Ignite
>  Issue Type: Bug
>Affects Versions: 2.2
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
> Fix For: 2.8
>
>
> In some cases "IgniteTxManager.txLocksInfo" method (searches for locks) 
> misses locks.
> For example:
> # In case of a configuration with near cache, entries are created for the 
> near cache and for the ordinal cache. For each entry, their own MVCC 
> candidates are created.
> # For non-custom objects of type (Integer, etc.), the entry stored in 
> "GridNearTxLocal" is not associated with MVCC candidates with which the same 
> entity is associated in another format stored in "GridDhtTxLocal"



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (IGNITE-6224) Node stoping does not wait all transactions completion

2018-09-19 Thread Vitaliy Biryukov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-6224?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov updated IGNITE-6224:
-
Fix Version/s: (was: 2.7)
   2.8

> Node stoping does not wait all transactions completion
> --
>
> Key: IGNITE-6224
> URL: https://issues.apache.org/jira/browse/IGNITE-6224
> Project: Ignite
>  Issue Type: Bug
>Affects Versions: 2.1
>Reporter: Vladislav Pyatkov
>Assignee: Vitaliy Biryukov
>Priority: Major
> Fix For: 2.8
>
> Attachments: TransactionBehindStopNodeTest.java
>
>
> I have started grid node and executing transaction over some cache. After I 
> stopped the node in the middle execution of transaction. I got transaction 
> execution exception:
> {noformat}
> java.lang.IllegalStateException: class 
> org.apache.ignite.internal.processors.cache.CacheStoppedException: Failed to 
> perform cache operation (cache is stopped): cache
>   at 
> org.apache.ignite.internal.processors.cache.GridCacheGateway.enter(GridCacheGateway.java:164)
>   at 
> org.apache.ignite.internal.processors.cache.GatewayProtectedCacheProxy.onEnter(GatewayProtectedCacheProxy.java:1656)
>   at 
> org.apache.ignite.internal.processors.cache.GatewayProtectedCacheProxy.put(GatewayProtectedCacheProxy.java:869)
>   at 
> org.apache.ignite.TransactionBehindStopNodeTest.testOneNode(TransactionBehindStopNodeTest.java:56)
>   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>   at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>   at java.lang.reflect.Method.invoke(Method.java:498)
>   at junit.framework.TestCase.runTest(TestCase.java:176)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest.runTestInternal(GridAbstractTest.java:2000)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest.access$000(GridAbstractTest.java:132)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest$5.run(GridAbstractTest.java:1915)
>   at java.lang.Thread.run(Thread.java:745)
> {noformat}
> although I stopped node with _false_ {{cancel}} flag.
> {code}
> G.stop(getTestIgniteInstanceName(0), false);
> {code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (IGNITE-9138) ZookeeperDiscoverySpiTest#checkInternalStructuresCleanup fails.

2018-09-19 Thread Vitaliy Biryukov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-9138?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov updated IGNITE-9138:
-
Fix Version/s: (was: 2.7)
   2.8

> ZookeeperDiscoverySpiTest#checkInternalStructuresCleanup fails.
> ---
>
> Key: IGNITE-9138
> URL: https://issues.apache.org/jira/browse/IGNITE-9138
> Project: Ignite
>  Issue Type: Bug
>Affects Versions: 2.6
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
> Fix For: 2.8
>
>
>  
> {noformat}
> junit.framework.AssertionFailedError: Expected:  but was: 
> ZkCommunicationErrorProcessFuture 
> [impl=org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl@180b3819,
>  endTime=1532545453881, id=9e083d2d461-645a2360-f5bb-43d3-8327-83d0a4a00124, 
> state=WAIT_TIMEOUT, resolveTopVer=0, resErr=null, collectResFut=null]
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.checkInternalStructuresCleanup(ZookeeperDiscoverySpiTest.java:517)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.afterTest(ZookeeperDiscoverySpiTest.java:476)
> {noformat}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Resolved] (IGNITE-9243) Avoid test suit hangs on stopAllGrids

2018-09-19 Thread Vitaliy Biryukov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-9243?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov resolved IGNITE-9243.
--
Resolution: Won't Fix

My idea did not work.

> Avoid test suit hangs on stopAllGrids
> -
>
> Key: IGNITE-9243
> URL: https://issues.apache.org/jira/browse/IGNITE-9243
> Project: Ignite
>  Issue Type: Improvement
>Affects Versions: 2.6
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
> Fix For: 2.7
>
>
> Tests sometimes hang on node join to topology, and this leads to the hang of 
> the whole suite.
> Solution: Do not wait for nodes to join a topology in *stopAllGrids* method.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Assigned] (IGNITE-7832) Ignite.resetLostPartitions() resets state under race.

2018-09-03 Thread Vitaliy Biryukov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-7832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov reassigned IGNITE-7832:


Assignee: Vitaliy Biryukov

> Ignite.resetLostPartitions() resets state under race.
> -
>
> Key: IGNITE-7832
> URL: https://issues.apache.org/jira/browse/IGNITE-7832
> Project: Ignite
>  Issue Type: Task
>  Components: cache
>Reporter: Andrew Mashenkov
>Assignee: Vitaliy Biryukov
>Priority: Critical
> Fix For: 2.7
>
>
> Assume, we have event listener that detects partition loss events and apply 
> some actions to recover lost data.
> After recovery process finished an Ignite.resetLostPartitions() method should 
> be called to mark all lost cache partitions as healthy.
> It is possible Ignite.resetLostPartitions() will be called during exchange, 
> but right before a new partition loss event will be fired.
> E.g. exchange thread own GridDhtPartitionTopologyImpl write lock in 
> detectLostPartitions() method, while user thread will wait for the lock 
> inside Ignite.resetLostPartitions().
> So, after a new partition loss will be detected, is will be not possible to 
> abort user action and state of just lost partition will be reset.
> For that case, we should either abort resetLostPartitions() or reset 
> partitions state regarding topology version provided by user some how.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (IGNITE-9169) Cache (Deadlock Detection) suite hangs

2018-08-17 Thread Vitaliy Biryukov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-9169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16584214#comment-16584214
 ] 

Vitaliy Biryukov commented on IGNITE-9169:
--

[~dpavlov], thank you.

It was the test-only issue.

> Cache (Deadlock Detection) suite hangs
> --
>
> Key: IGNITE-9169
> URL: https://issues.apache.org/jira/browse/IGNITE-9169
> Project: Ignite
>  Issue Type: Bug
>Affects Versions: 2.6
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
> Fix For: 2.7
>
> Attachments: threads_report.txt
>
>
> {noformat}
> "main" #1 prio=5 os_prio=0 tid=0x7f289800e000 nid=0x67bb in Object.wait() 
> [0x7f289f9dc000]
>java.lang.Thread.State: WAITING (on object monitor)
>   at java.lang.Object.wait(Native Method)
>   at java.lang.Thread.join(Thread.java:1252)
>   - locked <0x849c82b0> (a 
> org.apache.ignite.testframework.GridTestSafeThreadFactory$1)
>   at 
> org.apache.ignite.internal.util.IgniteUtils.join(IgniteUtils.java:4621)
>   at 
> org.apache.ignite.internal.util.IgniteUtils.join(IgniteUtils.java:4606)
>   at 
> org.apache.ignite.internal.util.IgniteUtils.joinThreads(IgniteUtils.java:4650)
>   at 
> org.apache.ignite.testframework.GridTestSafeThreadFactory.stopAllThreads(GridTestSafeThreadFactory.java:206)
>   at 
> org.apache.ignite.testframework.GridTestUtils.stopThreads(GridTestUtils.java:985)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest.tearDown(GridAbstractTest.java:1705)
>   at 
> org.apache.ignite.testframework.junits.common.GridCommonAbstractTest.tearDown(GridCommonAbstractTest.java:497)
>   at junit.framework.TestCase.runBare(TestCase.java:146)
>   at junit.framework.TestResult$1.protect(TestResult.java:122)
>   at junit.framework.TestResult.runProtected(TestResult.java:142)
>   at junit.framework.TestResult.run(TestResult.java:125)
>   at junit.framework.TestCase.run(TestCase.java:129)
>   at junit.framework.TestSuite.runTest(TestSuite.java:255)
>   at junit.framework.TestSuite.run(TestSuite.java:250)
>   at junit.framework.TestSuite.runTest(TestSuite.java:255)
>   at junit.framework.TestSuite.run(TestSuite.java:250)
>   at 
> org.junit.internal.runners.JUnit38ClassRunner.run(JUnit38ClassRunner.java:84)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.execute(JUnit4Provider.java:369)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.executeWithRerun(JUnit4Provider.java:275)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:239)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:160)
>   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>   at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>   at java.lang.reflect.Method.invoke(Method.java:498)
>   at 
> org.apache.maven.surefire.util.ReflectionUtils.invokeMethodWithArray2(ReflectionUtils.java:206)
>   at 
> org.apache.maven.surefire.booter.ProviderFactory$ProviderProxy.invoke(ProviderFactory.java:160)
>   at 
> org.apache.maven.surefire.booter.ProviderFactory.invokeProvider(ProviderFactory.java:83)
>   at 
> org.apache.maven.plugin.surefire.InPluginVMSurefireStarter.runSuitesInProcess(InPluginVMSurefireStarter.java:84)
>   at 
> org.apache.maven.plugin.surefire.AbstractSurefireMojo.executeProvider(AbstractSurefireMojo.java:1107)
>   at 
> org.apache.maven.plugin.surefire.AbstractSurefireMojo.executeAfterPreconditionsChecked(AbstractSurefireMojo.java:954)
>   at 
> org.apache.maven.plugin.surefire.AbstractSurefireMojo.execute(AbstractSurefireMojo.java:832)
>   at 
> org.apache.maven.plugin.DefaultBuildPluginManager.executeMojo(DefaultBuildPluginManager.java:137)
>   at 
> org.apache.maven.lifecycle.internal.MojoExecutor.execute(MojoExecutor.java:208)
>   at 
> org.apache.maven.lifecycle.internal.MojoExecutor.execute(MojoExecutor.java:154)
>   at 
> org.apache.maven.lifecycle.internal.MojoExecutor.execute(MojoExecutor.java:146)
>   at 
> org.apache.maven.lifecycle.internal.LifecycleModuleBuilder.buildProject(LifecycleModuleBuilder.java:117)
>   at 
> org.apache.maven.lifecycle.internal.LifecycleModuleBuilder.buildProject(LifecycleModuleBuilder.java:81)
>   at 
> org.apache.maven.lifecycle.internal.builder.singlethreaded.SingleThreadedBuilder.build(SingleThreadedBuilder.java:56)
>   at 
> org.apache.maven.lifecycle.internal.LifecycleStarter.execute(LifecycleStarter.java:128)
>   at 

[jira] [Commented] (IGNITE-9169) Cache (Deadlock Detection) suite hangs

2018-08-16 Thread Vitaliy Biryukov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-9169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16582934#comment-16582934
 ] 

Vitaliy Biryukov commented on IGNITE-9169:
--

[~EdShangGG] , *TxOptimisticDeadlockDetectionTest* and 
*TxPessimisticDeadlockDetectionTest* start transactions and generate deadlock 
between them. Sometimes these tests hang because of unexpected PME during 
transactions. This PME caused by late affinity assignment after cache start. 
I've added waiting for assignment right after cache start to prevent this 
unexpected PME.

> Cache (Deadlock Detection) suite hangs
> --
>
> Key: IGNITE-9169
> URL: https://issues.apache.org/jira/browse/IGNITE-9169
> Project: Ignite
>  Issue Type: Bug
>Affects Versions: 2.6
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
> Fix For: 2.7
>
> Attachments: threads_report.txt
>
>
> {noformat}
> "main" #1 prio=5 os_prio=0 tid=0x7f289800e000 nid=0x67bb in Object.wait() 
> [0x7f289f9dc000]
>java.lang.Thread.State: WAITING (on object monitor)
>   at java.lang.Object.wait(Native Method)
>   at java.lang.Thread.join(Thread.java:1252)
>   - locked <0x849c82b0> (a 
> org.apache.ignite.testframework.GridTestSafeThreadFactory$1)
>   at 
> org.apache.ignite.internal.util.IgniteUtils.join(IgniteUtils.java:4621)
>   at 
> org.apache.ignite.internal.util.IgniteUtils.join(IgniteUtils.java:4606)
>   at 
> org.apache.ignite.internal.util.IgniteUtils.joinThreads(IgniteUtils.java:4650)
>   at 
> org.apache.ignite.testframework.GridTestSafeThreadFactory.stopAllThreads(GridTestSafeThreadFactory.java:206)
>   at 
> org.apache.ignite.testframework.GridTestUtils.stopThreads(GridTestUtils.java:985)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest.tearDown(GridAbstractTest.java:1705)
>   at 
> org.apache.ignite.testframework.junits.common.GridCommonAbstractTest.tearDown(GridCommonAbstractTest.java:497)
>   at junit.framework.TestCase.runBare(TestCase.java:146)
>   at junit.framework.TestResult$1.protect(TestResult.java:122)
>   at junit.framework.TestResult.runProtected(TestResult.java:142)
>   at junit.framework.TestResult.run(TestResult.java:125)
>   at junit.framework.TestCase.run(TestCase.java:129)
>   at junit.framework.TestSuite.runTest(TestSuite.java:255)
>   at junit.framework.TestSuite.run(TestSuite.java:250)
>   at junit.framework.TestSuite.runTest(TestSuite.java:255)
>   at junit.framework.TestSuite.run(TestSuite.java:250)
>   at 
> org.junit.internal.runners.JUnit38ClassRunner.run(JUnit38ClassRunner.java:84)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.execute(JUnit4Provider.java:369)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.executeWithRerun(JUnit4Provider.java:275)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:239)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:160)
>   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>   at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>   at java.lang.reflect.Method.invoke(Method.java:498)
>   at 
> org.apache.maven.surefire.util.ReflectionUtils.invokeMethodWithArray2(ReflectionUtils.java:206)
>   at 
> org.apache.maven.surefire.booter.ProviderFactory$ProviderProxy.invoke(ProviderFactory.java:160)
>   at 
> org.apache.maven.surefire.booter.ProviderFactory.invokeProvider(ProviderFactory.java:83)
>   at 
> org.apache.maven.plugin.surefire.InPluginVMSurefireStarter.runSuitesInProcess(InPluginVMSurefireStarter.java:84)
>   at 
> org.apache.maven.plugin.surefire.AbstractSurefireMojo.executeProvider(AbstractSurefireMojo.java:1107)
>   at 
> org.apache.maven.plugin.surefire.AbstractSurefireMojo.executeAfterPreconditionsChecked(AbstractSurefireMojo.java:954)
>   at 
> org.apache.maven.plugin.surefire.AbstractSurefireMojo.execute(AbstractSurefireMojo.java:832)
>   at 
> org.apache.maven.plugin.DefaultBuildPluginManager.executeMojo(DefaultBuildPluginManager.java:137)
>   at 
> org.apache.maven.lifecycle.internal.MojoExecutor.execute(MojoExecutor.java:208)
>   at 
> org.apache.maven.lifecycle.internal.MojoExecutor.execute(MojoExecutor.java:154)
>   at 
> org.apache.maven.lifecycle.internal.MojoExecutor.execute(MojoExecutor.java:146)
>   at 
> org.apache.maven.lifecycle.internal.LifecycleModuleBuilder.buildProject(LifecycleModuleBuilder.java:117)
>   at 
> 

[jira] [Commented] (IGNITE-9169) Cache (Deadlock Detection) suite hangs

2018-08-16 Thread Vitaliy Biryukov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-9169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16582771#comment-16582771
 ] 

Vitaliy Biryukov commented on IGNITE-9169:
--

[~EdShangGG], please take a look.

> Cache (Deadlock Detection) suite hangs
> --
>
> Key: IGNITE-9169
> URL: https://issues.apache.org/jira/browse/IGNITE-9169
> Project: Ignite
>  Issue Type: Bug
>Affects Versions: 2.6
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
> Fix For: 2.7
>
> Attachments: threads_report.txt
>
>
> {noformat}
> "main" #1 prio=5 os_prio=0 tid=0x7f289800e000 nid=0x67bb in Object.wait() 
> [0x7f289f9dc000]
>java.lang.Thread.State: WAITING (on object monitor)
>   at java.lang.Object.wait(Native Method)
>   at java.lang.Thread.join(Thread.java:1252)
>   - locked <0x849c82b0> (a 
> org.apache.ignite.testframework.GridTestSafeThreadFactory$1)
>   at 
> org.apache.ignite.internal.util.IgniteUtils.join(IgniteUtils.java:4621)
>   at 
> org.apache.ignite.internal.util.IgniteUtils.join(IgniteUtils.java:4606)
>   at 
> org.apache.ignite.internal.util.IgniteUtils.joinThreads(IgniteUtils.java:4650)
>   at 
> org.apache.ignite.testframework.GridTestSafeThreadFactory.stopAllThreads(GridTestSafeThreadFactory.java:206)
>   at 
> org.apache.ignite.testframework.GridTestUtils.stopThreads(GridTestUtils.java:985)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest.tearDown(GridAbstractTest.java:1705)
>   at 
> org.apache.ignite.testframework.junits.common.GridCommonAbstractTest.tearDown(GridCommonAbstractTest.java:497)
>   at junit.framework.TestCase.runBare(TestCase.java:146)
>   at junit.framework.TestResult$1.protect(TestResult.java:122)
>   at junit.framework.TestResult.runProtected(TestResult.java:142)
>   at junit.framework.TestResult.run(TestResult.java:125)
>   at junit.framework.TestCase.run(TestCase.java:129)
>   at junit.framework.TestSuite.runTest(TestSuite.java:255)
>   at junit.framework.TestSuite.run(TestSuite.java:250)
>   at junit.framework.TestSuite.runTest(TestSuite.java:255)
>   at junit.framework.TestSuite.run(TestSuite.java:250)
>   at 
> org.junit.internal.runners.JUnit38ClassRunner.run(JUnit38ClassRunner.java:84)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.execute(JUnit4Provider.java:369)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.executeWithRerun(JUnit4Provider.java:275)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:239)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:160)
>   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>   at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>   at java.lang.reflect.Method.invoke(Method.java:498)
>   at 
> org.apache.maven.surefire.util.ReflectionUtils.invokeMethodWithArray2(ReflectionUtils.java:206)
>   at 
> org.apache.maven.surefire.booter.ProviderFactory$ProviderProxy.invoke(ProviderFactory.java:160)
>   at 
> org.apache.maven.surefire.booter.ProviderFactory.invokeProvider(ProviderFactory.java:83)
>   at 
> org.apache.maven.plugin.surefire.InPluginVMSurefireStarter.runSuitesInProcess(InPluginVMSurefireStarter.java:84)
>   at 
> org.apache.maven.plugin.surefire.AbstractSurefireMojo.executeProvider(AbstractSurefireMojo.java:1107)
>   at 
> org.apache.maven.plugin.surefire.AbstractSurefireMojo.executeAfterPreconditionsChecked(AbstractSurefireMojo.java:954)
>   at 
> org.apache.maven.plugin.surefire.AbstractSurefireMojo.execute(AbstractSurefireMojo.java:832)
>   at 
> org.apache.maven.plugin.DefaultBuildPluginManager.executeMojo(DefaultBuildPluginManager.java:137)
>   at 
> org.apache.maven.lifecycle.internal.MojoExecutor.execute(MojoExecutor.java:208)
>   at 
> org.apache.maven.lifecycle.internal.MojoExecutor.execute(MojoExecutor.java:154)
>   at 
> org.apache.maven.lifecycle.internal.MojoExecutor.execute(MojoExecutor.java:146)
>   at 
> org.apache.maven.lifecycle.internal.LifecycleModuleBuilder.buildProject(LifecycleModuleBuilder.java:117)
>   at 
> org.apache.maven.lifecycle.internal.LifecycleModuleBuilder.buildProject(LifecycleModuleBuilder.java:81)
>   at 
> org.apache.maven.lifecycle.internal.builder.singlethreaded.SingleThreadedBuilder.build(SingleThreadedBuilder.java:56)
>   at 
> org.apache.maven.lifecycle.internal.LifecycleStarter.execute(LifecycleStarter.java:128)
>   at 

[jira] [Assigned] (IGNITE-9267) Deadlock between unsuccessful client reconnecting and stopping.

2018-08-15 Thread Vitaliy Biryukov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-9267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov reassigned IGNITE-9267:


Assignee: Vitaliy Biryukov

> Deadlock between unsuccessful client reconnecting and stopping.
> ---
>
> Key: IGNITE-9267
> URL: https://issues.apache.org/jira/browse/IGNITE-9267
> Project: Ignite
>  Issue Type: Bug
>Affects Versions: 2.6
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
> Fix For: 2.7
>
>
> *T1* start reconnecting and acquire *stateMux* 
> (ZookeeperDiscoveryImpl#processLocalJoin) -> *T2* start stopping and acquire 
> *IgnitionEx.this* (IgnitionEx#stop0) -> *T2* stopping ZookeeperDiscoveryImpl 
> and wait for *stateMux* -> *T1* fails on reconnecting and start to stopping 
> node and wait for *IgnitionEx.this*.
> Reconnecting thread:
> {noformat}
> "zk-internal.IgniteClientReconnectCacheTest3-EventThread" #593633 daemon 
> prio=5 os_prio=0 tid=0x7ff8e4063800 nid=0x478e waiting for monitor entry 
> [0x7ff90f2f]
>java.lang.Thread.State: BLOCKED (on object monitor)
>   at 
> org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.stop0(IgnitionEx.java:2565)
>   - waiting to lock <0xe9429280> (a 
> org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance)
>   at 
> org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.stop(IgnitionEx.java:2557)
>   at org.apache.ignite.internal.IgnitionEx.stop(IgnitionEx.java:374)
>   at org.apache.ignite.Ignition.stop(Ignition.java:229)
>   at org.apache.ignite.internal.IgniteKernal.close(IgniteKernal.java:3417)
>   at 
> org.apache.ignite.internal.IgniteKernal.onReconnected(IgniteKernal.java:3904)
>   at 
> org.apache.ignite.internal.managers.discovery.GridDiscoveryManager$4.onDiscovery0(GridDiscoveryManager.java:831)
>   at 
> org.apache.ignite.internal.managers.discovery.GridDiscoveryManager$4.onDiscovery(GridDiscoveryManager.java:590)
>   - locked <0xe9429468> (a java.lang.Object)
>   at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.processLocalJoin(ZookeeperDiscoveryImpl.java:2960)
>   - locked <0xe9429478> (a java.lang.Object)
>   at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.processBulkJoin(ZookeeperDiscoveryImpl.java:2760)
>   at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.processNewEvents(ZookeeperDiscoveryImpl.java:2623)
>   at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.processNewEvents(ZookeeperDiscoveryImpl.java:2598)
>   at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.access$2000(ZookeeperDiscoveryImpl.java:108)
>   at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl$ZkWatcher.processResult(ZookeeperDiscoveryImpl.java:4108)
>   at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperClient$DataCallbackWrapper.processResult(ZookeeperClient.java:1219)
>   at 
> org.apache.zookeeper.ClientCnxn$EventThread.processEvent(ClientCnxn.java:561)
>   at org.apache.zookeeper.ClientCnxn$EventThread.run(ClientCnxn.java:498)
> {noformat}
> Stopping thread:
> {noformat}
> "main" #1 prio=5 os_prio=0 tid=0x7ffba000e000 nid=0x6aa3 waiting for 
> monitor entry [0x7ffba8c83000]
>java.lang.Thread.State: BLOCKED (on object monitor)
>   at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.stop0(ZookeeperDiscoveryImpl.java:3838)
>   - waiting to lock <0xe9429478> (a java.lang.Object)
>   at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.stop(ZookeeperDiscoveryImpl.java:3813)
>   at 
> org.apache.ignite.spi.discovery.zk.ZookeeperDiscoverySpi.spiStop(ZookeeperDiscoverySpi.java:501)
>   at 
> org.apache.ignite.internal.managers.GridManagerAdapter.stopSpi(GridManagerAdapter.java:330)
>   at 
> org.apache.ignite.internal.managers.discovery.GridDiscoveryManager.stop(GridDiscoveryManager.java:1683)
>   at org.apache.ignite.internal.IgniteKernal.stop0(IgniteKernal.java:2206)
>   at org.apache.ignite.internal.IgniteKernal.stop(IgniteKernal.java:2081)
>   at 
> org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.stop0(IgnitionEx.java:2594)
>   - locked <0xe9429280> (a 
> org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance)
>   at 
> org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.stop(IgnitionEx.java:2557)
>   at org.apache.ignite.internal.IgnitionEx.stop(IgnitionEx.java:374)
>   at org.apache.ignite.Ignition.stop(Ignition.java:229)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest.stopGrid(GridAbstractTest.java:1153)
>   at 
> 

[jira] [Updated] (IGNITE-9267) Deadlock between unsuccessful client reconnecting and stopping.

2018-08-14 Thread Vitaliy Biryukov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-9267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov updated IGNITE-9267:
-
Description: 
*T1* start reconnecting and acquire *stateMux* 
(ZookeeperDiscoveryImpl#processLocalJoin) -> *T2* start stopping and acquire 
*IgnitionEx.this* (IgnitionEx#stop0) -> *T2* stopping ZookeeperDiscoveryImpl 
and wait for *stateMux* -> *T1* fails on reconnecting and start to stopping 
node and wait for *IgnitionEx.this*.

Reconnecting thread:
{noformat}
"zk-internal.IgniteClientReconnectCacheTest3-EventThread" #593633 daemon prio=5 
os_prio=0 tid=0x7ff8e4063800 nid=0x478e waiting for monitor entry 
[0x7ff90f2f]
   java.lang.Thread.State: BLOCKED (on object monitor)
at 
org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.stop0(IgnitionEx.java:2565)
- waiting to lock <0xe9429280> (a 
org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance)
at 
org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.stop(IgnitionEx.java:2557)
at org.apache.ignite.internal.IgnitionEx.stop(IgnitionEx.java:374)
at org.apache.ignite.Ignition.stop(Ignition.java:229)
at org.apache.ignite.internal.IgniteKernal.close(IgniteKernal.java:3417)
at 
org.apache.ignite.internal.IgniteKernal.onReconnected(IgniteKernal.java:3904)
at 
org.apache.ignite.internal.managers.discovery.GridDiscoveryManager$4.onDiscovery0(GridDiscoveryManager.java:831)
at 
org.apache.ignite.internal.managers.discovery.GridDiscoveryManager$4.onDiscovery(GridDiscoveryManager.java:590)
- locked <0xe9429468> (a java.lang.Object)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.processLocalJoin(ZookeeperDiscoveryImpl.java:2960)
- locked <0xe9429478> (a java.lang.Object)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.processBulkJoin(ZookeeperDiscoveryImpl.java:2760)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.processNewEvents(ZookeeperDiscoveryImpl.java:2623)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.processNewEvents(ZookeeperDiscoveryImpl.java:2598)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.access$2000(ZookeeperDiscoveryImpl.java:108)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl$ZkWatcher.processResult(ZookeeperDiscoveryImpl.java:4108)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperClient$DataCallbackWrapper.processResult(ZookeeperClient.java:1219)
at 
org.apache.zookeeper.ClientCnxn$EventThread.processEvent(ClientCnxn.java:561)
at org.apache.zookeeper.ClientCnxn$EventThread.run(ClientCnxn.java:498)
{noformat}
Stopping thread:
{noformat}
"main" #1 prio=5 os_prio=0 tid=0x7ffba000e000 nid=0x6aa3 waiting for 
monitor entry [0x7ffba8c83000]
   java.lang.Thread.State: BLOCKED (on object monitor)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.stop0(ZookeeperDiscoveryImpl.java:3838)
- waiting to lock <0xe9429478> (a java.lang.Object)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.stop(ZookeeperDiscoveryImpl.java:3813)
at 
org.apache.ignite.spi.discovery.zk.ZookeeperDiscoverySpi.spiStop(ZookeeperDiscoverySpi.java:501)
at 
org.apache.ignite.internal.managers.GridManagerAdapter.stopSpi(GridManagerAdapter.java:330)
at 
org.apache.ignite.internal.managers.discovery.GridDiscoveryManager.stop(GridDiscoveryManager.java:1683)
at org.apache.ignite.internal.IgniteKernal.stop0(IgniteKernal.java:2206)
at org.apache.ignite.internal.IgniteKernal.stop(IgniteKernal.java:2081)
at 
org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.stop0(IgnitionEx.java:2594)
- locked <0xe9429280> (a 
org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance)
at 
org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.stop(IgnitionEx.java:2557)
at org.apache.ignite.internal.IgnitionEx.stop(IgnitionEx.java:374)
at org.apache.ignite.Ignition.stop(Ignition.java:229)
at 
org.apache.ignite.testframework.junits.GridAbstractTest.stopGrid(GridAbstractTest.java:1153)
at 
org.apache.ignite.testframework.junits.GridAbstractTest.stopAllGrids(GridAbstractTest.java:1193)
at 
org.apache.ignite.testframework.junits.GridAbstractTest.stopAllGrids(GridAbstractTest.java:1174)
at 
org.apache.ignite.internal.IgniteClientReconnectCacheTest.afterTest(IgniteClientReconnectCacheTest.java:151)
at 
org.apache.ignite.testframework.junits.GridAbstractTest.tearDown(GridAbstractTest.java:1763)
at 
org.apache.ignite.testframework.junits.common.GridCommonAbstractTest.tearDown(GridCommonAbstractTest.java:503)
{noformat}

  was:

[jira] [Created] (IGNITE-9267) Deadlock between unsuccessful client reconnecting and stopping.

2018-08-14 Thread Vitaliy Biryukov (JIRA)
Vitaliy Biryukov created IGNITE-9267:


 Summary: Deadlock between unsuccessful client reconnecting and 
stopping.
 Key: IGNITE-9267
 URL: https://issues.apache.org/jira/browse/IGNITE-9267
 Project: Ignite
  Issue Type: Bug
Affects Versions: 2.6
Reporter: Vitaliy Biryukov
 Fix For: 2.7


Reconnecting thread:
{noformat}
"zk-internal.IgniteClientReconnectCacheTest3-EventThread" #593633 daemon prio=5 
os_prio=0 tid=0x7ff8e4063800 nid=0x478e waiting for monitor entry 
[0x7ff90f2f]
   java.lang.Thread.State: BLOCKED (on object monitor)
at 
org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.stop0(IgnitionEx.java:2565)
- waiting to lock <0xe9429280> (a 
org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance)
at 
org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.stop(IgnitionEx.java:2557)
at org.apache.ignite.internal.IgnitionEx.stop(IgnitionEx.java:374)
at org.apache.ignite.Ignition.stop(Ignition.java:229)
at org.apache.ignite.internal.IgniteKernal.close(IgniteKernal.java:3417)
at 
org.apache.ignite.internal.IgniteKernal.onReconnected(IgniteKernal.java:3904)
at 
org.apache.ignite.internal.managers.discovery.GridDiscoveryManager$4.onDiscovery0(GridDiscoveryManager.java:831)
at 
org.apache.ignite.internal.managers.discovery.GridDiscoveryManager$4.onDiscovery(GridDiscoveryManager.java:590)
- locked <0xe9429468> (a java.lang.Object)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.processLocalJoin(ZookeeperDiscoveryImpl.java:2960)
- locked <0xe9429478> (a java.lang.Object)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.processBulkJoin(ZookeeperDiscoveryImpl.java:2760)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.processNewEvents(ZookeeperDiscoveryImpl.java:2623)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.processNewEvents(ZookeeperDiscoveryImpl.java:2598)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.access$2000(ZookeeperDiscoveryImpl.java:108)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl$ZkWatcher.processResult(ZookeeperDiscoveryImpl.java:4108)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperClient$DataCallbackWrapper.processResult(ZookeeperClient.java:1219)
at 
org.apache.zookeeper.ClientCnxn$EventThread.processEvent(ClientCnxn.java:561)
at org.apache.zookeeper.ClientCnxn$EventThread.run(ClientCnxn.java:498)
{noformat}

Stopping thread:

{noformat}
"main" #1 prio=5 os_prio=0 tid=0x7ffba000e000 nid=0x6aa3 waiting for 
monitor entry [0x7ffba8c83000]
   java.lang.Thread.State: BLOCKED (on object monitor)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.stop0(ZookeeperDiscoveryImpl.java:3838)
- waiting to lock <0xe9429478> (a java.lang.Object)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.stop(ZookeeperDiscoveryImpl.java:3813)
at 
org.apache.ignite.spi.discovery.zk.ZookeeperDiscoverySpi.spiStop(ZookeeperDiscoverySpi.java:501)
at 
org.apache.ignite.internal.managers.GridManagerAdapter.stopSpi(GridManagerAdapter.java:330)
at 
org.apache.ignite.internal.managers.discovery.GridDiscoveryManager.stop(GridDiscoveryManager.java:1683)
at org.apache.ignite.internal.IgniteKernal.stop0(IgniteKernal.java:2206)
at org.apache.ignite.internal.IgniteKernal.stop(IgniteKernal.java:2081)
at 
org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.stop0(IgnitionEx.java:2594)
- locked <0xe9429280> (a 
org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance)
at 
org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.stop(IgnitionEx.java:2557)
at org.apache.ignite.internal.IgnitionEx.stop(IgnitionEx.java:374)
at org.apache.ignite.Ignition.stop(Ignition.java:229)
at 
org.apache.ignite.testframework.junits.GridAbstractTest.stopGrid(GridAbstractTest.java:1153)
at 
org.apache.ignite.testframework.junits.GridAbstractTest.stopAllGrids(GridAbstractTest.java:1193)
at 
org.apache.ignite.testframework.junits.GridAbstractTest.stopAllGrids(GridAbstractTest.java:1174)
at 
org.apache.ignite.internal.IgniteClientReconnectCacheTest.afterTest(IgniteClientReconnectCacheTest.java:151)
at 
org.apache.ignite.testframework.junits.GridAbstractTest.tearDown(GridAbstractTest.java:1763)
at 
org.apache.ignite.testframework.junits.common.GridCommonAbstractTest.tearDown(GridCommonAbstractTest.java:503)
{noformat}




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (IGNITE-9243) Avoid test suit hangs on stopAllGrids

2018-08-09 Thread Vitaliy Biryukov (JIRA)
Vitaliy Biryukov created IGNITE-9243:


 Summary: Avoid test suit hangs on stopAllGrids
 Key: IGNITE-9243
 URL: https://issues.apache.org/jira/browse/IGNITE-9243
 Project: Ignite
  Issue Type: Improvement
Affects Versions: 2.6
Reporter: Vitaliy Biryukov
Assignee: Vitaliy Biryukov
 Fix For: 2.7


Tests sometimes hang on node join to topology, and this leads to the hang of 
the whole suite.

Solution: Do not wait for nodes to join a topology in *stopAllGrids* method.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (IGNITE-1905) High contention for CacheLockImpl causes AssertionErrors

2018-08-09 Thread Vitaliy Biryukov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-1905?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16574717#comment-16574717
 ] 

Vitaliy Biryukov commented on IGNITE-1905:
--

[~agoncharuk], we can't use future as the synch monitor, because it'll cause 
deadlocks.
As far as I understand *map* phase is always before *onResult* method is 
called. Just need to make *timeoutObj* field volatile.

> High contention for CacheLockImpl causes AssertionErrors
> 
>
> Key: IGNITE-1905
> URL: https://issues.apache.org/jira/browse/IGNITE-1905
> Project: Ignite
>  Issue Type: Bug
>  Components: cache
>Affects Versions: ignite-1.4
> Environment: Windows 7
>Reporter: Denis Magda
>Assignee: Vitaliy Biryukov
>Priority: Major
> Attachments: ClientTest.java, ServerTest.java
>
>
> When multiple threads, running on the same client node, compete for 
> CacheLockImpl this leads to AssertionErrors.
> Pseudo-code snippet, that is called from multiple threads and causes the 
> assertions, looks like this:
> {noformat}
> boolean locked = lock.tryLock(100, TimeUnit.MILLISECONDS);
> if (locked)
> lock.unlock();
> {noformat}
> Initially the issue was detected on ignite-1.4.
> In server's node logs the following assertion appears
> {noformat}
> java.lang.AssertionError
>   at 
> org.apache.ignite.internal.processors.cache.distributed.dht.GridDhtLockFuture.addOwned(GridDhtLockFuture.java:958)
>   at 
> org.apache.ignite.internal.processors.cache.distributed.dht.GridDhtLockFuture.map(GridDhtLockFuture.java:918)
>   at 
> org.apache.ignite.internal.processors.cache.distributed.dht.GridDhtLockFuture.onOwnerChanged(GridDhtLockFuture.java:663)
>   at 
> org.apache.ignite.internal.processors.cache.GridCacheMvccManager$2.onOwnerChanged(GridCacheMvccManager.java:155)
>   at 
> org.apache.ignite.internal.processors.cache.distributed.GridDistributedCacheEntry.checkOwnerChanged(GridDistributedCacheEntry.java:810)
>   at 
> org.apache.ignite.internal.processors.cache.distributed.GridDistributedCacheEntry.readyLock(GridDistributedCacheEntry.java:516)
>   at 
> org.apache.ignite.internal.processors.cache.distributed.dht.GridDhtLockFuture.readyLocks(GridDhtLockFuture.java:576)
>   at 
> org.apache.ignite.internal.processors.cache.distributed.dht.GridDhtLockFuture.map(GridDhtLockFuture.java:764)
>   at 
> org.apache.ignite.internal.processors.cache.distributed.dht.GridDhtTransactionalCacheAdapter.lockAllAsync(GridDhtTransactionalCacheAdapter.java:973)
>   at 
> org.apache.ignite.internal.processors.cache.distributed.dht.GridDhtTransactionalCacheAdapter.processNearLockRequest(GridDhtTransactionalCacheAdapter.java:557)
>   at 
> org.apache.ignite.internal.processors.cache.distributed.dht.GridDhtTransactionalCacheAdapter.access$000(GridDhtTransactionalCacheAdapter.java:88)
>   at 
> org.apache.ignite.internal.processors.cache.distributed.dht.GridDhtTransactionalCacheAdapter$2.apply(GridDhtTransactionalCacheAdapter.java:132)
>   at 
> org.apache.ignite.internal.processors.cache.distributed.dht.GridDhtTransactionalCacheAdapter$2.apply(GridDhtTransactionalCacheAdapter.java:130)
>   at 
> org.apache.ignite.internal.processors.cache.GridCacheIoManager.processMessage(GridCacheIoManager.java:580)
>   at 
> org.apache.ignite.internal.processors.cache.GridCacheIoManager.onMessage0(GridCacheIoManager.java:280)
>   at 
> org.apache.ignite.internal.processors.cache.GridCacheIoManager.handleMessage(GridCacheIoManager.java:198)
>   at 
> org.apache.ignite.internal.processors.cache.GridCacheIoManager.access$000(GridCacheIoManager.java:77)
>   at 
> org.apache.ignite.internal.processors.cache.GridCacheIoManager$1.onMessage(GridCacheIoManager.java:160)
>   at 
> org.apache.ignite.internal.managers.communication.GridIoManager.processRegularMessage0(GridIoManager.java:811)
>   at 
> org.apache.ignite.internal.managers.communication.GridIoManager.access$1500(GridIoManager.java:106)
>   at 
> org.apache.ignite.internal.managers.communication.GridIoManager$5.run(GridIoManager.java:774)
>   at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>   at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>   at java.lang.Thread.run(Thread.java:745)
> {noformat}
>   
> In addition from time to time a client node also outputs a different kind of 
> assertion:
> {noformat}
> Exception in thread "ignite-#4%sys-null%" java.lang.AssertionError
>   at 
> org.apache.ignite.internal.processors.cache.GridCacheExplicitLockSpan.markOwned(GridCacheExplicitLockSpan.java:196)
>   at 
> 

[jira] [Comment Edited] (IGNITE-1905) High contention for CacheLockImpl causes AssertionErrors

2018-08-08 Thread Vitaliy Biryukov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-1905?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16573115#comment-16573115
 ] 

Vitaliy Biryukov edited comment on IGNITE-1905 at 8/8/18 12:08 PM:
---

Hi, [~Jokser] please take a look.
 2 AssertionErrors was fixed:
 # "java.lang.AssertionError: Entry does not own lock for tx" by adding
 "ctx.mvcc().addRemoved(ctx, ver);" to 
GridDhtTransactionalCacheAdapter#removeLocks
 # "at 
org.apache.ignite.internal.processors.cache.GridCacheExplicitLockSpan.markOwned(GridCacheExplicitLockSpan.java:196)"
 by synchronizing a race between lock timeout and lock response handlng


was (Author: vitaliyb):
Hi, [~Jokser] please take a look.
2 AssertionErrors was fixed:
# "java.lang.AssertionError: Entry does not own lock for tx" by adding 
"ctx.mvcc().addRemoved(ctx, ver);" to 
GridDhtTransactionalCacheAdapter#removeLocks
# at 
org.apache.ignite.internal.processors.cache.GridCacheExplicitLockSpan.markOwned(GridCacheExplicitLockSpan.java:196)
 by synchronizing a race between lock timeout and lock response handlng 

> High contention for CacheLockImpl causes AssertionErrors
> 
>
> Key: IGNITE-1905
> URL: https://issues.apache.org/jira/browse/IGNITE-1905
> Project: Ignite
>  Issue Type: Bug
>  Components: cache
>Affects Versions: ignite-1.4
> Environment: Windows 7
>Reporter: Denis Magda
>Assignee: Vitaliy Biryukov
>Priority: Major
> Attachments: ClientTest.java, ServerTest.java
>
>
> When multiple threads, running on the same client node, compete for 
> CacheLockImpl this leads to AssertionErrors.
> Pseudo-code snippet, that is called from multiple threads and causes the 
> assertions, looks like this:
> {noformat}
> boolean locked = lock.tryLock(100, TimeUnit.MILLISECONDS);
> if (locked)
> lock.unlock();
> {noformat}
> Initially the issue was detected on ignite-1.4.
> In server's node logs the following assertion appears
> {noformat}
> java.lang.AssertionError
>   at 
> org.apache.ignite.internal.processors.cache.distributed.dht.GridDhtLockFuture.addOwned(GridDhtLockFuture.java:958)
>   at 
> org.apache.ignite.internal.processors.cache.distributed.dht.GridDhtLockFuture.map(GridDhtLockFuture.java:918)
>   at 
> org.apache.ignite.internal.processors.cache.distributed.dht.GridDhtLockFuture.onOwnerChanged(GridDhtLockFuture.java:663)
>   at 
> org.apache.ignite.internal.processors.cache.GridCacheMvccManager$2.onOwnerChanged(GridCacheMvccManager.java:155)
>   at 
> org.apache.ignite.internal.processors.cache.distributed.GridDistributedCacheEntry.checkOwnerChanged(GridDistributedCacheEntry.java:810)
>   at 
> org.apache.ignite.internal.processors.cache.distributed.GridDistributedCacheEntry.readyLock(GridDistributedCacheEntry.java:516)
>   at 
> org.apache.ignite.internal.processors.cache.distributed.dht.GridDhtLockFuture.readyLocks(GridDhtLockFuture.java:576)
>   at 
> org.apache.ignite.internal.processors.cache.distributed.dht.GridDhtLockFuture.map(GridDhtLockFuture.java:764)
>   at 
> org.apache.ignite.internal.processors.cache.distributed.dht.GridDhtTransactionalCacheAdapter.lockAllAsync(GridDhtTransactionalCacheAdapter.java:973)
>   at 
> org.apache.ignite.internal.processors.cache.distributed.dht.GridDhtTransactionalCacheAdapter.processNearLockRequest(GridDhtTransactionalCacheAdapter.java:557)
>   at 
> org.apache.ignite.internal.processors.cache.distributed.dht.GridDhtTransactionalCacheAdapter.access$000(GridDhtTransactionalCacheAdapter.java:88)
>   at 
> org.apache.ignite.internal.processors.cache.distributed.dht.GridDhtTransactionalCacheAdapter$2.apply(GridDhtTransactionalCacheAdapter.java:132)
>   at 
> org.apache.ignite.internal.processors.cache.distributed.dht.GridDhtTransactionalCacheAdapter$2.apply(GridDhtTransactionalCacheAdapter.java:130)
>   at 
> org.apache.ignite.internal.processors.cache.GridCacheIoManager.processMessage(GridCacheIoManager.java:580)
>   at 
> org.apache.ignite.internal.processors.cache.GridCacheIoManager.onMessage0(GridCacheIoManager.java:280)
>   at 
> org.apache.ignite.internal.processors.cache.GridCacheIoManager.handleMessage(GridCacheIoManager.java:198)
>   at 
> org.apache.ignite.internal.processors.cache.GridCacheIoManager.access$000(GridCacheIoManager.java:77)
>   at 
> org.apache.ignite.internal.processors.cache.GridCacheIoManager$1.onMessage(GridCacheIoManager.java:160)
>   at 
> org.apache.ignite.internal.managers.communication.GridIoManager.processRegularMessage0(GridIoManager.java:811)
>   at 
> org.apache.ignite.internal.managers.communication.GridIoManager.access$1500(GridIoManager.java:106)
>   at 
> 

[jira] [Assigned] (IGNITE-1905) High contention for CacheLockImpl causes AssertionErrors

2018-08-06 Thread Vitaliy Biryukov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-1905?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov reassigned IGNITE-1905:


Assignee: Vitaliy Biryukov

> High contention for CacheLockImpl causes AssertionErrors
> 
>
> Key: IGNITE-1905
> URL: https://issues.apache.org/jira/browse/IGNITE-1905
> Project: Ignite
>  Issue Type: Bug
>  Components: cache
>Affects Versions: ignite-1.4
> Environment: Windows 7
>Reporter: Denis Magda
>Assignee: Vitaliy Biryukov
>Priority: Major
> Attachments: ClientTest.java, ServerTest.java
>
>
> When multiple threads, running on the same client node, compete for 
> CacheLockImpl this leads to AssertionErrors.
> Pseudo-code snippet, that is called from multiple threads and causes the 
> assertions, looks like this:
> {noformat}
> boolean locked = lock.tryLock(100, TimeUnit.MILLISECONDS);
> if (locked)
> lock.unlock();
> {noformat}
> Initially the issue was detected on ignite-1.4.
> In server's node logs the following assertion appears
> {noformat}
> java.lang.AssertionError
>   at 
> org.apache.ignite.internal.processors.cache.distributed.dht.GridDhtLockFuture.addOwned(GridDhtLockFuture.java:958)
>   at 
> org.apache.ignite.internal.processors.cache.distributed.dht.GridDhtLockFuture.map(GridDhtLockFuture.java:918)
>   at 
> org.apache.ignite.internal.processors.cache.distributed.dht.GridDhtLockFuture.onOwnerChanged(GridDhtLockFuture.java:663)
>   at 
> org.apache.ignite.internal.processors.cache.GridCacheMvccManager$2.onOwnerChanged(GridCacheMvccManager.java:155)
>   at 
> org.apache.ignite.internal.processors.cache.distributed.GridDistributedCacheEntry.checkOwnerChanged(GridDistributedCacheEntry.java:810)
>   at 
> org.apache.ignite.internal.processors.cache.distributed.GridDistributedCacheEntry.readyLock(GridDistributedCacheEntry.java:516)
>   at 
> org.apache.ignite.internal.processors.cache.distributed.dht.GridDhtLockFuture.readyLocks(GridDhtLockFuture.java:576)
>   at 
> org.apache.ignite.internal.processors.cache.distributed.dht.GridDhtLockFuture.map(GridDhtLockFuture.java:764)
>   at 
> org.apache.ignite.internal.processors.cache.distributed.dht.GridDhtTransactionalCacheAdapter.lockAllAsync(GridDhtTransactionalCacheAdapter.java:973)
>   at 
> org.apache.ignite.internal.processors.cache.distributed.dht.GridDhtTransactionalCacheAdapter.processNearLockRequest(GridDhtTransactionalCacheAdapter.java:557)
>   at 
> org.apache.ignite.internal.processors.cache.distributed.dht.GridDhtTransactionalCacheAdapter.access$000(GridDhtTransactionalCacheAdapter.java:88)
>   at 
> org.apache.ignite.internal.processors.cache.distributed.dht.GridDhtTransactionalCacheAdapter$2.apply(GridDhtTransactionalCacheAdapter.java:132)
>   at 
> org.apache.ignite.internal.processors.cache.distributed.dht.GridDhtTransactionalCacheAdapter$2.apply(GridDhtTransactionalCacheAdapter.java:130)
>   at 
> org.apache.ignite.internal.processors.cache.GridCacheIoManager.processMessage(GridCacheIoManager.java:580)
>   at 
> org.apache.ignite.internal.processors.cache.GridCacheIoManager.onMessage0(GridCacheIoManager.java:280)
>   at 
> org.apache.ignite.internal.processors.cache.GridCacheIoManager.handleMessage(GridCacheIoManager.java:198)
>   at 
> org.apache.ignite.internal.processors.cache.GridCacheIoManager.access$000(GridCacheIoManager.java:77)
>   at 
> org.apache.ignite.internal.processors.cache.GridCacheIoManager$1.onMessage(GridCacheIoManager.java:160)
>   at 
> org.apache.ignite.internal.managers.communication.GridIoManager.processRegularMessage0(GridIoManager.java:811)
>   at 
> org.apache.ignite.internal.managers.communication.GridIoManager.access$1500(GridIoManager.java:106)
>   at 
> org.apache.ignite.internal.managers.communication.GridIoManager$5.run(GridIoManager.java:774)
>   at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>   at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>   at java.lang.Thread.run(Thread.java:745)
> {noformat}
>   
> In addition from time to time a client node also outputs a different kind of 
> assertion:
> {noformat}
> Exception in thread "ignite-#4%sys-null%" java.lang.AssertionError
>   at 
> org.apache.ignite.internal.processors.cache.GridCacheExplicitLockSpan.markOwned(GridCacheExplicitLockSpan.java:196)
>   at 
> org.apache.ignite.internal.processors.cache.GridCacheMvccManager.markExplicitOwner(GridCacheMvccManager.java:862)
>   at 
> org.apache.ignite.internal.processors.cache.distributed.dht.colocated.GridDhtColocatedLockFuture$MiniFuture.onResult(GridDhtColocatedLockFuture.java:1412)
>   at 
> 

[jira] [Commented] (IGNITE-9159) Basic 2 TC and Cache 2 configurations are halted by failure handler

2018-08-03 Thread Vitaliy Biryukov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-9159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568147#comment-16568147
 ] 

Vitaliy Biryukov commented on IGNITE-9159:
--

Looks good to me.

> Basic 2 TC and Cache 2 configurations are halted by failure handler
> ---
>
> Key: IGNITE-9159
> URL: https://issues.apache.org/jira/browse/IGNITE-9159
> Project: Ignite
>  Issue Type: Bug
>Reporter: Eduard Shangareev
>Assignee: Eduard Shangareev
>Priority: Major
> Fix For: 2.7
>
>
> https://ci.ignite.apache.org/viewLog.html?tab=buildLog=tree=debug=all=1569350&_focus=7955
> https://ci.ignite.apache.org/viewLog.html?buildId=1575803=buildResultsDiv=IgniteTests24Java8_Cache2
> {code}
> [01:19:21][org.apache.ignite:ignite-core] [2018-07-13 
> 22:19:21,713][ERROR][exchange-worker-#3043%messaging.IgniteMessagingConfigVariationFullApiTest3%][IgniteTestResources]
>  JVM will be halted immediately due to the failure: 
> [failureCtx=FailureContext [type=SYSTEM_WORKER_TERMINATION, err=class 
> o.a.i.IgniteCheckedException: Failed to send message (node may have left the 
> grid or TCP connection cannot be established due to firewall issues) 
> [node=TcpDiscoveryNode [id=b8f7b201-ab85-4a14-bcf0-2b397532, 
> addrs=[127.0.0.1], sockAddrs=[/127.0.0.1:47502], discPort=47502, order=3, 
> intOrder=3, lastExchangeTime=1531520358330, loc=false, 
> ver=2.7.0#20180713-sha1:6a8a2ff8, isClient=false], topic=TOPIC_CACHE, 
> msg=GridDhtPartitionsSingleMessage [parts=null, partCntrs=null, 
> partsSizes=null, partHistCntrs=null, err=null, client=true, compress=true, 
> finishMsg=null, super=GridDhtPartitionsAbstractMessage 
> [exchId=GridDhtPartitionExchangeId [topVer=AffinityTopologyVersion 
> [topVer=10, minorTopVer=0], discoEvt=DiscoveryEvent [evtNode=TcpDiscoveryNode 
> [id=0fe68fcc-eca6-4f40-9f16-c6181fa0, addrs=[127.0.0.1], 
> sockAddrs=[/127.0.0.1:47500], discPort=47500, order=1, intOrder=1, 
> lastExchangeTime=1531520358330, loc=false, ver=2.7.0#20180713-sha1:6a8a2ff8, 
> isClient=false], topVer=10, nodeId8=7b265a8e, msg=Node left: TcpDiscoveryNode 
> [id=0fe68fcc-eca6-4f40-9f16-c6181fa0, addrs=[127.0.0.1], 
> sockAddrs=[/127.0.0.1:47500], discPort=47500, order=1, intOrder=1, 
> lastExchangeTime=1531520358330, loc=false, ver=2.7.0#20180713-sha1:6a8a2ff8, 
> isClient=false], type=NODE_LEFT, tstamp=1531520361620], nodeId=0fe68fcc, 
> evt=NODE_LEFT], lastVer=GridCacheVersion [topVer=0, order=1531520357089, 
> nodeOrder=0], super=GridCacheMessage [msgId=238, depInfo=null, err=null, 
> skipPrepare=false]]], policy=2]]]
> [01:19:22][org.apache.ignite:ignite-core] Process exited with code 130
> {code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (IGNITE-9134) ZookeeperDiscoverySpiTest#testLargeUserAttribute3 fails with OOME

2018-08-03 Thread Vitaliy Biryukov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-9134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568026#comment-16568026
 ] 

Vitaliy Biryukov commented on IGNITE-9134:
--

Hi [~dpavlov], please take a look.

> ZookeeperDiscoverySpiTest#testLargeUserAttribute3 fails with OOME
> -
>
> Key: IGNITE-9134
> URL: https://issues.apache.org/jira/browse/IGNITE-9134
> Project: Ignite
>  Issue Type: Bug
>Affects Versions: 2.6
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
> Fix For: 2.7
>
>
> {noformat}
> java.lang.OutOfMemoryError: Java heap space
>   at java.lang.reflect.Array.newInstance(Array.java:75)
>   at java.io.ObjectInputStream.readArray(ObjectInputStream.java:1671)
>   at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1345)
>   at java.io.ObjectInputStream.readObject(ObjectInputStream.java:371)
>   at 
> org.apache.ignite.internal.util.IgniteUtils.readMap(IgniteUtils.java:5179)
>   at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperClusterNode.readExternal(ZookeeperClusterNode.java:365)
>   at 
> java.io.ObjectInputStream.readExternalData(ObjectInputStream.java:1842)
>   at 
> java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1799)
>   at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1351)
>   at 
> java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:2000)
>   at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1924)
>   at 
> java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1801)
>   at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1351)
>   at java.io.ObjectInputStream.readObject(ObjectInputStream.java:371)
>   at 
> org.apache.ignite.marshaller.jdk.JdkMarshaller.unmarshal0(JdkMarshaller.java:139)
>   at 
> org.apache.ignite.marshaller.AbstractNodeNameAwareMarshaller.unmarshal(AbstractNodeNameAwareMarshaller.java:94)
>   at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.unmarshalZip(ZookeeperDiscoveryImpl.java:3925)
>   at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.unmarshalJoinData(ZookeeperDiscoveryImpl.java:1824)
>   at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.processBulkJoin(ZookeeperDiscoveryImpl.java:2779)
>   at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.processNewEvents(ZookeeperDiscoveryImpl.java:2628)
>   at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.processNewEvents(ZookeeperDiscoveryImpl.java:2603)
>   at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.access$2000(ZookeeperDiscoveryImpl.java:108)
>   at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl$ZkWatcher.processResult(ZookeeperDiscoveryImpl.java:4113)
>   at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperClient$DataCallbackWrapper.processResult(ZookeeperClient.java:1219)
>   at 
> org.apache.zookeeper.ClientCnxn$EventThread.processEvent(ClientCnxn.java:561)
>   at org.apache.zookeeper.ClientCnxn$EventThread.run(ClientCnxn.java:498)
> {noformat}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (IGNITE-9169) Cache (Deadlock Detection) suite hangs

2018-08-02 Thread Vitaliy Biryukov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-9169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov updated IGNITE-9169:
-
Attachment: threads_report.txt

> Cache (Deadlock Detection) suite hangs
> --
>
> Key: IGNITE-9169
> URL: https://issues.apache.org/jira/browse/IGNITE-9169
> Project: Ignite
>  Issue Type: Bug
>Affects Versions: 2.6
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
> Fix For: 2.7
>
> Attachments: threads_report.txt
>
>
> {noformat}
> "main" #1 prio=5 os_prio=0 tid=0x7f289800e000 nid=0x67bb in Object.wait() 
> [0x7f289f9dc000]
>java.lang.Thread.State: WAITING (on object monitor)
>   at java.lang.Object.wait(Native Method)
>   at java.lang.Thread.join(Thread.java:1252)
>   - locked <0x849c82b0> (a 
> org.apache.ignite.testframework.GridTestSafeThreadFactory$1)
>   at 
> org.apache.ignite.internal.util.IgniteUtils.join(IgniteUtils.java:4621)
>   at 
> org.apache.ignite.internal.util.IgniteUtils.join(IgniteUtils.java:4606)
>   at 
> org.apache.ignite.internal.util.IgniteUtils.joinThreads(IgniteUtils.java:4650)
>   at 
> org.apache.ignite.testframework.GridTestSafeThreadFactory.stopAllThreads(GridTestSafeThreadFactory.java:206)
>   at 
> org.apache.ignite.testframework.GridTestUtils.stopThreads(GridTestUtils.java:985)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest.tearDown(GridAbstractTest.java:1705)
>   at 
> org.apache.ignite.testframework.junits.common.GridCommonAbstractTest.tearDown(GridCommonAbstractTest.java:497)
>   at junit.framework.TestCase.runBare(TestCase.java:146)
>   at junit.framework.TestResult$1.protect(TestResult.java:122)
>   at junit.framework.TestResult.runProtected(TestResult.java:142)
>   at junit.framework.TestResult.run(TestResult.java:125)
>   at junit.framework.TestCase.run(TestCase.java:129)
>   at junit.framework.TestSuite.runTest(TestSuite.java:255)
>   at junit.framework.TestSuite.run(TestSuite.java:250)
>   at junit.framework.TestSuite.runTest(TestSuite.java:255)
>   at junit.framework.TestSuite.run(TestSuite.java:250)
>   at 
> org.junit.internal.runners.JUnit38ClassRunner.run(JUnit38ClassRunner.java:84)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.execute(JUnit4Provider.java:369)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.executeWithRerun(JUnit4Provider.java:275)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:239)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:160)
>   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>   at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>   at java.lang.reflect.Method.invoke(Method.java:498)
>   at 
> org.apache.maven.surefire.util.ReflectionUtils.invokeMethodWithArray2(ReflectionUtils.java:206)
>   at 
> org.apache.maven.surefire.booter.ProviderFactory$ProviderProxy.invoke(ProviderFactory.java:160)
>   at 
> org.apache.maven.surefire.booter.ProviderFactory.invokeProvider(ProviderFactory.java:83)
>   at 
> org.apache.maven.plugin.surefire.InPluginVMSurefireStarter.runSuitesInProcess(InPluginVMSurefireStarter.java:84)
>   at 
> org.apache.maven.plugin.surefire.AbstractSurefireMojo.executeProvider(AbstractSurefireMojo.java:1107)
>   at 
> org.apache.maven.plugin.surefire.AbstractSurefireMojo.executeAfterPreconditionsChecked(AbstractSurefireMojo.java:954)
>   at 
> org.apache.maven.plugin.surefire.AbstractSurefireMojo.execute(AbstractSurefireMojo.java:832)
>   at 
> org.apache.maven.plugin.DefaultBuildPluginManager.executeMojo(DefaultBuildPluginManager.java:137)
>   at 
> org.apache.maven.lifecycle.internal.MojoExecutor.execute(MojoExecutor.java:208)
>   at 
> org.apache.maven.lifecycle.internal.MojoExecutor.execute(MojoExecutor.java:154)
>   at 
> org.apache.maven.lifecycle.internal.MojoExecutor.execute(MojoExecutor.java:146)
>   at 
> org.apache.maven.lifecycle.internal.LifecycleModuleBuilder.buildProject(LifecycleModuleBuilder.java:117)
>   at 
> org.apache.maven.lifecycle.internal.LifecycleModuleBuilder.buildProject(LifecycleModuleBuilder.java:81)
>   at 
> org.apache.maven.lifecycle.internal.builder.singlethreaded.SingleThreadedBuilder.build(SingleThreadedBuilder.java:56)
>   at 
> org.apache.maven.lifecycle.internal.LifecycleStarter.execute(LifecycleStarter.java:128)
>   at org.apache.maven.DefaultMaven.doExecute(DefaultMaven.java:305)
>   at 

[jira] [Updated] (IGNITE-9169) Cache (Deadlock Detection) suite hangs

2018-08-02 Thread Vitaliy Biryukov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-9169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov updated IGNITE-9169:
-
Description: 
{noformat}
"main" #1 prio=5 os_prio=0 tid=0x7f289800e000 nid=0x67bb in Object.wait() 
[0x7f289f9dc000]
   java.lang.Thread.State: WAITING (on object monitor)
at java.lang.Object.wait(Native Method)
at java.lang.Thread.join(Thread.java:1252)
- locked <0x849c82b0> (a 
org.apache.ignite.testframework.GridTestSafeThreadFactory$1)
at 
org.apache.ignite.internal.util.IgniteUtils.join(IgniteUtils.java:4621)
at 
org.apache.ignite.internal.util.IgniteUtils.join(IgniteUtils.java:4606)
at 
org.apache.ignite.internal.util.IgniteUtils.joinThreads(IgniteUtils.java:4650)
at 
org.apache.ignite.testframework.GridTestSafeThreadFactory.stopAllThreads(GridTestSafeThreadFactory.java:206)
at 
org.apache.ignite.testframework.GridTestUtils.stopThreads(GridTestUtils.java:985)
at 
org.apache.ignite.testframework.junits.GridAbstractTest.tearDown(GridAbstractTest.java:1705)
at 
org.apache.ignite.testframework.junits.common.GridCommonAbstractTest.tearDown(GridCommonAbstractTest.java:497)
at junit.framework.TestCase.runBare(TestCase.java:146)
at junit.framework.TestResult$1.protect(TestResult.java:122)
at junit.framework.TestResult.runProtected(TestResult.java:142)
at junit.framework.TestResult.run(TestResult.java:125)
at junit.framework.TestCase.run(TestCase.java:129)
at junit.framework.TestSuite.runTest(TestSuite.java:255)
at junit.framework.TestSuite.run(TestSuite.java:250)
at junit.framework.TestSuite.runTest(TestSuite.java:255)
at junit.framework.TestSuite.run(TestSuite.java:250)
at 
org.junit.internal.runners.JUnit38ClassRunner.run(JUnit38ClassRunner.java:84)
at 
org.apache.maven.surefire.junit4.JUnit4Provider.execute(JUnit4Provider.java:369)
at 
org.apache.maven.surefire.junit4.JUnit4Provider.executeWithRerun(JUnit4Provider.java:275)
at 
org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:239)
at 
org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:160)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:498)
at 
org.apache.maven.surefire.util.ReflectionUtils.invokeMethodWithArray2(ReflectionUtils.java:206)
at 
org.apache.maven.surefire.booter.ProviderFactory$ProviderProxy.invoke(ProviderFactory.java:160)
at 
org.apache.maven.surefire.booter.ProviderFactory.invokeProvider(ProviderFactory.java:83)
at 
org.apache.maven.plugin.surefire.InPluginVMSurefireStarter.runSuitesInProcess(InPluginVMSurefireStarter.java:84)
at 
org.apache.maven.plugin.surefire.AbstractSurefireMojo.executeProvider(AbstractSurefireMojo.java:1107)
at 
org.apache.maven.plugin.surefire.AbstractSurefireMojo.executeAfterPreconditionsChecked(AbstractSurefireMojo.java:954)
at 
org.apache.maven.plugin.surefire.AbstractSurefireMojo.execute(AbstractSurefireMojo.java:832)
at 
org.apache.maven.plugin.DefaultBuildPluginManager.executeMojo(DefaultBuildPluginManager.java:137)
at 
org.apache.maven.lifecycle.internal.MojoExecutor.execute(MojoExecutor.java:208)
at 
org.apache.maven.lifecycle.internal.MojoExecutor.execute(MojoExecutor.java:154)
at 
org.apache.maven.lifecycle.internal.MojoExecutor.execute(MojoExecutor.java:146)
at 
org.apache.maven.lifecycle.internal.LifecycleModuleBuilder.buildProject(LifecycleModuleBuilder.java:117)
at 
org.apache.maven.lifecycle.internal.LifecycleModuleBuilder.buildProject(LifecycleModuleBuilder.java:81)
at 
org.apache.maven.lifecycle.internal.builder.singlethreaded.SingleThreadedBuilder.build(SingleThreadedBuilder.java:56)
at 
org.apache.maven.lifecycle.internal.LifecycleStarter.execute(LifecycleStarter.java:128)
at org.apache.maven.DefaultMaven.doExecute(DefaultMaven.java:305)
at org.apache.maven.DefaultMaven.doExecute(DefaultMaven.java:192)
at org.apache.maven.DefaultMaven.execute(DefaultMaven.java:105)
at org.apache.maven.cli.MavenCli.execute(MavenCli.java:956)
at org.apache.maven.cli.MavenCli.doMain(MavenCli.java:290)
at org.apache.maven.cli.MavenCli.main(MavenCli.java:194)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at 

[jira] [Updated] (IGNITE-9169) Cache (Deadlock Detection) suite hangs

2018-08-02 Thread Vitaliy Biryukov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-9169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov updated IGNITE-9169:
-
Ignite Flags:   (was: Docs Required)

> Cache (Deadlock Detection) suite hangs
> --
>
> Key: IGNITE-9169
> URL: https://issues.apache.org/jira/browse/IGNITE-9169
> Project: Ignite
>  Issue Type: Bug
>Affects Versions: 2.6
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
> Fix For: 2.7
>
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (IGNITE-9169) Cache (Deadlock Detection) suite hangs

2018-08-02 Thread Vitaliy Biryukov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-9169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov updated IGNITE-9169:
-
Summary: Cache (Deadlock Detection) suite hangs  (was: Cache (Deadlock 
Detection) hangs)

> Cache (Deadlock Detection) suite hangs
> --
>
> Key: IGNITE-9169
> URL: https://issues.apache.org/jira/browse/IGNITE-9169
> Project: Ignite
>  Issue Type: Bug
>Affects Versions: 2.6
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
> Fix For: 2.7
>
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (IGNITE-9169) Cache (Deadlock Detection) hangs

2018-08-02 Thread Vitaliy Biryukov (JIRA)
Vitaliy Biryukov created IGNITE-9169:


 Summary: Cache (Deadlock Detection) hangs
 Key: IGNITE-9169
 URL: https://issues.apache.org/jira/browse/IGNITE-9169
 Project: Ignite
  Issue Type: Bug
Affects Versions: 2.6
Reporter: Vitaliy Biryukov
Assignee: Vitaliy Biryukov
 Fix For: 2.7






--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (IGNITE-9138) ZookeeperDiscoverySpiTest#checkInternalStructuresCleanup fails.

2018-08-02 Thread Vitaliy Biryukov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-9138?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov updated IGNITE-9138:
-
Summary: ZookeeperDiscoverySpiTest#checkInternalStructuresCleanup fails.  
(was: ZookeeperDiscoverySpiTest#checkInternalStructuresCleanup fails if zk 
cluster was stpped before nodes.)

> ZookeeperDiscoverySpiTest#checkInternalStructuresCleanup fails.
> ---
>
> Key: IGNITE-9138
> URL: https://issues.apache.org/jira/browse/IGNITE-9138
> Project: Ignite
>  Issue Type: Bug
>Affects Versions: 2.6
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
> Fix For: 2.7
>
>
>  
> {noformat}
> junit.framework.AssertionFailedError: Expected:  but was: 
> ZkCommunicationErrorProcessFuture 
> [impl=org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl@180b3819,
>  endTime=1532545453881, id=9e083d2d461-645a2360-f5bb-43d3-8327-83d0a4a00124, 
> state=WAIT_TIMEOUT, resolveTopVer=0, resErr=null, collectResFut=null]
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.checkInternalStructuresCleanup(ZookeeperDiscoverySpiTest.java:517)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.afterTest(ZookeeperDiscoverySpiTest.java:476)
> {noformat}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (IGNITE-8179) ZookeeperDiscoverySpiTest#testCommunicationFailureResolve_KillRandom always fails on TC

2018-08-01 Thread Vitaliy Biryukov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-8179?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16565216#comment-16565216
 ] 

Vitaliy Biryukov commented on IGNITE-8179:
--

[~Jokser], Done. Please take a look.

> ZookeeperDiscoverySpiTest#testCommunicationFailureResolve_KillRandom always 
> fails on TC
> ---
>
> Key: IGNITE-8179
> URL: https://issues.apache.org/jira/browse/IGNITE-8179
> Project: Ignite
>  Issue Type: Bug
>  Components: zookeeper
>Reporter: Sergey Chugunov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
>
> Test fails on TC with the following stack trace:
> {noformat}
> class org.apache.ignite.IgniteCheckedException: Failed to start manager: 
> GridManagerAdapter [enabled=true, 
> name=org.apache.ignite.internal.managers.discovery.GridDiscoveryManager]
> at 
> org.apache.ignite.internal.IgniteKernal.startManager(IgniteKernal.java:1698)
> at org.apache.ignite.internal.IgniteKernal.start(IgniteKernal.java:1007)
> at 
> org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.start0(IgnitionEx.java:1977)
> at 
> org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.start(IgnitionEx.java:1720)
> at org.apache.ignite.internal.IgnitionEx.start0(IgnitionEx.java:1148)
> at org.apache.ignite.internal.IgnitionEx.start(IgnitionEx.java:646)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.startGrid(GridAbstractTest.java:882)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.startGrid(GridAbstractTest.java:845)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.startGrid(GridAbstractTest.java:833)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.startGrid(GridAbstractTest.java:799)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.startGrids(GridAbstractTest.java:683)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.startGridsMultiThreaded(GridAbstractTest.java:710)
> at 
> org.apache.ignite.testframework.junits.common.GridCommonAbstractTest.startGridsMultiThreaded(GridCommonAbstractTest.java:507)
> at 
> org.apache.ignite.testframework.junits.common.GridCommonAbstractTest.startGridsMultiThreaded(GridCommonAbstractTest.java:497)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.testCommunicationFailureResolve_KillRandom(ZookeeperDiscoverySpiTest.java:2742)
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
> at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> at java.lang.reflect.Method.invoke(Method.java:498)
> at junit.framework.TestCase.runTest(TestCase.java:176)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.runTestInternal(GridAbstractTest.java:2080)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.access$000(GridAbstractTest.java:140)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest$5.run(GridAbstractTest.java:1995)
> at java.lang.Thread.run(Thread.java:748)
> Caused by: class org.apache.ignite.IgniteCheckedException: Failed to start 
> SPI: ZookeeperDiscoverySpi [zkRootPath=/apacheIgnite, 
> zkConnectionString=127.0.0.1:40921,127.0.0.1:35014,127.0.0.1:38754, 
> joinTimeout=0, sesTimeout=2000, clientReconnectDisabled=false, 
> internalLsnr=null]
> at 
> org.apache.ignite.internal.managers.GridManagerAdapter.startSpi(GridManagerAdapter.java:300)
> at 
> org.apache.ignite.internal.managers.discovery.GridDiscoveryManager.start(GridDiscoveryManager.java:905)
> at 
> org.apache.ignite.internal.IgniteKernal.startManager(IgniteKernal.java:1693)
> ... 23 more
> Caused by: class org.apache.ignite.spi.IgniteSpiException: Failed to 
> initialize Zookeeper nodes
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.initZkNodes(ZookeeperDiscoveryImpl.java:827)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.startJoin(ZookeeperDiscoveryImpl.java:957)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.joinTopology(ZookeeperDiscoveryImpl.java:775)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.startJoinAndWait(ZookeeperDiscoveryImpl.java:693)
> at 
> org.apache.ignite.spi.discovery.zk.ZookeeperDiscoverySpi.spiStart(ZookeeperDiscoverySpi.java:471)
> at 
> org.apache.ignite.internal.managers.GridManagerAdapter.startSpi(GridManagerAdapter.java:297)
> ... 25 more
> Caused by: 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperClientFailedException: 
> 

[jira] [Updated] (IGNITE-9134) ZookeeperDiscoverySpiTest#testLargeUserAttribute3 fails with OOME

2018-07-31 Thread Vitaliy Biryukov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-9134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov updated IGNITE-9134:
-
Description: 
{noformat}
java.lang.OutOfMemoryError: Java heap space
at java.lang.reflect.Array.newInstance(Array.java:75)
at java.io.ObjectInputStream.readArray(ObjectInputStream.java:1671)
at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1345)
at java.io.ObjectInputStream.readObject(ObjectInputStream.java:371)
at 
org.apache.ignite.internal.util.IgniteUtils.readMap(IgniteUtils.java:5179)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperClusterNode.readExternal(ZookeeperClusterNode.java:365)
at 
java.io.ObjectInputStream.readExternalData(ObjectInputStream.java:1842)
at 
java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1799)
at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1351)
at 
java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:2000)
at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1924)
at 
java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1801)
at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1351)
at java.io.ObjectInputStream.readObject(ObjectInputStream.java:371)
at 
org.apache.ignite.marshaller.jdk.JdkMarshaller.unmarshal0(JdkMarshaller.java:139)
at 
org.apache.ignite.marshaller.AbstractNodeNameAwareMarshaller.unmarshal(AbstractNodeNameAwareMarshaller.java:94)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.unmarshalZip(ZookeeperDiscoveryImpl.java:3925)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.unmarshalJoinData(ZookeeperDiscoveryImpl.java:1824)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.processBulkJoin(ZookeeperDiscoveryImpl.java:2779)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.processNewEvents(ZookeeperDiscoveryImpl.java:2628)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.processNewEvents(ZookeeperDiscoveryImpl.java:2603)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.access$2000(ZookeeperDiscoveryImpl.java:108)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl$ZkWatcher.processResult(ZookeeperDiscoveryImpl.java:4113)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperClient$DataCallbackWrapper.processResult(ZookeeperClient.java:1219)
at 
org.apache.zookeeper.ClientCnxn$EventThread.processEvent(ClientCnxn.java:561)
at org.apache.zookeeper.ClientCnxn$EventThread.run(ClientCnxn.java:498)
{noformat}


  was:
java.lang.OutOfMemoryError: Java heap space
at java.lang.reflect.Array.newInstance(Array.java:75)
at java.io.ObjectInputStream.readArray(ObjectInputStream.java:1671)
at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1345)
at java.io.ObjectInputStream.readObject(ObjectInputStream.java:371)
at 
org.apache.ignite.internal.util.IgniteUtils.readMap(IgniteUtils.java:5179)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperClusterNode.readExternal(ZookeeperClusterNode.java:365)
at 
java.io.ObjectInputStream.readExternalData(ObjectInputStream.java:1842)
at 
java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1799)
at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1351)
at 
java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:2000)
at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1924)
at 
java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1801)
at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1351)
at java.io.ObjectInputStream.readObject(ObjectInputStream.java:371)
at 
org.apache.ignite.marshaller.jdk.JdkMarshaller.unmarshal0(JdkMarshaller.java:139)
at 
org.apache.ignite.marshaller.AbstractNodeNameAwareMarshaller.unmarshal(AbstractNodeNameAwareMarshaller.java:94)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.unmarshalZip(ZookeeperDiscoveryImpl.java:3925)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.unmarshalJoinData(ZookeeperDiscoveryImpl.java:1824)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.processBulkJoin(ZookeeperDiscoveryImpl.java:2779)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.processNewEvents(ZookeeperDiscoveryImpl.java:2628)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.processNewEvents(ZookeeperDiscoveryImpl.java:2603)
at 

[jira] [Updated] (IGNITE-9134) ZookeeperDiscoverySpiTest#testLargeUserAttribute3 fails with OOME

2018-07-31 Thread Vitaliy Biryukov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-9134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov updated IGNITE-9134:
-
Description: 
java.lang.OutOfMemoryError: Java heap space
at java.lang.reflect.Array.newInstance(Array.java:75)
at java.io.ObjectInputStream.readArray(ObjectInputStream.java:1671)
at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1345)
at java.io.ObjectInputStream.readObject(ObjectInputStream.java:371)
at 
org.apache.ignite.internal.util.IgniteUtils.readMap(IgniteUtils.java:5179)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperClusterNode.readExternal(ZookeeperClusterNode.java:365)
at 
java.io.ObjectInputStream.readExternalData(ObjectInputStream.java:1842)
at 
java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1799)
at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1351)
at 
java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:2000)
at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1924)
at 
java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1801)
at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1351)
at java.io.ObjectInputStream.readObject(ObjectInputStream.java:371)
at 
org.apache.ignite.marshaller.jdk.JdkMarshaller.unmarshal0(JdkMarshaller.java:139)
at 
org.apache.ignite.marshaller.AbstractNodeNameAwareMarshaller.unmarshal(AbstractNodeNameAwareMarshaller.java:94)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.unmarshalZip(ZookeeperDiscoveryImpl.java:3925)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.unmarshalJoinData(ZookeeperDiscoveryImpl.java:1824)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.processBulkJoin(ZookeeperDiscoveryImpl.java:2779)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.processNewEvents(ZookeeperDiscoveryImpl.java:2628)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.processNewEvents(ZookeeperDiscoveryImpl.java:2603)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.access$2000(ZookeeperDiscoveryImpl.java:108)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl$ZkWatcher.processResult(ZookeeperDiscoveryImpl.java:4113)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperClient$DataCallbackWrapper.processResult(ZookeeperClient.java:1219)
at 
org.apache.zookeeper.ClientCnxn$EventThread.processEvent(ClientCnxn.java:561)
at org.apache.zookeeper.ClientCnxn$EventThread.run(ClientCnxn.java:498)

> ZookeeperDiscoverySpiTest#testLargeUserAttribute3 fails with OOME
> -
>
> Key: IGNITE-9134
> URL: https://issues.apache.org/jira/browse/IGNITE-9134
> Project: Ignite
>  Issue Type: Bug
>Affects Versions: 2.6
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
> Fix For: 2.7
>
>
> java.lang.OutOfMemoryError: Java heap space
>   at java.lang.reflect.Array.newInstance(Array.java:75)
>   at java.io.ObjectInputStream.readArray(ObjectInputStream.java:1671)
>   at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1345)
>   at java.io.ObjectInputStream.readObject(ObjectInputStream.java:371)
>   at 
> org.apache.ignite.internal.util.IgniteUtils.readMap(IgniteUtils.java:5179)
>   at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperClusterNode.readExternal(ZookeeperClusterNode.java:365)
>   at 
> java.io.ObjectInputStream.readExternalData(ObjectInputStream.java:1842)
>   at 
> java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1799)
>   at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1351)
>   at 
> java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:2000)
>   at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1924)
>   at 
> java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1801)
>   at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1351)
>   at java.io.ObjectInputStream.readObject(ObjectInputStream.java:371)
>   at 
> org.apache.ignite.marshaller.jdk.JdkMarshaller.unmarshal0(JdkMarshaller.java:139)
>   at 
> org.apache.ignite.marshaller.AbstractNodeNameAwareMarshaller.unmarshal(AbstractNodeNameAwareMarshaller.java:94)
>   at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.unmarshalZip(ZookeeperDiscoveryImpl.java:3925)
>   at 
> 

[jira] [Commented] (IGNITE-9130) ZookeeperDiscoverySpiTest#testDuplicatedNodeId fails

2018-07-31 Thread Vitaliy Biryukov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-9130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16563697#comment-16563697
 ] 

Vitaliy Biryukov commented on IGNITE-9130:
--

[~dpavlov], Done.

> ZookeeperDiscoverySpiTest#testDuplicatedNodeId fails
> 
>
> Key: IGNITE-9130
> URL: https://issues.apache.org/jira/browse/IGNITE-9130
> Project: Ignite
>  Issue Type: Bug
>Affects Versions: 2.6
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
> Fix For: 2.7
>
>
> Fail:
> {noformat}
> junit.framework.AssertionFailedError
>   at junit.framework.Assert.fail(Assert.java:55)
>   at junit.framework.Assert.assertTrue(Assert.java:22)
>   at junit.framework.Assert.assertNotNull(Assert.java:256)
>   at junit.framework.Assert.assertNotNull(Assert.java:248)
>   at junit.framework.TestCase.assertNotNull(TestCase.java:417)
>   at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.testDuplicatedNodeId(ZookeeperDiscoverySpiTest.java:2565)
>   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>   at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>   at java.lang.reflect.Method.invoke(Method.java:498)
>   at junit.framework.TestCase.runTest(TestCase.java:176)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest.runTestInternal(GridAbstractTest.java:2087)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest.access$000(GridAbstractTest.java:140)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest$5.run(GridAbstractTest.java:2002)
>   at java.lang.Thread.run(Thread.java:748)
> {noformat}
> A new error that throws instead of the old one:
> {noformat}
> Caused by: org.h2.jdbc.JdbcSQLException: Table "NODES" already exists; SQL 
> statement:
> CREATE TABLE NODES(ID UUID, CONSISTENT_ID VARCHAR, VERSION VARCHAR, IS_LOCAL 
> BOOLEAN, IS_CLIENT BOOLEAN, IS_DAEMON BOOLEAN, NODE_ORDER INTEGER, ADDRESSES 
> VARCHAR, HOSTNAMES VARCHAR) ENGINE 
> "org.apache.ignite.internal.processors.query.h2.sys.SqlSystemTableEngine" 
> [42101-195]
>   at org.h2.message.DbException.getJdbcSQLException(DbException.java:345)
>   at org.h2.message.DbException.get(DbException.java:179)
>   at org.h2.message.DbException.get(DbException.java:155)
>   at org.h2.command.ddl.CreateTable.update(CreateTable.java:117)
>   at org.h2.command.CommandContainer.update(CommandContainer.java:101)
>   at org.h2.command.Command.executeUpdate(Command.java:260)
>   at org.h2.jdbc.JdbcStatement.executeInternal(JdbcStatement.java:193)
>   at org.h2.jdbc.JdbcStatement.execute(JdbcStatement.java:165)
>   at 
> org.apache.ignite.internal.processors.query.h2.sys.SqlSystemTableEngine.registerView(SqlSystemTableEngine.java:48)
>   at 
> org.apache.ignite.internal.processors.query.h2.IgniteH2Indexing.start(IgniteH2Indexing.java:2608)
> {noformat}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (IGNITE-9130) ZookeeperDiscoverySpiTest#testDuplicatedNodeId fails

2018-07-31 Thread Vitaliy Biryukov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-9130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov updated IGNITE-9130:
-
Description: 
Fail:
{noformat}
junit.framework.AssertionFailedError
at junit.framework.Assert.fail(Assert.java:55)
at junit.framework.Assert.assertTrue(Assert.java:22)
at junit.framework.Assert.assertNotNull(Assert.java:256)
at junit.framework.Assert.assertNotNull(Assert.java:248)
at junit.framework.TestCase.assertNotNull(TestCase.java:417)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.testDuplicatedNodeId(ZookeeperDiscoverySpiTest.java:2565)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:498)
at junit.framework.TestCase.runTest(TestCase.java:176)
at 
org.apache.ignite.testframework.junits.GridAbstractTest.runTestInternal(GridAbstractTest.java:2087)
at 
org.apache.ignite.testframework.junits.GridAbstractTest.access$000(GridAbstractTest.java:140)
at 
org.apache.ignite.testframework.junits.GridAbstractTest$5.run(GridAbstractTest.java:2002)
at java.lang.Thread.run(Thread.java:748)
{noformat}

A new error that throes instead of the old one:
{noformat}
Caused by: org.h2.jdbc.JdbcSQLException: Table "NODES" already exists; SQL 
statement:
CREATE TABLE NODES(ID UUID, CONSISTENT_ID VARCHAR, VERSION VARCHAR, IS_LOCAL 
BOOLEAN, IS_CLIENT BOOLEAN, IS_DAEMON BOOLEAN, NODE_ORDER INTEGER, ADDRESSES 
VARCHAR, HOSTNAMES VARCHAR) ENGINE 
"org.apache.ignite.internal.processors.query.h2.sys.SqlSystemTableEngine" 
[42101-195]
at org.h2.message.DbException.getJdbcSQLException(DbException.java:345)
at org.h2.message.DbException.get(DbException.java:179)
at org.h2.message.DbException.get(DbException.java:155)
at org.h2.command.ddl.CreateTable.update(CreateTable.java:117)
at org.h2.command.CommandContainer.update(CommandContainer.java:101)
at org.h2.command.Command.executeUpdate(Command.java:260)
at org.h2.jdbc.JdbcStatement.executeInternal(JdbcStatement.java:193)
at org.h2.jdbc.JdbcStatement.execute(JdbcStatement.java:165)
at 
org.apache.ignite.internal.processors.query.h2.sys.SqlSystemTableEngine.registerView(SqlSystemTableEngine.java:48)
at 
org.apache.ignite.internal.processors.query.h2.IgniteH2Indexing.start(IgniteH2Indexing.java:2608)
{noformat}


> ZookeeperDiscoverySpiTest#testDuplicatedNodeId fails
> 
>
> Key: IGNITE-9130
> URL: https://issues.apache.org/jira/browse/IGNITE-9130
> Project: Ignite
>  Issue Type: Bug
>Affects Versions: 2.6
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
> Fix For: 2.7
>
>
> Fail:
> {noformat}
> junit.framework.AssertionFailedError
>   at junit.framework.Assert.fail(Assert.java:55)
>   at junit.framework.Assert.assertTrue(Assert.java:22)
>   at junit.framework.Assert.assertNotNull(Assert.java:256)
>   at junit.framework.Assert.assertNotNull(Assert.java:248)
>   at junit.framework.TestCase.assertNotNull(TestCase.java:417)
>   at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.testDuplicatedNodeId(ZookeeperDiscoverySpiTest.java:2565)
>   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>   at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>   at java.lang.reflect.Method.invoke(Method.java:498)
>   at junit.framework.TestCase.runTest(TestCase.java:176)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest.runTestInternal(GridAbstractTest.java:2087)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest.access$000(GridAbstractTest.java:140)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest$5.run(GridAbstractTest.java:2002)
>   at java.lang.Thread.run(Thread.java:748)
> {noformat}
> A new error that throes instead of the old one:
> {noformat}
> Caused by: org.h2.jdbc.JdbcSQLException: Table "NODES" already exists; SQL 
> statement:
> CREATE TABLE NODES(ID UUID, CONSISTENT_ID VARCHAR, VERSION VARCHAR, IS_LOCAL 
> BOOLEAN, IS_CLIENT BOOLEAN, IS_DAEMON BOOLEAN, NODE_ORDER INTEGER, ADDRESSES 
> VARCHAR, HOSTNAMES VARCHAR) ENGINE 
> "org.apache.ignite.internal.processors.query.h2.sys.SqlSystemTableEngine" 
> [42101-195]
>   at 

[jira] [Updated] (IGNITE-9130) ZookeeperDiscoverySpiTest#testDuplicatedNodeId fails

2018-07-31 Thread Vitaliy Biryukov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-9130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov updated IGNITE-9130:
-
Description: 
Fail:
{noformat}
junit.framework.AssertionFailedError
at junit.framework.Assert.fail(Assert.java:55)
at junit.framework.Assert.assertTrue(Assert.java:22)
at junit.framework.Assert.assertNotNull(Assert.java:256)
at junit.framework.Assert.assertNotNull(Assert.java:248)
at junit.framework.TestCase.assertNotNull(TestCase.java:417)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.testDuplicatedNodeId(ZookeeperDiscoverySpiTest.java:2565)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:498)
at junit.framework.TestCase.runTest(TestCase.java:176)
at 
org.apache.ignite.testframework.junits.GridAbstractTest.runTestInternal(GridAbstractTest.java:2087)
at 
org.apache.ignite.testframework.junits.GridAbstractTest.access$000(GridAbstractTest.java:140)
at 
org.apache.ignite.testframework.junits.GridAbstractTest$5.run(GridAbstractTest.java:2002)
at java.lang.Thread.run(Thread.java:748)
{noformat}

A new error that throws instead of the old one:
{noformat}
Caused by: org.h2.jdbc.JdbcSQLException: Table "NODES" already exists; SQL 
statement:
CREATE TABLE NODES(ID UUID, CONSISTENT_ID VARCHAR, VERSION VARCHAR, IS_LOCAL 
BOOLEAN, IS_CLIENT BOOLEAN, IS_DAEMON BOOLEAN, NODE_ORDER INTEGER, ADDRESSES 
VARCHAR, HOSTNAMES VARCHAR) ENGINE 
"org.apache.ignite.internal.processors.query.h2.sys.SqlSystemTableEngine" 
[42101-195]
at org.h2.message.DbException.getJdbcSQLException(DbException.java:345)
at org.h2.message.DbException.get(DbException.java:179)
at org.h2.message.DbException.get(DbException.java:155)
at org.h2.command.ddl.CreateTable.update(CreateTable.java:117)
at org.h2.command.CommandContainer.update(CommandContainer.java:101)
at org.h2.command.Command.executeUpdate(Command.java:260)
at org.h2.jdbc.JdbcStatement.executeInternal(JdbcStatement.java:193)
at org.h2.jdbc.JdbcStatement.execute(JdbcStatement.java:165)
at 
org.apache.ignite.internal.processors.query.h2.sys.SqlSystemTableEngine.registerView(SqlSystemTableEngine.java:48)
at 
org.apache.ignite.internal.processors.query.h2.IgniteH2Indexing.start(IgniteH2Indexing.java:2608)
{noformat}


  was:
Fail:
{noformat}
junit.framework.AssertionFailedError
at junit.framework.Assert.fail(Assert.java:55)
at junit.framework.Assert.assertTrue(Assert.java:22)
at junit.framework.Assert.assertNotNull(Assert.java:256)
at junit.framework.Assert.assertNotNull(Assert.java:248)
at junit.framework.TestCase.assertNotNull(TestCase.java:417)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.testDuplicatedNodeId(ZookeeperDiscoverySpiTest.java:2565)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:498)
at junit.framework.TestCase.runTest(TestCase.java:176)
at 
org.apache.ignite.testframework.junits.GridAbstractTest.runTestInternal(GridAbstractTest.java:2087)
at 
org.apache.ignite.testframework.junits.GridAbstractTest.access$000(GridAbstractTest.java:140)
at 
org.apache.ignite.testframework.junits.GridAbstractTest$5.run(GridAbstractTest.java:2002)
at java.lang.Thread.run(Thread.java:748)
{noformat}

A new error that throes instead of the old one:
{noformat}
Caused by: org.h2.jdbc.JdbcSQLException: Table "NODES" already exists; SQL 
statement:
CREATE TABLE NODES(ID UUID, CONSISTENT_ID VARCHAR, VERSION VARCHAR, IS_LOCAL 
BOOLEAN, IS_CLIENT BOOLEAN, IS_DAEMON BOOLEAN, NODE_ORDER INTEGER, ADDRESSES 
VARCHAR, HOSTNAMES VARCHAR) ENGINE 
"org.apache.ignite.internal.processors.query.h2.sys.SqlSystemTableEngine" 
[42101-195]
at org.h2.message.DbException.getJdbcSQLException(DbException.java:345)
at org.h2.message.DbException.get(DbException.java:179)
at org.h2.message.DbException.get(DbException.java:155)
at org.h2.command.ddl.CreateTable.update(CreateTable.java:117)
at org.h2.command.CommandContainer.update(CommandContainer.java:101)
at org.h2.command.Command.executeUpdate(Command.java:260)
at org.h2.jdbc.JdbcStatement.executeInternal(JdbcStatement.java:193)
at 

[jira] [Commented] (IGNITE-9130) ZookeeperDiscoverySpiTest#testDuplicatedNodeId fails

2018-07-31 Thread Vitaliy Biryukov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-9130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16563532#comment-16563532
 ] 

Vitaliy Biryukov commented on IGNITE-9130:
--

This PR  [https://github.com/apache/ignite/pull/4458] is right.

 

> ZookeeperDiscoverySpiTest#testDuplicatedNodeId fails
> 
>
> Key: IGNITE-9130
> URL: https://issues.apache.org/jira/browse/IGNITE-9130
> Project: Ignite
>  Issue Type: Bug
>Affects Versions: 2.6
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
> Fix For: 2.7
>
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (IGNITE-9138) ZookeeperDiscoverySpiTest#checkInternalStructuresCleanup fails if zk cluster was stpped before nodes.

2018-07-31 Thread Vitaliy Biryukov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-9138?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov updated IGNITE-9138:
-
Description: 
 
{noformat}
junit.framework.AssertionFailedError: Expected:  but was: 
ZkCommunicationErrorProcessFuture 
[impl=org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl@180b3819,
 endTime=1532545453881, id=9e083d2d461-645a2360-f5bb-43d3-8327-83d0a4a00124, 
state=WAIT_TIMEOUT, resolveTopVer=0, resErr=null, collectResFut=null]
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.checkInternalStructuresCleanup(ZookeeperDiscoverySpiTest.java:517)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.afterTest(ZookeeperDiscoverySpiTest.java:476)
{noformat}

> ZookeeperDiscoverySpiTest#checkInternalStructuresCleanup fails if zk cluster 
> was stpped before nodes.
> -
>
> Key: IGNITE-9138
> URL: https://issues.apache.org/jira/browse/IGNITE-9138
> Project: Ignite
>  Issue Type: Bug
>Affects Versions: 2.6
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
> Fix For: 2.7
>
>
>  
> {noformat}
> junit.framework.AssertionFailedError: Expected:  but was: 
> ZkCommunicationErrorProcessFuture 
> [impl=org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl@180b3819,
>  endTime=1532545453881, id=9e083d2d461-645a2360-f5bb-43d3-8327-83d0a4a00124, 
> state=WAIT_TIMEOUT, resolveTopVer=0, resErr=null, collectResFut=null]
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.checkInternalStructuresCleanup(ZookeeperDiscoverySpiTest.java:517)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.afterTest(ZookeeperDiscoverySpiTest.java:476)
> {noformat}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (IGNITE-9138) ZookeeperDiscoverySpiTest#checkInternalStructuresCleanup fails if zk cluster was stpped before nodes.

2018-07-31 Thread Vitaliy Biryukov (JIRA)
Vitaliy Biryukov created IGNITE-9138:


 Summary: ZookeeperDiscoverySpiTest#checkInternalStructuresCleanup 
fails if zk cluster was stpped before nodes.
 Key: IGNITE-9138
 URL: https://issues.apache.org/jira/browse/IGNITE-9138
 Project: Ignite
  Issue Type: Bug
Affects Versions: 2.6
Reporter: Vitaliy Biryukov
Assignee: Vitaliy Biryukov
 Fix For: 2.7






--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (IGNITE-9134) ZookeeperDiscoverySpiTest#testLargeUserAttribute3 fails with OOME

2018-07-30 Thread Vitaliy Biryukov (JIRA)
Vitaliy Biryukov created IGNITE-9134:


 Summary: ZookeeperDiscoverySpiTest#testLargeUserAttribute3 fails 
with OOME
 Key: IGNITE-9134
 URL: https://issues.apache.org/jira/browse/IGNITE-9134
 Project: Ignite
  Issue Type: Bug
Affects Versions: 2.6
Reporter: Vitaliy Biryukov
Assignee: Vitaliy Biryukov
 Fix For: 2.7






--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (IGNITE-9130) ZookeeperDiscoverySpiTest#testDuplicatedNodeId fails

2018-07-30 Thread Vitaliy Biryukov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-9130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16562216#comment-16562216
 ] 

Vitaliy Biryukov commented on IGNITE-9130:
--

This test checks that if a node starts with a duplicate ID, a certain error 
throws at the start of discovery SPI.
After this commit[1], another error started to throw at the start of 
*IgniteH2Indexing*.
I changed the test so that it checks that an error has thrown in one of the 
processors or managers.

[1]https://github.com/apache/ignite/commit/6263dbe6c45a0656ba55a0eb04d07941fac27c7c

> ZookeeperDiscoverySpiTest#testDuplicatedNodeId fails
> 
>
> Key: IGNITE-9130
> URL: https://issues.apache.org/jira/browse/IGNITE-9130
> Project: Ignite
>  Issue Type: Bug
>Affects Versions: 2.6
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
> Fix For: 2.7
>
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (IGNITE-9130) ZookeeperDiscoverySpiTest#testDuplicatedNodeId fails

2018-07-30 Thread Vitaliy Biryukov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-9130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov updated IGNITE-9130:
-
Fix Version/s: 2.7

> ZookeeperDiscoverySpiTest#testDuplicatedNodeId fails
> 
>
> Key: IGNITE-9130
> URL: https://issues.apache.org/jira/browse/IGNITE-9130
> Project: Ignite
>  Issue Type: Bug
>Affects Versions: 2.6
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
> Fix For: 2.7
>
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (IGNITE-9130) ZookeeperDiscoverySpiTest#testDuplicatedNodeId fails

2018-07-30 Thread Vitaliy Biryukov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-9130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov updated IGNITE-9130:
-
Ignite Flags:   (was: Docs Required)

> ZookeeperDiscoverySpiTest#testDuplicatedNodeId fails
> 
>
> Key: IGNITE-9130
> URL: https://issues.apache.org/jira/browse/IGNITE-9130
> Project: Ignite
>  Issue Type: Bug
>Affects Versions: 2.6
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (IGNITE-9130) ZookeeperDiscoverySpiTest#testDuplicatedNodeId fails

2018-07-30 Thread Vitaliy Biryukov (JIRA)
Vitaliy Biryukov created IGNITE-9130:


 Summary: ZookeeperDiscoverySpiTest#testDuplicatedNodeId fails
 Key: IGNITE-9130
 URL: https://issues.apache.org/jira/browse/IGNITE-9130
 Project: Ignite
  Issue Type: Bug
Affects Versions: 2.6
Reporter: Vitaliy Biryukov
Assignee: Vitaliy Biryukov






--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Comment Edited] (IGNITE-9127) ZooKeeper (Discovery) 1 suite hang.

2018-07-30 Thread Vitaliy Biryukov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-9127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16562073#comment-16562073
 ] 

Vitaliy Biryukov edited comment on IGNITE-9127 at 7/30/18 4:15 PM:
---

I've run this suit on TC about a 100 times.

It seems hangs are over.


was (Author: vitaliyb):
I've run this suit on TC about a 100 times.

it seems hangs are over.

> ZooKeeper (Discovery) 1 suite hang.
> ---
>
> Key: IGNITE-9127
> URL: https://issues.apache.org/jira/browse/IGNITE-9127
> Project: Ignite
>  Issue Type: Bug
>Affects Versions: 2.6
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
>
> Causes:
> 1)
> {noformat}
> [13:14:30]W:   [org.apache.ignite:ignite-zookeeper] [2018-07-28 
> 10:14:30,637][WARN ][Thread-3997][QuorumPeerMain] Unable to register log4j 
> JMX control
> [13:14:30]W:   [org.apache.ignite:ignite-zookeeper] 
> javax.management.InstanceAlreadyExistsException: log4j:hiearchy=default
> [13:14:30]W:   [org.apache.ignite:ignite-zookeeper]   at 
> com.sun.jmx.mbeanserver.Repository.addMBean(Repository.java:437)
> [13:14:30]W:   [org.apache.ignite:ignite-zookeeper]   at 
> com.sun.jmx.interceptor.DefaultMBeanServerInterceptor.registerWithRepository(DefaultMBeanServerInterceptor.java:1898)
> [13:14:30]W:   [org.apache.ignite:ignite-zookeeper]   at 
> com.sun.jmx.interceptor.DefaultMBeanServerInterceptor.registerDynamicMBean(DefaultMBeanServerInterceptor.java:966)
> [13:14:30]W:   [org.apache.ignite:ignite-zookeeper]   at 
> com.sun.jmx.interceptor.DefaultMBeanServerInterceptor.registerObject(DefaultMBeanServerInterceptor.java:900)
> [13:14:30]W:   [org.apache.ignite:ignite-zookeeper]   at 
> com.sun.jmx.interceptor.DefaultMBeanServerInterceptor.registerMBean(DefaultMBeanServerInterceptor.java:324)
> [13:14:30]W:   [org.apache.ignite:ignite-zookeeper]   at 
> com.sun.jmx.mbeanserver.JmxMBeanServer.registerMBean(JmxMBeanServer.java:522)
> [13:14:30]W:   [org.apache.ignite:ignite-zookeeper]   at 
> org.apache.zookeeper.jmx.ManagedUtil.registerLog4jMBeans(ManagedUtil.java:53)
> [13:14:30]W:   [org.apache.ignite:ignite-zookeeper]   at 
> org.apache.zookeeper.server.quorum.QuorumPeerMain.runFromConfig(QuorumPeerMain.java:122)
> {noformat}
> 2)
> {noformat}
> "main" #1 prio=5 os_prio=0 tid=0x7fd28400d000 nid=0xbb7 waiting on 
> condition [0x7fd28c0a9000]
>java.lang.Thread.State: WAITING (parking)
>   at sun.misc.Unsafe.park(Native Method)
>   - parking to wait for  <0xc1a24e40> (a 
> java.util.concurrent.CountDownLatch$Sync)
>   at java.util.concurrent.locks.LockSupport.park(LockSupport.java:175)
>   at 
> java.util.concurrent.locks.AbstractQueuedSynchronizer.parkAndCheckInterrupt(AbstractQueuedSynchronizer.java:836)
>   at 
> java.util.concurrent.locks.AbstractQueuedSynchronizer.doAcquireSharedInterruptibly(AbstractQueuedSynchronizer.java:997)
>   at 
> java.util.concurrent.locks.AbstractQueuedSynchronizer.acquireSharedInterruptibly(AbstractQueuedSynchronizer.java:1304)
>   at java.util.concurrent.CountDownLatch.await(CountDownLatch.java:231)
>   at 
> org.apache.ignite.internal.util.IgniteUtils.awaitQuiet(IgniteUtils.java:7564)
>   at 
> org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.grid(IgnitionEx.java:1666)
>   at org.apache.ignite.internal.IgnitionEx.allGrids(IgnitionEx.java:1284)
>   at org.apache.ignite.internal.IgnitionEx.allGrids(IgnitionEx.java:1262)
>   at org.apache.ignite.Ignition.allGrids(Ignition.java:502)
>   at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.checkInternalStructuresCleanup(ZookeeperDiscoverySpiTest.java:485)
>   at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.afterTest(ZookeeperDiscoverySpiTest.java:453)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest.tearDown(GridAbstractTest.java:1694)
>   at 
> org.apache.ignite.testframework.junits.common.GridCommonAbstractTest.tearDown(GridCommonAbstractTest.java:497)
>   at junit.framework.TestCase.runBare(TestCase.java:146)
>   at junit.framework.TestResult$1.protect(TestResult.java:122)
>   at junit.framework.TestResult.runProtected(TestResult.java:142)
>   at junit.framework.TestResult.run(TestResult.java:125)
>   at junit.framework.TestCase.run(TestCase.java:129)
>   at junit.framework.TestSuite.runTest(TestSuite.java:255)
>   at junit.framework.TestSuite.run(TestSuite.java:250)
>   at junit.framework.TestSuite.runTest(TestSuite.java:255)
>   at junit.framework.TestSuite.run(TestSuite.java:250)
>   at 
> 

[jira] [Commented] (IGNITE-9127) ZooKeeper (Discovery) 1 suite hang.

2018-07-30 Thread Vitaliy Biryukov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-9127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16562073#comment-16562073
 ] 

Vitaliy Biryukov commented on IGNITE-9127:
--

I've run this suit on TC about a 100 times.

it seems hangs are over.

> ZooKeeper (Discovery) 1 suite hang.
> ---
>
> Key: IGNITE-9127
> URL: https://issues.apache.org/jira/browse/IGNITE-9127
> Project: Ignite
>  Issue Type: Bug
>Affects Versions: 2.6
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
>
> Causes:
> 1)
> {noformat}
> [13:14:30]W:   [org.apache.ignite:ignite-zookeeper] [2018-07-28 
> 10:14:30,637][WARN ][Thread-3997][QuorumPeerMain] Unable to register log4j 
> JMX control
> [13:14:30]W:   [org.apache.ignite:ignite-zookeeper] 
> javax.management.InstanceAlreadyExistsException: log4j:hiearchy=default
> [13:14:30]W:   [org.apache.ignite:ignite-zookeeper]   at 
> com.sun.jmx.mbeanserver.Repository.addMBean(Repository.java:437)
> [13:14:30]W:   [org.apache.ignite:ignite-zookeeper]   at 
> com.sun.jmx.interceptor.DefaultMBeanServerInterceptor.registerWithRepository(DefaultMBeanServerInterceptor.java:1898)
> [13:14:30]W:   [org.apache.ignite:ignite-zookeeper]   at 
> com.sun.jmx.interceptor.DefaultMBeanServerInterceptor.registerDynamicMBean(DefaultMBeanServerInterceptor.java:966)
> [13:14:30]W:   [org.apache.ignite:ignite-zookeeper]   at 
> com.sun.jmx.interceptor.DefaultMBeanServerInterceptor.registerObject(DefaultMBeanServerInterceptor.java:900)
> [13:14:30]W:   [org.apache.ignite:ignite-zookeeper]   at 
> com.sun.jmx.interceptor.DefaultMBeanServerInterceptor.registerMBean(DefaultMBeanServerInterceptor.java:324)
> [13:14:30]W:   [org.apache.ignite:ignite-zookeeper]   at 
> com.sun.jmx.mbeanserver.JmxMBeanServer.registerMBean(JmxMBeanServer.java:522)
> [13:14:30]W:   [org.apache.ignite:ignite-zookeeper]   at 
> org.apache.zookeeper.jmx.ManagedUtil.registerLog4jMBeans(ManagedUtil.java:53)
> [13:14:30]W:   [org.apache.ignite:ignite-zookeeper]   at 
> org.apache.zookeeper.server.quorum.QuorumPeerMain.runFromConfig(QuorumPeerMain.java:122)
> {noformat}
> 2)
> {noformat}
> "main" #1 prio=5 os_prio=0 tid=0x7fd28400d000 nid=0xbb7 waiting on 
> condition [0x7fd28c0a9000]
>java.lang.Thread.State: WAITING (parking)
>   at sun.misc.Unsafe.park(Native Method)
>   - parking to wait for  <0xc1a24e40> (a 
> java.util.concurrent.CountDownLatch$Sync)
>   at java.util.concurrent.locks.LockSupport.park(LockSupport.java:175)
>   at 
> java.util.concurrent.locks.AbstractQueuedSynchronizer.parkAndCheckInterrupt(AbstractQueuedSynchronizer.java:836)
>   at 
> java.util.concurrent.locks.AbstractQueuedSynchronizer.doAcquireSharedInterruptibly(AbstractQueuedSynchronizer.java:997)
>   at 
> java.util.concurrent.locks.AbstractQueuedSynchronizer.acquireSharedInterruptibly(AbstractQueuedSynchronizer.java:1304)
>   at java.util.concurrent.CountDownLatch.await(CountDownLatch.java:231)
>   at 
> org.apache.ignite.internal.util.IgniteUtils.awaitQuiet(IgniteUtils.java:7564)
>   at 
> org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.grid(IgnitionEx.java:1666)
>   at org.apache.ignite.internal.IgnitionEx.allGrids(IgnitionEx.java:1284)
>   at org.apache.ignite.internal.IgnitionEx.allGrids(IgnitionEx.java:1262)
>   at org.apache.ignite.Ignition.allGrids(Ignition.java:502)
>   at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.checkInternalStructuresCleanup(ZookeeperDiscoverySpiTest.java:485)
>   at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.afterTest(ZookeeperDiscoverySpiTest.java:453)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest.tearDown(GridAbstractTest.java:1694)
>   at 
> org.apache.ignite.testframework.junits.common.GridCommonAbstractTest.tearDown(GridCommonAbstractTest.java:497)
>   at junit.framework.TestCase.runBare(TestCase.java:146)
>   at junit.framework.TestResult$1.protect(TestResult.java:122)
>   at junit.framework.TestResult.runProtected(TestResult.java:142)
>   at junit.framework.TestResult.run(TestResult.java:125)
>   at junit.framework.TestCase.run(TestCase.java:129)
>   at junit.framework.TestSuite.runTest(TestSuite.java:255)
>   at junit.framework.TestSuite.run(TestSuite.java:250)
>   at junit.framework.TestSuite.runTest(TestSuite.java:255)
>   at junit.framework.TestSuite.run(TestSuite.java:250)
>   at 
> org.junit.internal.runners.JUnit38ClassRunner.run(JUnit38ClassRunner.java:84)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.execute(JUnit4Provider.java:369)
>   at 
> 

[jira] [Updated] (IGNITE-9127) ZooKeeper (Discovery) 1 suite hang.

2018-07-30 Thread Vitaliy Biryukov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-9127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov updated IGNITE-9127:
-
Description: 
Causes:
1)
{noformat}
[13:14:30]W: [org.apache.ignite:ignite-zookeeper] [2018-07-28 
10:14:30,637][WARN ][Thread-3997][QuorumPeerMain] Unable to register log4j JMX 
control
[13:14:30]W: [org.apache.ignite:ignite-zookeeper] 
javax.management.InstanceAlreadyExistsException: log4j:hiearchy=default
[13:14:30]W: [org.apache.ignite:ignite-zookeeper]   at 
com.sun.jmx.mbeanserver.Repository.addMBean(Repository.java:437)
[13:14:30]W: [org.apache.ignite:ignite-zookeeper]   at 
com.sun.jmx.interceptor.DefaultMBeanServerInterceptor.registerWithRepository(DefaultMBeanServerInterceptor.java:1898)
[13:14:30]W: [org.apache.ignite:ignite-zookeeper]   at 
com.sun.jmx.interceptor.DefaultMBeanServerInterceptor.registerDynamicMBean(DefaultMBeanServerInterceptor.java:966)
[13:14:30]W: [org.apache.ignite:ignite-zookeeper]   at 
com.sun.jmx.interceptor.DefaultMBeanServerInterceptor.registerObject(DefaultMBeanServerInterceptor.java:900)
[13:14:30]W: [org.apache.ignite:ignite-zookeeper]   at 
com.sun.jmx.interceptor.DefaultMBeanServerInterceptor.registerMBean(DefaultMBeanServerInterceptor.java:324)
[13:14:30]W: [org.apache.ignite:ignite-zookeeper]   at 
com.sun.jmx.mbeanserver.JmxMBeanServer.registerMBean(JmxMBeanServer.java:522)
[13:14:30]W: [org.apache.ignite:ignite-zookeeper]   at 
org.apache.zookeeper.jmx.ManagedUtil.registerLog4jMBeans(ManagedUtil.java:53)
[13:14:30]W: [org.apache.ignite:ignite-zookeeper]   at 
org.apache.zookeeper.server.quorum.QuorumPeerMain.runFromConfig(QuorumPeerMain.java:122)
{noformat}
2)
{noformat}
"main" #1 prio=5 os_prio=0 tid=0x7fd28400d000 nid=0xbb7 waiting on 
condition [0x7fd28c0a9000]
   java.lang.Thread.State: WAITING (parking)
at sun.misc.Unsafe.park(Native Method)
- parking to wait for  <0xc1a24e40> (a 
java.util.concurrent.CountDownLatch$Sync)
at java.util.concurrent.locks.LockSupport.park(LockSupport.java:175)
at 
java.util.concurrent.locks.AbstractQueuedSynchronizer.parkAndCheckInterrupt(AbstractQueuedSynchronizer.java:836)
at 
java.util.concurrent.locks.AbstractQueuedSynchronizer.doAcquireSharedInterruptibly(AbstractQueuedSynchronizer.java:997)
at 
java.util.concurrent.locks.AbstractQueuedSynchronizer.acquireSharedInterruptibly(AbstractQueuedSynchronizer.java:1304)
at java.util.concurrent.CountDownLatch.await(CountDownLatch.java:231)
at 
org.apache.ignite.internal.util.IgniteUtils.awaitQuiet(IgniteUtils.java:7564)
at 
org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.grid(IgnitionEx.java:1666)
at org.apache.ignite.internal.IgnitionEx.allGrids(IgnitionEx.java:1284)
at org.apache.ignite.internal.IgnitionEx.allGrids(IgnitionEx.java:1262)
at org.apache.ignite.Ignition.allGrids(Ignition.java:502)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.checkInternalStructuresCleanup(ZookeeperDiscoverySpiTest.java:485)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.afterTest(ZookeeperDiscoverySpiTest.java:453)
at 
org.apache.ignite.testframework.junits.GridAbstractTest.tearDown(GridAbstractTest.java:1694)
at 
org.apache.ignite.testframework.junits.common.GridCommonAbstractTest.tearDown(GridCommonAbstractTest.java:497)
at junit.framework.TestCase.runBare(TestCase.java:146)
at junit.framework.TestResult$1.protect(TestResult.java:122)
at junit.framework.TestResult.runProtected(TestResult.java:142)
at junit.framework.TestResult.run(TestResult.java:125)
at junit.framework.TestCase.run(TestCase.java:129)
at junit.framework.TestSuite.runTest(TestSuite.java:255)
at junit.framework.TestSuite.run(TestSuite.java:250)
at junit.framework.TestSuite.runTest(TestSuite.java:255)
at junit.framework.TestSuite.run(TestSuite.java:250)
at 
org.junit.internal.runners.JUnit38ClassRunner.run(JUnit38ClassRunner.java:84)
at 
org.apache.maven.surefire.junit4.JUnit4Provider.execute(JUnit4Provider.java:369)
at 
org.apache.maven.surefire.junit4.JUnit4Provider.executeWithRerun(JUnit4Provider.java:275)
at 
org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:239)
at 
org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:160)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at 

[jira] [Updated] (IGNITE-9127) ZooKeeper (Discovery) 1 suite hang.

2018-07-30 Thread Vitaliy Biryukov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-9127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov updated IGNITE-9127:
-
Affects Version/s: 2.7

> ZooKeeper (Discovery) 1 suite hang.
> ---
>
> Key: IGNITE-9127
> URL: https://issues.apache.org/jira/browse/IGNITE-9127
> Project: Ignite
>  Issue Type: Bug
>Affects Versions: 2.6
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (IGNITE-9127) ZooKeeper (Discovery) 1 suite hang.

2018-07-30 Thread Vitaliy Biryukov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-9127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov updated IGNITE-9127:
-
Affects Version/s: (was: 2.7)
   2.6

> ZooKeeper (Discovery) 1 suite hang.
> ---
>
> Key: IGNITE-9127
> URL: https://issues.apache.org/jira/browse/IGNITE-9127
> Project: Ignite
>  Issue Type: Bug
>Affects Versions: 2.6
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (IGNITE-9127) ZooKeeper (Discovery) 1 suite hang.

2018-07-30 Thread Vitaliy Biryukov (JIRA)
Vitaliy Biryukov created IGNITE-9127:


 Summary: ZooKeeper (Discovery) 1 suite hang.
 Key: IGNITE-9127
 URL: https://issues.apache.org/jira/browse/IGNITE-9127
 Project: Ignite
  Issue Type: Bug
Reporter: Vitaliy Biryukov
Assignee: Vitaliy Biryukov






--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Comment Edited] (IGNITE-4380) Cache invoke calls can be lost

2018-07-27 Thread Vitaliy Biryukov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-4380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16559945#comment-16559945
 ] 

Vitaliy Biryukov edited comment on IGNITE-4380 at 7/27/18 4:40 PM:
---

[~Alexey Kuznetsov], LGTM.


was (Author: vitaliyb):
[~Alexey Kuznetsov], LGYM.

> Cache invoke calls can be lost
> --
>
> Key: IGNITE-4380
> URL: https://issues.apache.org/jira/browse/IGNITE-4380
> Project: Ignite
>  Issue Type: Bug
>  Components: cache
>Affects Versions: 2.0
>Reporter: Semen Boikov
>Assignee: Alexey Kuznetsov
>Priority: Critical
>  Labels: MakeTeamcityGreenAgain
>
> * Recently added test 
> GridCacheAbstractFullApiSelfTest.testInvokeAllMultithreaded fails on TC in 
> various configurations with transactional cache.
> Example of failure 
> GridCacheReplicatedOffHeapTieredMultiNodeFullApiSelfTest.testInvokeAllMultithreaded:
> {noformat}
> junit.framework.AssertionFailedError: expected:<2> but was:<10868>
> at junit.framework.Assert.fail(Assert.java:57)
> at junit.framework.Assert.failNotEquals(Assert.java:329)
> at junit.framework.Assert.assertEquals(Assert.java:78)
> at junit.framework.Assert.assertEquals(Assert.java:234)
> at junit.framework.Assert.assertEquals(Assert.java:241)
> at junit.framework.TestCase.assertEquals(TestCase.java:409)
> at 
> org.apache.ignite.internal.processors.cache.GridCacheAbstractFullApiSelfTest.testInvokeAllMultithreaded(GridCacheAbstractFullApiSelfTest.java:342)
> at sun.reflect.GeneratedMethodAccessor96.invoke(Unknown Source)
> at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> at java.lang.reflect.Method.invoke(Method.java:606)
> at junit.framework.TestCase.runTest(TestCase.java:176)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.runTestInternal(GridAbstractTest.java:1803)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.access$000(GridAbstractTest.java:118)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest$4.run(GridAbstractTest.java:1718)
> at java.lang.Thread.run(Thread.java:745)
> {noformat}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (IGNITE-4380) Cache invoke calls can be lost

2018-07-27 Thread Vitaliy Biryukov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-4380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16559945#comment-16559945
 ] 

Vitaliy Biryukov commented on IGNITE-4380:
--

[~Alexey Kuznetsov], LGYM.

> Cache invoke calls can be lost
> --
>
> Key: IGNITE-4380
> URL: https://issues.apache.org/jira/browse/IGNITE-4380
> Project: Ignite
>  Issue Type: Bug
>  Components: cache
>Affects Versions: 2.0
>Reporter: Semen Boikov
>Assignee: Alexey Kuznetsov
>Priority: Critical
>  Labels: MakeTeamcityGreenAgain
>
> * Recently added test 
> GridCacheAbstractFullApiSelfTest.testInvokeAllMultithreaded fails on TC in 
> various configurations with transactional cache.
> Example of failure 
> GridCacheReplicatedOffHeapTieredMultiNodeFullApiSelfTest.testInvokeAllMultithreaded:
> {noformat}
> junit.framework.AssertionFailedError: expected:<2> but was:<10868>
> at junit.framework.Assert.fail(Assert.java:57)
> at junit.framework.Assert.failNotEquals(Assert.java:329)
> at junit.framework.Assert.assertEquals(Assert.java:78)
> at junit.framework.Assert.assertEquals(Assert.java:234)
> at junit.framework.Assert.assertEquals(Assert.java:241)
> at junit.framework.TestCase.assertEquals(TestCase.java:409)
> at 
> org.apache.ignite.internal.processors.cache.GridCacheAbstractFullApiSelfTest.testInvokeAllMultithreaded(GridCacheAbstractFullApiSelfTest.java:342)
> at sun.reflect.GeneratedMethodAccessor96.invoke(Unknown Source)
> at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> at java.lang.reflect.Method.invoke(Method.java:606)
> at junit.framework.TestCase.runTest(TestCase.java:176)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.runTestInternal(GridAbstractTest.java:1803)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.access$000(GridAbstractTest.java:118)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest$4.run(GridAbstractTest.java:1718)
> at java.lang.Thread.run(Thread.java:745)
> {noformat}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (IGNITE-8179) ZookeeperDiscoverySpiTest#testCommunicationFailureResolve_KillRandom always fails on TC

2018-07-25 Thread Vitaliy Biryukov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-8179?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16555651#comment-16555651
 ] 

Vitaliy Biryukov commented on IGNITE-8179:
--

[~sergey-chugunov], Done, please, take a look.

> ZookeeperDiscoverySpiTest#testCommunicationFailureResolve_KillRandom always 
> fails on TC
> ---
>
> Key: IGNITE-8179
> URL: https://issues.apache.org/jira/browse/IGNITE-8179
> Project: Ignite
>  Issue Type: Bug
>  Components: zookeeper
>Reporter: Sergey Chugunov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
>
> Test fails on TC with the following stack trace:
> {noformat}
> class org.apache.ignite.IgniteCheckedException: Failed to start manager: 
> GridManagerAdapter [enabled=true, 
> name=org.apache.ignite.internal.managers.discovery.GridDiscoveryManager]
> at 
> org.apache.ignite.internal.IgniteKernal.startManager(IgniteKernal.java:1698)
> at org.apache.ignite.internal.IgniteKernal.start(IgniteKernal.java:1007)
> at 
> org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.start0(IgnitionEx.java:1977)
> at 
> org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.start(IgnitionEx.java:1720)
> at org.apache.ignite.internal.IgnitionEx.start0(IgnitionEx.java:1148)
> at org.apache.ignite.internal.IgnitionEx.start(IgnitionEx.java:646)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.startGrid(GridAbstractTest.java:882)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.startGrid(GridAbstractTest.java:845)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.startGrid(GridAbstractTest.java:833)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.startGrid(GridAbstractTest.java:799)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.startGrids(GridAbstractTest.java:683)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.startGridsMultiThreaded(GridAbstractTest.java:710)
> at 
> org.apache.ignite.testframework.junits.common.GridCommonAbstractTest.startGridsMultiThreaded(GridCommonAbstractTest.java:507)
> at 
> org.apache.ignite.testframework.junits.common.GridCommonAbstractTest.startGridsMultiThreaded(GridCommonAbstractTest.java:497)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.testCommunicationFailureResolve_KillRandom(ZookeeperDiscoverySpiTest.java:2742)
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
> at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> at java.lang.reflect.Method.invoke(Method.java:498)
> at junit.framework.TestCase.runTest(TestCase.java:176)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.runTestInternal(GridAbstractTest.java:2080)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.access$000(GridAbstractTest.java:140)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest$5.run(GridAbstractTest.java:1995)
> at java.lang.Thread.run(Thread.java:748)
> Caused by: class org.apache.ignite.IgniteCheckedException: Failed to start 
> SPI: ZookeeperDiscoverySpi [zkRootPath=/apacheIgnite, 
> zkConnectionString=127.0.0.1:40921,127.0.0.1:35014,127.0.0.1:38754, 
> joinTimeout=0, sesTimeout=2000, clientReconnectDisabled=false, 
> internalLsnr=null]
> at 
> org.apache.ignite.internal.managers.GridManagerAdapter.startSpi(GridManagerAdapter.java:300)
> at 
> org.apache.ignite.internal.managers.discovery.GridDiscoveryManager.start(GridDiscoveryManager.java:905)
> at 
> org.apache.ignite.internal.IgniteKernal.startManager(IgniteKernal.java:1693)
> ... 23 more
> Caused by: class org.apache.ignite.spi.IgniteSpiException: Failed to 
> initialize Zookeeper nodes
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.initZkNodes(ZookeeperDiscoveryImpl.java:827)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.startJoin(ZookeeperDiscoveryImpl.java:957)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.joinTopology(ZookeeperDiscoveryImpl.java:775)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.startJoinAndWait(ZookeeperDiscoveryImpl.java:693)
> at 
> org.apache.ignite.spi.discovery.zk.ZookeeperDiscoverySpi.spiStart(ZookeeperDiscoverySpi.java:471)
> at 
> org.apache.ignite.internal.managers.GridManagerAdapter.startSpi(GridManagerAdapter.java:297)
> ... 25 more
> Caused by: 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperClientFailedException: 
> 

[jira] [Commented] (IGNITE-8179) ZookeeperDiscoverySpiTest#testCommunicationFailureResolve_KillRandom always fails on TC

2018-07-23 Thread Vitaliy Biryukov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-8179?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16552577#comment-16552577
 ] 

Vitaliy Biryukov commented on IGNITE-8179:
--

[~sergey-chugunov], I've removed PME fix (item 3) from this PR. It will be 
fixed in [IGNITE-8783|https://issues.apache.org/jira/browse/IGNITE-8783]

> ZookeeperDiscoverySpiTest#testCommunicationFailureResolve_KillRandom always 
> fails on TC
> ---
>
> Key: IGNITE-8179
> URL: https://issues.apache.org/jira/browse/IGNITE-8179
> Project: Ignite
>  Issue Type: Bug
>  Components: zookeeper
>Reporter: Sergey Chugunov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
>
> Test fails on TC with the following stack trace:
> {noformat}
> class org.apache.ignite.IgniteCheckedException: Failed to start manager: 
> GridManagerAdapter [enabled=true, 
> name=org.apache.ignite.internal.managers.discovery.GridDiscoveryManager]
> at 
> org.apache.ignite.internal.IgniteKernal.startManager(IgniteKernal.java:1698)
> at org.apache.ignite.internal.IgniteKernal.start(IgniteKernal.java:1007)
> at 
> org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.start0(IgnitionEx.java:1977)
> at 
> org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.start(IgnitionEx.java:1720)
> at org.apache.ignite.internal.IgnitionEx.start0(IgnitionEx.java:1148)
> at org.apache.ignite.internal.IgnitionEx.start(IgnitionEx.java:646)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.startGrid(GridAbstractTest.java:882)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.startGrid(GridAbstractTest.java:845)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.startGrid(GridAbstractTest.java:833)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.startGrid(GridAbstractTest.java:799)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.startGrids(GridAbstractTest.java:683)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.startGridsMultiThreaded(GridAbstractTest.java:710)
> at 
> org.apache.ignite.testframework.junits.common.GridCommonAbstractTest.startGridsMultiThreaded(GridCommonAbstractTest.java:507)
> at 
> org.apache.ignite.testframework.junits.common.GridCommonAbstractTest.startGridsMultiThreaded(GridCommonAbstractTest.java:497)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.testCommunicationFailureResolve_KillRandom(ZookeeperDiscoverySpiTest.java:2742)
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
> at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> at java.lang.reflect.Method.invoke(Method.java:498)
> at junit.framework.TestCase.runTest(TestCase.java:176)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.runTestInternal(GridAbstractTest.java:2080)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.access$000(GridAbstractTest.java:140)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest$5.run(GridAbstractTest.java:1995)
> at java.lang.Thread.run(Thread.java:748)
> Caused by: class org.apache.ignite.IgniteCheckedException: Failed to start 
> SPI: ZookeeperDiscoverySpi [zkRootPath=/apacheIgnite, 
> zkConnectionString=127.0.0.1:40921,127.0.0.1:35014,127.0.0.1:38754, 
> joinTimeout=0, sesTimeout=2000, clientReconnectDisabled=false, 
> internalLsnr=null]
> at 
> org.apache.ignite.internal.managers.GridManagerAdapter.startSpi(GridManagerAdapter.java:300)
> at 
> org.apache.ignite.internal.managers.discovery.GridDiscoveryManager.start(GridDiscoveryManager.java:905)
> at 
> org.apache.ignite.internal.IgniteKernal.startManager(IgniteKernal.java:1693)
> ... 23 more
> Caused by: class org.apache.ignite.spi.IgniteSpiException: Failed to 
> initialize Zookeeper nodes
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.initZkNodes(ZookeeperDiscoveryImpl.java:827)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.startJoin(ZookeeperDiscoveryImpl.java:957)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.joinTopology(ZookeeperDiscoveryImpl.java:775)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.startJoinAndWait(ZookeeperDiscoveryImpl.java:693)
> at 
> org.apache.ignite.spi.discovery.zk.ZookeeperDiscoverySpi.spiStart(ZookeeperDiscoverySpi.java:471)
> at 
> org.apache.ignite.internal.managers.GridManagerAdapter.startSpi(GridManagerAdapter.java:297)
> ... 25 more
> Caused by: 

[jira] [Commented] (IGNITE-8179) ZookeeperDiscoverySpiTest#testCommunicationFailureResolve_KillRandom always fails on TC

2018-07-09 Thread Vitaliy Biryukov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-8179?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16536890#comment-16536890
 ] 

Vitaliy Biryukov commented on IGNITE-8179:
--

Hi, [~sergey-chugunov] please take a look.

This test falls for some reasons:
 # Rarely random killed all server nodes, and after tries to start client node. 
Now, in this case, а server node will start.
 # Sometimes *awaitPartitionMapExchange* throws *IllegalStateException*, 
*IgniteClientDisconnectedException* or some exception caused by node stoping 
(Nodes forced to kill by communication failure resolver). I've changed it to 
*waitForTopology*.
 # In case of client reconnects *checkEventsConsistency* throws assertion 
error. So I clean events map at each client reconnect.
 # In cases where а coordinator is killed by communication failure resolver 
sometimes cluster hang on PME. Scenario: PME start -> coordinator left -> some 
node send latch countdown ack to future coordinator -> future coordinator 
creates *createClientLatch* and immediately complete it, because ack was 
received from non-coordinator node.

And there is still one more reason for fails. Rarely long GC pause occurs 
during the first connection to Zk cluster and Ignite node fails to start 
because of first connection timeout. 
This problem can be solved by a significant increase in the session timeout 
(calculated as session timeout/Zk servers count *ClientCnxn:381*), but this 
will greatly increase test processing time. Or by increasing the size of the 
heap.
{noformat}
[20:09:28]W: [org.apache.ignite:ignite-zookeeper] [2018-07-06 
17:09:28,624][WARN ][jvm-pause-detector-worker][ZookeeperDiscoverySpiTest9] 
Possible too long JVM pause: 2803 milliseconds.
[20:09:28] : [Step 3/4] [2018-07-06 17:09:28,624][INFO 
][zk-internal.ZookeeperDiscoverySpiTest9-SendThread(127.0.0.1:39805)][ClientCnxn]
 Client session timed out, have not heard from server in 2854ms for sessionid 
0x0, closing socket connection and attempting reconnect
[20:09:28]W: [org.apache.ignite:ignite-zookeeper] [2018-07-06 
17:09:28,747][WARN 
][zk-client-timer-internal.ZookeeperDiscoverySpiTest9][ZookeeperDiscoveryImpl] 
Connection to Zookeeper server is lost, local node SEGMENTED.
{noformat}

> ZookeeperDiscoverySpiTest#testCommunicationFailureResolve_KillRandom always 
> fails on TC
> ---
>
> Key: IGNITE-8179
> URL: https://issues.apache.org/jira/browse/IGNITE-8179
> Project: Ignite
>  Issue Type: Bug
>  Components: zookeeper
>Reporter: Sergey Chugunov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
>
> Test fails on TC with the following stack trace:
> {noformat}
> class org.apache.ignite.IgniteCheckedException: Failed to start manager: 
> GridManagerAdapter [enabled=true, 
> name=org.apache.ignite.internal.managers.discovery.GridDiscoveryManager]
> at 
> org.apache.ignite.internal.IgniteKernal.startManager(IgniteKernal.java:1698)
> at org.apache.ignite.internal.IgniteKernal.start(IgniteKernal.java:1007)
> at 
> org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.start0(IgnitionEx.java:1977)
> at 
> org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.start(IgnitionEx.java:1720)
> at org.apache.ignite.internal.IgnitionEx.start0(IgnitionEx.java:1148)
> at org.apache.ignite.internal.IgnitionEx.start(IgnitionEx.java:646)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.startGrid(GridAbstractTest.java:882)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.startGrid(GridAbstractTest.java:845)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.startGrid(GridAbstractTest.java:833)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.startGrid(GridAbstractTest.java:799)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.startGrids(GridAbstractTest.java:683)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.startGridsMultiThreaded(GridAbstractTest.java:710)
> at 
> org.apache.ignite.testframework.junits.common.GridCommonAbstractTest.startGridsMultiThreaded(GridCommonAbstractTest.java:507)
> at 
> org.apache.ignite.testframework.junits.common.GridCommonAbstractTest.startGridsMultiThreaded(GridCommonAbstractTest.java:497)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.testCommunicationFailureResolve_KillRandom(ZookeeperDiscoverySpiTest.java:2742)
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
> at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> at 

[jira] [Commented] (IGNITE-8180) ZookeeperDiscoverySpiTest#testQuorumRestore fails on TC

2018-07-05 Thread Vitaliy Biryukov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-8180?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16533398#comment-16533398
 ] 

Vitaliy Biryukov commented on IGNITE-8180:
--

[~NSAmelchev], LGTM.

 

> ZookeeperDiscoverySpiTest#testQuorumRestore fails on TC
> ---
>
> Key: IGNITE-8180
> URL: https://issues.apache.org/jira/browse/IGNITE-8180
> Project: Ignite
>  Issue Type: Bug
>  Components: zookeeper
>Reporter: Sergey Chugunov
>Assignee: Amelchev Nikita
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
>
> Test fails on TC with the following stack trace:
> {noformat}
> class org.apache.ignite.IgniteCheckedException: Failed to start manager: 
> GridManagerAdapter [enabled=true, 
> name=org.apache.ignite.internal.managers.discovery.GridDiscoveryManager]
> at 
> org.apache.ignite.internal.IgniteKernal.startManager(IgniteKernal.java:1698)
> at org.apache.ignite.internal.IgniteKernal.start(IgniteKernal.java:1007)
> at 
> org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.start0(IgnitionEx.java:1977)
> at 
> org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.start(IgnitionEx.java:1720)
> at org.apache.ignite.internal.IgnitionEx.start0(IgnitionEx.java:1148)
> at org.apache.ignite.internal.IgnitionEx.start(IgnitionEx.java:646)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.startGrid(GridAbstractTest.java:882)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.startGrid(GridAbstractTest.java:845)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.startGrid(GridAbstractTest.java:833)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.startGrid(GridAbstractTest.java:799)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.startGrids(GridAbstractTest.java:683)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.testQuorumRestore(ZookeeperDiscoverySpiTest.java:1077)
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
> at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> at java.lang.reflect.Method.invoke(Method.java:498)
> at junit.framework.TestCase.runTest(TestCase.java:176)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.runTestInternal(GridAbstractTest.java:2080)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.access$000(GridAbstractTest.java:140)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest$5.run(GridAbstractTest.java:1995)
> at java.lang.Thread.run(Thread.java:748)
> Caused by: class org.apache.ignite.IgniteCheckedException: Failed to start 
> SPI: ZookeeperDiscoverySpi [zkRootPath=/apacheIgnite, 
> zkConnectionString=127.0.0.1:40921,127.0.0.1:35014,127.0.0.1:38754, 
> joinTimeout=0, sesTimeout=15000, clientReconnectDisabled=false, 
> internalLsnr=null]
> at 
> org.apache.ignite.internal.managers.GridManagerAdapter.startSpi(GridManagerAdapter.java:300)
> at 
> org.apache.ignite.internal.managers.discovery.GridDiscoveryManager.start(GridDiscoveryManager.java:905)
> at 
> org.apache.ignite.internal.IgniteKernal.startManager(IgniteKernal.java:1693)
> ... 20 more
> Caused by: class org.apache.ignite.spi.IgniteSpiException: Failed to 
> initialize Zookeeper nodes
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.initZkNodes(ZookeeperDiscoveryImpl.java:827)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.startJoin(ZookeeperDiscoveryImpl.java:957)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.joinTopology(ZookeeperDiscoveryImpl.java:775)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.startJoinAndWait(ZookeeperDiscoveryImpl.java:693)
> at 
> org.apache.ignite.spi.discovery.zk.ZookeeperDiscoverySpi.spiStart(ZookeeperDiscoverySpi.java:471)
> at 
> org.apache.ignite.internal.managers.GridManagerAdapter.startSpi(GridManagerAdapter.java:297)
> ... 22 more
> Caused by: 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperClientFailedException: 
> org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode 
> = ConnectionLoss for /apacheIgnite
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperClient.onZookeeperError(ZookeeperClient.java:758)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperClient.exists(ZookeeperClient.java:276)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.initZkNodes(ZookeeperDiscoveryImpl.java:789)
> ... 27 more
> Caused by: 

[jira] [Commented] (IGNITE-8184) ZookeeperDiscoverySpiTest#testTopologyChangeMultithreaded_RestartZk* tests fail on TC

2018-07-04 Thread Vitaliy Biryukov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-8184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16532957#comment-16532957
 ] 

Vitaliy Biryukov commented on IGNITE-8184:
--

[~daradurvs], [~sergey-chugunov],
This tests fixed in 
[IGNITE-8182|https://issues.apache.org/jira/browse/IGNITE-8182].
[TC 
link|https://ci.ignite.apache.org/viewType.html?buildTypeId=IgniteTests24Java8_ZooKeeperDiscovery1=buildTypeStatusDiv_IgniteTests24Java8=pull%2F4192%2Fhead].
This issue can be closed as a duplicate.

> ZookeeperDiscoverySpiTest#testTopologyChangeMultithreaded_RestartZk* tests 
> fail on TC
> -
>
> Key: IGNITE-8184
> URL: https://issues.apache.org/jira/browse/IGNITE-8184
> Project: Ignite
>  Issue Type: Bug
>  Components: zookeeper
>Reporter: Sergey Chugunov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
> Fix For: 2.7
>
>
> Tests fail on TC but pass locally.
> There are some errors in logs like this:
> {noformat}
> class org.apache.ignite.IgniteCheckedException: Failed to start manager: 
> GridManagerAdapter [enabled=true, 
> name=org.apache.ignite.internal.managers.discovery.GridDiscoveryManager]
> at 
> org.apache.ignite.internal.IgniteKernal.startManager(IgniteKernal.java:1698)
> at org.apache.ignite.internal.IgniteKernal.start(IgniteKernal.java:1007)
> at 
> org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.start0(IgnitionEx.java:1977)
> at 
> org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.start(IgnitionEx.java:1720)
> at org.apache.ignite.internal.IgnitionEx.start0(IgnitionEx.java:1148)
> at org.apache.ignite.internal.IgnitionEx.start(IgnitionEx.java:646)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.startGrid(GridAbstractTest.java:882)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.startGrid(GridAbstractTest.java:845)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.startGrid(GridAbstractTest.java:833)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.startGrid(GridAbstractTest.java:799)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest$3.call(GridAbstractTest.java:742)
> at 
> org.apache.ignite.testframework.GridTestThread.run(GridTestThread.java:86)
> Caused by: class org.apache.ignite.IgniteCheckedException: Failed to start 
> SPI: ZookeeperDiscoverySpi [zkRootPath=/apacheIgnite, 
> zkConnectionString=127.0.0.1:45822,127.0.0.1:46661,127.0.0.1:43724, 
> joinTimeout=0, sesTimeout=3, clientReconnectDisabled=false, 
> internalLsnr=null]
> at 
> org.apache.ignite.internal.managers.GridManagerAdapter.startSpi(GridManagerAdapter.java:300)
> at 
> org.apache.ignite.internal.managers.discovery.GridDiscoveryManager.start(GridDiscoveryManager.java:905)
> at 
> org.apache.ignite.internal.IgniteKernal.startManager(IgniteKernal.java:1693)
> ... 11 more
> Caused by: class org.apache.ignite.spi.IgniteSpiException: Failed to 
> initialize Zookeeper nodes
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.initZkNodes(ZookeeperDiscoveryImpl.java:827)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.startJoin(ZookeeperDiscoveryImpl.java:957)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.joinTopology(ZookeeperDiscoveryImpl.java:775)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.startJoinAndWait(ZookeeperDiscoveryImpl.java:693)
> at 
> org.apache.ignite.spi.discovery.zk.ZookeeperDiscoverySpi.spiStart(ZookeeperDiscoverySpi.java:471)
> at 
> org.apache.ignite.internal.managers.GridManagerAdapter.startSpi(GridManagerAdapter.java:297)
> ... 13 more
> Caused by: 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperClientFailedException: 
> org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode 
> = ConnectionLoss for /apacheIgnite
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperClient.onZookeeperError(ZookeeperClient.java:808)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperClient.exists(ZookeeperClient.java:276)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.initZkNodes(ZookeeperDiscoveryImpl.java:789)
> ... 18 more
> Caused by: org.apache.zookeeper.KeeperException$ConnectionLossException: 
> KeeperErrorCode = ConnectionLoss for /apacheIgnite
> at org.apache.zookeeper.KeeperException.create(KeeperException.java:99)
> at org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
> at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1045)
> at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1073)
> 

[jira] [Commented] (IGNITE-8182) ZookeeperDiscoverySpiTest#testRandomTopologyChanges_RestartZk fails on TC

2018-07-04 Thread Vitaliy Biryukov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-8182?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16532435#comment-16532435
 ] 

Vitaliy Biryukov commented on IGNITE-8182:
--

Hi, [~dpavlov] please take a look.

> ZookeeperDiscoverySpiTest#testRandomTopologyChanges_RestartZk fails on TC
> -
>
> Key: IGNITE-8182
> URL: https://issues.apache.org/jira/browse/IGNITE-8182
> Project: Ignite
>  Issue Type: Bug
>  Components: zookeeper
>Reporter: Sergey Chugunov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
> Fix For: 2.7
>
>
> Test fails on TC with the following stack trace in logs:
> {noformat}
> org.apache.ignite.IgniteCheckedException: Failed to start manager: 
> GridManagerAdapter [enabled=true, 
> name=org.apache.ignite.internal.managers.discovery.GridDiscoveryManager]
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.randomTopologyChanges(ZookeeperDiscoverySpiTest.java:3791)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.testRandomTopologyChanges_RestartZk(ZookeeperDiscoverySpiTest.java:2035)
> Caused by: org.apache.ignite.IgniteCheckedException: Failed to start SPI: 
> ZookeeperDiscoverySpi [zkRootPath=/apacheIgnite, 
> zkConnectionString=127.0.0.1:45427,127.0.0.1:38620,127.0.0.1:45472, 
> joinTimeout=0, sesTimeout=3, clientReconnectDisabled=false, 
> internalLsnr=null]
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.randomTopologyChanges(ZookeeperDiscoverySpiTest.java:3791)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.testRandomTopologyChanges_RestartZk(ZookeeperDiscoverySpiTest.java:2035)
> Caused by: org.apache.ignite.spi.IgniteSpiException: Failed to initialize 
> Zookeeper nodes
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.randomTopologyChanges(ZookeeperDiscoverySpiTest.java:3791)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.testRandomTopologyChanges_RestartZk(ZookeeperDiscoverySpiTest.java:2035)
> Caused by: 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperClientFailedException: 
> org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode 
> = ConnectionLoss for /apacheIgnite
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.randomTopologyChanges(ZookeeperDiscoverySpiTest.java:3791)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.testRandomTopologyChanges_RestartZk(ZookeeperDiscoverySpiTest.java:2035)
> Caused by: org.apache.zookeeper.KeeperException$ConnectionLossException: 
> KeeperErrorCode = ConnectionLoss for /apacheIgnite
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.randomTopologyChanges(ZookeeperDiscoverySpiTest.java:3791)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.testRandomTopologyChanges_RestartZk(ZookeeperDiscoverySpiTest.java:2035)
> {noformat}
> Reproducible on TC, investigation of failure conditions is needed.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (IGNITE-8188) Batching operations should perform check for ZooKeeper request max size

2018-06-29 Thread Vitaliy Biryukov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-8188?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16527569#comment-16527569
 ] 

Vitaliy Biryukov commented on IGNITE-8188:
--

[~NSAmelchev], LGTM.

> Batching operations should perform check for ZooKeeper request max size
> ---
>
> Key: IGNITE-8188
> URL: https://issues.apache.org/jira/browse/IGNITE-8188
> Project: Ignite
>  Issue Type: Improvement
>  Components: zookeeper
>Reporter: Sergey Chugunov
>Assignee: Amelchev Nikita
>Priority: Major
> Fix For: 2.7
>
>
> As ZooKeeper documentation 
> [says|https://zookeeper.apache.org/doc/r3.4.3/api/org/apache/zookeeper/ZooKeeper.html#multi(java.lang.Iterable)]
>  batching *multi* operation has a limit for size of a single request.
> ZookeeperClient batching methods *createAll* and *deleteAll* should check 
> this limit and fall back to execute operations one by one.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (IGNITE-8131) ZookeeperDiscoverySpiTest#testClientReconnectSessionExpire* tests fail on TC

2018-06-29 Thread Vitaliy Biryukov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-8131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16527443#comment-16527443
 ] 

Vitaliy Biryukov commented on IGNITE-8131:
--

[~garus.d.g], LGTM.

> ZookeeperDiscoverySpiTest#testClientReconnectSessionExpire* tests fail on TC
> 
>
> Key: IGNITE-8131
> URL: https://issues.apache.org/jira/browse/IGNITE-8131
> Project: Ignite
>  Issue Type: Bug
>  Components: zookeeper
>Reporter: Sergey Chugunov
>Assignee: Denis Garus
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
> Fix For: 2.7
>
> Attachments: ZK_client_reconnect_failure.log, 
> ZK_client_reconnect_success.log
>
>
> Two tests always fail on TC with the assertion
> {noformat}
> junit.framework.AssertionFailedError: Failed to wait for disconnect/reconnect 
> event.
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.waitReconnectEvent(ZookeeperDiscoverySpiTest.java:4221)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.reconnectClientNodes(ZookeeperDiscoverySpiTest.java:4183)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.clientReconnectSessionExpire(ZookeeperDiscoverySpiTest.java:2231)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.testClientReconnectSessionExpire1_1(ZookeeperDiscoverySpiTest.java:2206)
> {noformat}
> from client disconnect/reconnect events check. Obviously client doesn't 
> generate these events as it supposed to do.
> (TC runs can be found 
> [here|https://ci.ignite.apache.org/viewType.html?buildTypeId=IgniteTests24Java8_IgniteZooKeeperDiscovery_IgniteTests24Java8=pull%2F3730%2Fhead=buildTypeStatusDiv]).
> It is possible to reproduce test failure locally as well, but with low 
> probability: one failure for 50 or even 300 successful executions.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Comment Edited] (IGNITE-8182) ZookeeperDiscoverySpiTest#testRandomTopologyChanges_RestartZk fails on TC

2018-06-27 Thread Vitaliy Biryukov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-8182?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16524887#comment-16524887
 ] 

Vitaliy Biryukov edited comment on IGNITE-8182 at 6/27/18 10:35 AM:


Hi, [~sergey-chugunov]  please take a look.

This test falls for two reasons:
#  The bug in *curator-test*[1]. Scenario: in *TestingCluster* static block, 
some classes are loaded and their bytecode rewrote. Further GC unload some 
classes and the next time the classes will be loaded without bytecode 
rewriting. 
This scenario reproduces only in full test suite because some rewritten classes 
are used only during star, stop or leader election.
This bug was fixed in latest versions of curator-test, but the new versions are 
not compatible with ZooKeeper 3.4.x[2]. So I've added the workaround to 
*ZookeeperDiscoverySpiTestSuite1*.
#  Rarely ZK servers restarted too often and the connection does not have time 
to recover.

[IGNITE-8184|https://issues.apache.org/jira/browse/IGNITE-8184] can be closed 
as duplicate.

[1] https://github.com/Netflix/curator/issues/121
[2] https://curator.apache.org/zk-compatibility.html


was (Author: vitaliyb):
Hi, [~sergey-chugunov]  please take a look.

This test falls for two reasons:
#  The bug in curator-test[1]. Scenario: in TestingCluster static block, some 
classes are loaded and their bytecode rewrote. Further GC unload some classes 
and the next time the classes will be loaded without bytecode rewriting. 
This scenario reproduces only in full test suite because some rewritten classes 
are used only during star, stop or leader election.
This bug was fixed in latest versions of curator-test, but the new versions are 
not compatible with ZooKeeper 3.4.x[2]. So I've added the workaround to 
ZookeeperDiscoverySpiTestSuite1.
#  Rarely ZK servers restarted too often and the connection does not have time 
to recover.

[IGNITE-8184|https://issues.apache.org/jira/browse/IGNITE-8184] can be closed 
as duplicate.

[1] https://github.com/Netflix/curator/issues/121
[2] https://curator.apache.org/zk-compatibility.html

> ZookeeperDiscoverySpiTest#testRandomTopologyChanges_RestartZk fails on TC
> -
>
> Key: IGNITE-8182
> URL: https://issues.apache.org/jira/browse/IGNITE-8182
> Project: Ignite
>  Issue Type: Bug
>  Components: zookeeper
>Reporter: Sergey Chugunov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
>
> Test fails on TC with the following stack trace in logs:
> {noformat}
> org.apache.ignite.IgniteCheckedException: Failed to start manager: 
> GridManagerAdapter [enabled=true, 
> name=org.apache.ignite.internal.managers.discovery.GridDiscoveryManager]
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.randomTopologyChanges(ZookeeperDiscoverySpiTest.java:3791)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.testRandomTopologyChanges_RestartZk(ZookeeperDiscoverySpiTest.java:2035)
> Caused by: org.apache.ignite.IgniteCheckedException: Failed to start SPI: 
> ZookeeperDiscoverySpi [zkRootPath=/apacheIgnite, 
> zkConnectionString=127.0.0.1:45427,127.0.0.1:38620,127.0.0.1:45472, 
> joinTimeout=0, sesTimeout=3, clientReconnectDisabled=false, 
> internalLsnr=null]
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.randomTopologyChanges(ZookeeperDiscoverySpiTest.java:3791)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.testRandomTopologyChanges_RestartZk(ZookeeperDiscoverySpiTest.java:2035)
> Caused by: org.apache.ignite.spi.IgniteSpiException: Failed to initialize 
> Zookeeper nodes
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.randomTopologyChanges(ZookeeperDiscoverySpiTest.java:3791)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.testRandomTopologyChanges_RestartZk(ZookeeperDiscoverySpiTest.java:2035)
> Caused by: 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperClientFailedException: 
> org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode 
> = ConnectionLoss for /apacheIgnite
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.randomTopologyChanges(ZookeeperDiscoverySpiTest.java:3791)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.testRandomTopologyChanges_RestartZk(ZookeeperDiscoverySpiTest.java:2035)
> Caused by: org.apache.zookeeper.KeeperException$ConnectionLossException: 
> KeeperErrorCode = ConnectionLoss for /apacheIgnite
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.randomTopologyChanges(ZookeeperDiscoverySpiTest.java:3791)
> at 
> 

[jira] [Commented] (IGNITE-8182) ZookeeperDiscoverySpiTest#testRandomTopologyChanges_RestartZk fails on TC

2018-06-27 Thread Vitaliy Biryukov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-8182?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16524887#comment-16524887
 ] 

Vitaliy Biryukov commented on IGNITE-8182:
--

Hi, [~sergey-chugunov]  please take a look.

This test falls for two reasons:
#  The bug in curator-test[1]. Scenario: in TestingCluster static block, some 
classes are loaded and their bytecode rewrote. Further GC unload some classes 
and the next time the classes will be loaded without bytecode rewriting. 
This scenario reproduces only in full test suite because some rewritten classes 
are used only during star, stop or leader election.
This bug was fixed in latest versions of curator-test, but the new versions are 
not compatible with ZooKeeper 3.4.x[2]. So I've added the workaround to 
ZookeeperDiscoverySpiTestSuite1.
#  Rarely ZK servers restarted too often and the connection does not have time 
to recover.

[IGNITE-8184|https://issues.apache.org/jira/browse/IGNITE-8184] can be closed 
as duplicate.

[1] https://github.com/Netflix/curator/issues/121
[2] https://curator.apache.org/zk-compatibility.html

> ZookeeperDiscoverySpiTest#testRandomTopologyChanges_RestartZk fails on TC
> -
>
> Key: IGNITE-8182
> URL: https://issues.apache.org/jira/browse/IGNITE-8182
> Project: Ignite
>  Issue Type: Bug
>  Components: zookeeper
>Reporter: Sergey Chugunov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
>
> Test fails on TC with the following stack trace in logs:
> {noformat}
> org.apache.ignite.IgniteCheckedException: Failed to start manager: 
> GridManagerAdapter [enabled=true, 
> name=org.apache.ignite.internal.managers.discovery.GridDiscoveryManager]
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.randomTopologyChanges(ZookeeperDiscoverySpiTest.java:3791)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.testRandomTopologyChanges_RestartZk(ZookeeperDiscoverySpiTest.java:2035)
> Caused by: org.apache.ignite.IgniteCheckedException: Failed to start SPI: 
> ZookeeperDiscoverySpi [zkRootPath=/apacheIgnite, 
> zkConnectionString=127.0.0.1:45427,127.0.0.1:38620,127.0.0.1:45472, 
> joinTimeout=0, sesTimeout=3, clientReconnectDisabled=false, 
> internalLsnr=null]
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.randomTopologyChanges(ZookeeperDiscoverySpiTest.java:3791)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.testRandomTopologyChanges_RestartZk(ZookeeperDiscoverySpiTest.java:2035)
> Caused by: org.apache.ignite.spi.IgniteSpiException: Failed to initialize 
> Zookeeper nodes
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.randomTopologyChanges(ZookeeperDiscoverySpiTest.java:3791)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.testRandomTopologyChanges_RestartZk(ZookeeperDiscoverySpiTest.java:2035)
> Caused by: 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperClientFailedException: 
> org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode 
> = ConnectionLoss for /apacheIgnite
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.randomTopologyChanges(ZookeeperDiscoverySpiTest.java:3791)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.testRandomTopologyChanges_RestartZk(ZookeeperDiscoverySpiTest.java:2035)
> Caused by: org.apache.zookeeper.KeeperException$ConnectionLossException: 
> KeeperErrorCode = ConnectionLoss for /apacheIgnite
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.randomTopologyChanges(ZookeeperDiscoverySpiTest.java:3791)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.testRandomTopologyChanges_RestartZk(ZookeeperDiscoverySpiTest.java:2035)
> {noformat}
> Reproducible on TC, investigation of failure conditions is needed.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (IGNITE-8699) ZookeeperDiscoverySpiTest#testDisconnectOnServersLeft flaky fails (rarely)

2018-06-20 Thread Vitaliy Biryukov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-8699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16518078#comment-16518078
 ] 

Vitaliy Biryukov commented on IGNITE-8699:
--

[~sergey-chugunov], 
Is full *ZooKeeper (Discovery) 1* suit enough? [TC 
link|https://ci.ignite.apache.org/viewLog.html?buildId=1374005=IgniteTests24Java8_ZooKeeperDiscovery1=buildResultsDiv]

You are right about option#1. This case reproduce on my Linux machine sometimes.
The piece of thread dump (full thread dump in attachments):  
{noformat}
Thread [name="disco-event-worker-#2605%internal.ZookeeperDiscoverySpiTest5%", 
id=3211, state=WAITING, blockCnt=2, waitCnt=6]
at sun.misc.Unsafe.park(Native Method)
at java.util.concurrent.locks.LockSupport.park(LockSupport.java:304)
at 
o.a.i.i.util.future.GridFutureAdapter.get0(GridFutureAdapter.java:177)
at o.a.i.i.util.future.GridFutureAdapter.get(GridFutureAdapter.java:140)
at 
o.a.i.i.managers.discovery.GridDiscoveryManager.localJoin(GridDiscoveryManager.java:2190)
at 
o.a.i.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest$2.apply(ZookeeperDiscoverySpiTest.java:315)
- locked java.util.TreeMap@38081448
at 
o.a.i.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest$2.apply(ZookeeperDiscoverySpiTest.java:295)
at 
o.a.i.i.managers.eventstorage.GridEventStorageManager$UserListenerWrapper.onEvent(GridEventStorageManager.java:1477)
at 
o.a.i.i.managers.eventstorage.GridEventStorageManager.notifyListeners(GridEventStorageManager.java:873)
at 
o.a.i.i.managers.eventstorage.GridEventStorageManager.notifyListeners(GridEventStorageManager.java:858)
at 
o.a.i.i.managers.eventstorage.GridEventStorageManager.record0(GridEventStorageManager.java:341)
at 
o.a.i.i.managers.eventstorage.GridEventStorageManager.record(GridEventStorageManager.java:307)
at 
o.a.i.i.managers.discovery.GridDiscoveryManager$DiscoveryWorker.recordEvent(GridDiscoveryManager.java:2703)
at 
o.a.i.i.managers.discovery.GridDiscoveryManager$DiscoveryWorker.body0(GridDiscoveryManager.java:2920)
at 
o.a.i.i.managers.discovery.GridDiscoveryManager$DiscoveryWorker.body(GridDiscoveryManager.java:2732)
at o.a.i.i.util.worker.GridWorker.run(GridWorker.java:110)
at java.lang.Thread.run(Thread.java:748)
{noformat}


> ZookeeperDiscoverySpiTest#testDisconnectOnServersLeft flaky fails (rarely)
> --
>
> Key: IGNITE-8699
> URL: https://issues.apache.org/jira/browse/IGNITE-8699
> Project: Ignite
>  Issue Type: Bug
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
> Attachments: thread-dump-fail-before-local-join
>
>
> *Affected tests:*
> testDisconnectOnServersLeft_1
> testDisconnectOnServersLeft_2
> testDisconnectOnServersLeft_3
> testDisconnectOnServersLeft_4
> testDisconnectOnServersLeft_5
> {noformat}
> junit.framework.AssertionFailedError: Failed to wait for disconnect/reconnect 
> event.
>   at junit.framework.Assert.fail(Assert.java:57)
>   at junit.framework.TestCase.fail(TestCase.java:227)
>   at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.waitReconnectEvent(ZookeeperDiscoverySpiTest.java:4685)
>   at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.disconnectOnServersLeft(ZookeeperDiscoverySpiTest.java:3541)
>   at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.testDisconnectOnServersLeft_4(ZookeeperDiscoverySpiTest.java:3476)
>   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>   at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>   at java.lang.reflect.Method.invoke(Method.java:498)
>   at junit.framework.TestCase.runTest(TestCase.java:176)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest.runTestInternal(GridAbstractTest.java:2086)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest.access$000(GridAbstractTest.java:140)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest$5.run(GridAbstractTest.java:2001)
>   at java.lang.Thread.run(Thread.java:748)
> {noformat}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (IGNITE-8699) ZookeeperDiscoverySpiTest#testDisconnectOnServersLeft flaky fails (rarely)

2018-06-20 Thread Vitaliy Biryukov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-8699?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov updated IGNITE-8699:
-
Attachment: thread-dump-fail-before-local-join

> ZookeeperDiscoverySpiTest#testDisconnectOnServersLeft flaky fails (rarely)
> --
>
> Key: IGNITE-8699
> URL: https://issues.apache.org/jira/browse/IGNITE-8699
> Project: Ignite
>  Issue Type: Bug
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
> Attachments: thread-dump-fail-before-local-join
>
>
> *Affected tests:*
> testDisconnectOnServersLeft_1
> testDisconnectOnServersLeft_2
> testDisconnectOnServersLeft_3
> testDisconnectOnServersLeft_4
> testDisconnectOnServersLeft_5
> {noformat}
> junit.framework.AssertionFailedError: Failed to wait for disconnect/reconnect 
> event.
>   at junit.framework.Assert.fail(Assert.java:57)
>   at junit.framework.TestCase.fail(TestCase.java:227)
>   at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.waitReconnectEvent(ZookeeperDiscoverySpiTest.java:4685)
>   at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.disconnectOnServersLeft(ZookeeperDiscoverySpiTest.java:3541)
>   at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.testDisconnectOnServersLeft_4(ZookeeperDiscoverySpiTest.java:3476)
>   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>   at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>   at java.lang.reflect.Method.invoke(Method.java:498)
>   at junit.framework.TestCase.runTest(TestCase.java:176)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest.runTestInternal(GridAbstractTest.java:2086)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest.access$000(GridAbstractTest.java:140)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest$5.run(GridAbstractTest.java:2001)
>   at java.lang.Thread.run(Thread.java:748)
> {noformat}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Comment Edited] (IGNITE-8699) ZookeeperDiscoverySpiTest#testDisconnectOnServersLeft flaky fails (rarely)

2018-06-19 Thread Vitaliy Biryukov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-8699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16517297#comment-16517297
 ] 

Vitaliy Biryukov edited comment on IGNITE-8699 at 6/19/18 4:48 PM:
---

Hi, [~sergey-chugunov] Please take a look.
This test falls for two reasons:
1) If node fail event occurs before local join, *DiscoveryWorker* hangs.
2) If all servers left cluster oldest client notifies all clients. Oldest 
client receives events data and adds *ZkNoServersMessage*. If new events are 
added at this time, the events data version will change and cause 
*BadVersionException*.


was (Author: vitaliyb):
Hi, [~sergey-chugunov] Please take a look.
This test falls for two reasons:
1) If node fail event occurs before local join, DiscoveryWorker hangs.
2) If all servers left cluster oldest client notifies all clients. Oldest 
client receives events data and adds *ZkNoServersMessage*. If new events are 
added at this time, the events data version will change and cause 
BadVersionException.

> ZookeeperDiscoverySpiTest#testDisconnectOnServersLeft flaky fails (rarely)
> --
>
> Key: IGNITE-8699
> URL: https://issues.apache.org/jira/browse/IGNITE-8699
> Project: Ignite
>  Issue Type: Bug
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
>
> *Affected tests:*
> testDisconnectOnServersLeft_1
> testDisconnectOnServersLeft_2
> testDisconnectOnServersLeft_3
> testDisconnectOnServersLeft_4
> testDisconnectOnServersLeft_5
> {noformat}
> junit.framework.AssertionFailedError: Failed to wait for disconnect/reconnect 
> event.
>   at junit.framework.Assert.fail(Assert.java:57)
>   at junit.framework.TestCase.fail(TestCase.java:227)
>   at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.waitReconnectEvent(ZookeeperDiscoverySpiTest.java:4685)
>   at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.disconnectOnServersLeft(ZookeeperDiscoverySpiTest.java:3541)
>   at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.testDisconnectOnServersLeft_4(ZookeeperDiscoverySpiTest.java:3476)
>   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>   at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>   at java.lang.reflect.Method.invoke(Method.java:498)
>   at junit.framework.TestCase.runTest(TestCase.java:176)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest.runTestInternal(GridAbstractTest.java:2086)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest.access$000(GridAbstractTest.java:140)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest$5.run(GridAbstractTest.java:2001)
>   at java.lang.Thread.run(Thread.java:748)
> {noformat}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (IGNITE-8699) ZookeeperDiscoverySpiTest#testDisconnectOnServersLeft flaky fails (rarely)

2018-06-19 Thread Vitaliy Biryukov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-8699?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov updated IGNITE-8699:
-
Description: 
*Affected tests:*
testDisconnectOnServersLeft_1
testDisconnectOnServersLeft_2
testDisconnectOnServersLeft_3
testDisconnectOnServersLeft_4
testDisconnectOnServersLeft_5


{noformat}
junit.framework.AssertionFailedError: Failed to wait for disconnect/reconnect 
event.

at junit.framework.Assert.fail(Assert.java:57)
at junit.framework.TestCase.fail(TestCase.java:227)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.waitReconnectEvent(ZookeeperDiscoverySpiTest.java:4685)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.disconnectOnServersLeft(ZookeeperDiscoverySpiTest.java:3541)
at 
org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.testDisconnectOnServersLeft_4(ZookeeperDiscoverySpiTest.java:3476)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:498)
at junit.framework.TestCase.runTest(TestCase.java:176)
at 
org.apache.ignite.testframework.junits.GridAbstractTest.runTestInternal(GridAbstractTest.java:2086)
at 
org.apache.ignite.testframework.junits.GridAbstractTest.access$000(GridAbstractTest.java:140)
at 
org.apache.ignite.testframework.junits.GridAbstractTest$5.run(GridAbstractTest.java:2001)
at java.lang.Thread.run(Thread.java:748)
{noformat}


  was:
*Affected tests:*
testDisconnectOnServersLeft_1
testDisconnectOnServersLeft_2
testDisconnectOnServersLeft_3
testDisconnectOnServersLeft_4
testDisconnectOnServersLeft_5

*Causes:*
* Sometimes client nodes don't have time to join the topology.
* Sometimes starts communication failure resolver and wait for server nodes. 


> ZookeeperDiscoverySpiTest#testDisconnectOnServersLeft flaky fails (rarely)
> --
>
> Key: IGNITE-8699
> URL: https://issues.apache.org/jira/browse/IGNITE-8699
> Project: Ignite
>  Issue Type: Bug
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
>
> *Affected tests:*
> testDisconnectOnServersLeft_1
> testDisconnectOnServersLeft_2
> testDisconnectOnServersLeft_3
> testDisconnectOnServersLeft_4
> testDisconnectOnServersLeft_5
> {noformat}
> junit.framework.AssertionFailedError: Failed to wait for disconnect/reconnect 
> event.
>   at junit.framework.Assert.fail(Assert.java:57)
>   at junit.framework.TestCase.fail(TestCase.java:227)
>   at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.waitReconnectEvent(ZookeeperDiscoverySpiTest.java:4685)
>   at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.disconnectOnServersLeft(ZookeeperDiscoverySpiTest.java:3541)
>   at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.testDisconnectOnServersLeft_4(ZookeeperDiscoverySpiTest.java:3476)
>   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>   at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>   at java.lang.reflect.Method.invoke(Method.java:498)
>   at junit.framework.TestCase.runTest(TestCase.java:176)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest.runTestInternal(GridAbstractTest.java:2086)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest.access$000(GridAbstractTest.java:140)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest$5.run(GridAbstractTest.java:2001)
>   at java.lang.Thread.run(Thread.java:748)
> {noformat}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (IGNITE-8073) Cache read metric is calculated incorrectly in atomic cache.

2018-06-14 Thread Vitaliy Biryukov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-8073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16512450#comment-16512450
 ] 

Vitaliy Biryukov commented on IGNITE-8073:
--

[~Alexey Kuznetsov], LGTM.

> Cache read metric is calculated incorrectly in atomic cache.
> 
>
> Key: IGNITE-8073
> URL: https://issues.apache.org/jira/browse/IGNITE-8073
> Project: Ignite
>  Issue Type: Bug
>  Components: cache
>Affects Versions: 2.4
>Reporter: Alexey Kuznetsov
>Assignee: Alexey Kuznetsov
>Priority: Major
> Fix For: 2.6
>
> Attachments: GridCacheNearAtomicMetricsSelfTest.java
>
>
> In atomic cache with near enabled we perform put and remove operations.
> After it, get operation is called.
> Now, cache 'read' metric is calculated incorrectly, because it takes into 
> account near cache entry.
> Reproducer is attached.
> Note that remove operation untracks 'reader' node from dht cache entry, but 
> near cache entry still exists. The following test checks it :
> GridCacheAtomicNearCacheSelfTest#checkNearCache, see checkReaderRemove().



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Assigned] (IGNITE-8179) ZookeeperDiscoverySpiTest#testCommunicationFailureResolve_KillRandom always fails on TC

2018-06-08 Thread Vitaliy Biryukov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-8179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov reassigned IGNITE-8179:


Assignee: Vitaliy Biryukov

> ZookeeperDiscoverySpiTest#testCommunicationFailureResolve_KillRandom always 
> fails on TC
> ---
>
> Key: IGNITE-8179
> URL: https://issues.apache.org/jira/browse/IGNITE-8179
> Project: Ignite
>  Issue Type: Bug
>  Components: zookeeper
>Reporter: Sergey Chugunov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
>
> Test fails on TC with the following stack trace:
> {noformat}
> class org.apache.ignite.IgniteCheckedException: Failed to start manager: 
> GridManagerAdapter [enabled=true, 
> name=org.apache.ignite.internal.managers.discovery.GridDiscoveryManager]
> at 
> org.apache.ignite.internal.IgniteKernal.startManager(IgniteKernal.java:1698)
> at org.apache.ignite.internal.IgniteKernal.start(IgniteKernal.java:1007)
> at 
> org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.start0(IgnitionEx.java:1977)
> at 
> org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.start(IgnitionEx.java:1720)
> at org.apache.ignite.internal.IgnitionEx.start0(IgnitionEx.java:1148)
> at org.apache.ignite.internal.IgnitionEx.start(IgnitionEx.java:646)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.startGrid(GridAbstractTest.java:882)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.startGrid(GridAbstractTest.java:845)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.startGrid(GridAbstractTest.java:833)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.startGrid(GridAbstractTest.java:799)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.startGrids(GridAbstractTest.java:683)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.startGridsMultiThreaded(GridAbstractTest.java:710)
> at 
> org.apache.ignite.testframework.junits.common.GridCommonAbstractTest.startGridsMultiThreaded(GridCommonAbstractTest.java:507)
> at 
> org.apache.ignite.testframework.junits.common.GridCommonAbstractTest.startGridsMultiThreaded(GridCommonAbstractTest.java:497)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.testCommunicationFailureResolve_KillRandom(ZookeeperDiscoverySpiTest.java:2742)
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
> at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> at java.lang.reflect.Method.invoke(Method.java:498)
> at junit.framework.TestCase.runTest(TestCase.java:176)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.runTestInternal(GridAbstractTest.java:2080)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.access$000(GridAbstractTest.java:140)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest$5.run(GridAbstractTest.java:1995)
> at java.lang.Thread.run(Thread.java:748)
> Caused by: class org.apache.ignite.IgniteCheckedException: Failed to start 
> SPI: ZookeeperDiscoverySpi [zkRootPath=/apacheIgnite, 
> zkConnectionString=127.0.0.1:40921,127.0.0.1:35014,127.0.0.1:38754, 
> joinTimeout=0, sesTimeout=2000, clientReconnectDisabled=false, 
> internalLsnr=null]
> at 
> org.apache.ignite.internal.managers.GridManagerAdapter.startSpi(GridManagerAdapter.java:300)
> at 
> org.apache.ignite.internal.managers.discovery.GridDiscoveryManager.start(GridDiscoveryManager.java:905)
> at 
> org.apache.ignite.internal.IgniteKernal.startManager(IgniteKernal.java:1693)
> ... 23 more
> Caused by: class org.apache.ignite.spi.IgniteSpiException: Failed to 
> initialize Zookeeper nodes
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.initZkNodes(ZookeeperDiscoveryImpl.java:827)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.startJoin(ZookeeperDiscoveryImpl.java:957)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.joinTopology(ZookeeperDiscoveryImpl.java:775)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoveryImpl.startJoinAndWait(ZookeeperDiscoveryImpl.java:693)
> at 
> org.apache.ignite.spi.discovery.zk.ZookeeperDiscoverySpi.spiStart(ZookeeperDiscoverySpi.java:471)
> at 
> org.apache.ignite.internal.managers.GridManagerAdapter.startSpi(GridManagerAdapter.java:297)
> ... 25 more
> Caused by: 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperClientFailedException: 
> org.apache.zookeeper.KeeperException$SessionExpiredException: 

[jira] [Commented] (IGNITE-6010) ZookeeperIpFinderTest.testFourNodesKillRestartZookeeper fails sometimes

2018-06-06 Thread Vitaliy Biryukov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-6010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16503407#comment-16503407
 ] 

Vitaliy Biryukov commented on IGNITE-6010:
--

[~NSAmelchev], LGTM.

> ZookeeperIpFinderTest.testFourNodesKillRestartZookeeper fails sometimes
> ---
>
> Key: IGNITE-6010
> URL: https://issues.apache.org/jira/browse/IGNITE-6010
> Project: Ignite
>  Issue Type: Bug
>  Components: zookeeper
>Affects Versions: 2.1
>Reporter: Ilya Lantukh
>Assignee: Amelchev Nikita
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
> Fix For: 2.6
>
>
> {noformat}
> junit.framework.AssertionFailedError: null
> at junit.framework.Assert.fail(Assert.java:55)
> at junit.framework.Assert.assertTrue(Assert.java:22)
> at junit.framework.Assert.assertTrue(Assert.java:31)
> at junit.framework.TestCase.assertTrue(TestCase.java:201)
> at 
> org.apache.ignite.spi.discovery.tcp.ipfinder.zk.ZookeeperIpFinderTest.testFourNodesKillRestartZookeeper(ZookeeperIpFinderTest.java:365)
> {noformat}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (IGNITE-8699) ZookeeperDiscoverySpiTest#testDisconnectOnServersLeft flaky fails (rarely)

2018-06-05 Thread Vitaliy Biryukov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-8699?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov updated IGNITE-8699:
-
Description: 
*Affected tests:*
testDisconnectOnServersLeft_1
testDisconnectOnServersLeft_2
testDisconnectOnServersLeft_3
testDisconnectOnServersLeft_4
testDisconnectOnServersLeft_5

*Causes:*
* Sometimes client nodes don't have time to join the topology.
* Sometimes starts communication failure resolver and wait for server nodes. 

  was:
*Affected tests:*
testDisconnectOnServersLeft_1
testDisconnectOnServersLeft_2
testDisconnectOnServersLeft_3
testDisconnectOnServersLeft_4
testDisconnectOnServersLeft_5

*Causes:*
* Sometimes the node does not have time to join the topology.
* Sometimes starts communication failure resolver and wait for server nodes. 


> ZookeeperDiscoverySpiTest#testDisconnectOnServersLeft flaky fails (rarely)
> --
>
> Key: IGNITE-8699
> URL: https://issues.apache.org/jira/browse/IGNITE-8699
> Project: Ignite
>  Issue Type: Bug
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
>
> *Affected tests:*
> testDisconnectOnServersLeft_1
> testDisconnectOnServersLeft_2
> testDisconnectOnServersLeft_3
> testDisconnectOnServersLeft_4
> testDisconnectOnServersLeft_5
> *Causes:*
> * Sometimes client nodes don't have time to join the topology.
> * Sometimes starts communication failure resolver and wait for server nodes. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (IGNITE-8699) ZookeeperDiscoverySpiTest#testDisconnectOnServersLeft flaky fails (rarely)

2018-06-05 Thread Vitaliy Biryukov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-8699?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov updated IGNITE-8699:
-
Description: 
*Affected tests:*
testDisconnectOnServersLeft_1
testDisconnectOnServersLeft_2
testDisconnectOnServersLeft_3
testDisconnectOnServersLeft_4
testDisconnectOnServersLeft_5

*Causes:*
* Sometimes the node does not have time to join the topology.
* Sometimes starts communication failure resolver and wait for server nodes. 

  was:
Affected tests:

testDisconnectOnServersLeft_1
testDisconnectOnServersLeft_2
testDisconnectOnServersLeft_3
testDisconnectOnServersLeft_4
testDisconnectOnServersLeft_5

Causes:
* Sometimes the node does not have time to join the topology.
* Sometimes starts communication failure resolver and wait for server nodes. 


> ZookeeperDiscoverySpiTest#testDisconnectOnServersLeft flaky fails (rarely)
> --
>
> Key: IGNITE-8699
> URL: https://issues.apache.org/jira/browse/IGNITE-8699
> Project: Ignite
>  Issue Type: Bug
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
>
> *Affected tests:*
> testDisconnectOnServersLeft_1
> testDisconnectOnServersLeft_2
> testDisconnectOnServersLeft_3
> testDisconnectOnServersLeft_4
> testDisconnectOnServersLeft_5
> *Causes:*
> * Sometimes the node does not have time to join the topology.
> * Sometimes starts communication failure resolver and wait for server nodes. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (IGNITE-8699) ZookeeperDiscoverySpiTest#testDisconnectOnServersLeft flaky fails (rarely)

2018-06-05 Thread Vitaliy Biryukov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-8699?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov updated IGNITE-8699:
-
Description: 
Affected tests:

testDisconnectOnServersLeft_1
testDisconnectOnServersLeft_2
testDisconnectOnServersLeft_3
testDisconnectOnServersLeft_4
testDisconnectOnServersLeft_5

Causes:
* Sometimes the node does not have time to join the topology.
* Sometimes starts communication failure resolver and wait for server nodes. 

  was:
Affected tests:

testDisconnectOnServersLeft_1
testDisconnectOnServersLeft_2
testDisconnectOnServersLeft_3
testDisconnectOnServersLeft_4
testDisconnectOnServersLeft_5


> ZookeeperDiscoverySpiTest#testDisconnectOnServersLeft flaky fails (rarely)
> --
>
> Key: IGNITE-8699
> URL: https://issues.apache.org/jira/browse/IGNITE-8699
> Project: Ignite
>  Issue Type: Bug
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
>
> Affected tests:
> testDisconnectOnServersLeft_1
> testDisconnectOnServersLeft_2
> testDisconnectOnServersLeft_3
> testDisconnectOnServersLeft_4
> testDisconnectOnServersLeft_5
> Causes:
> * Sometimes the node does not have time to join the topology.
> * Sometimes starts communication failure resolver and wait for server nodes. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (IGNITE-8699) ZookeeperDiscoverySpiTest#testDisconnectOnServersLeft flaky fails (rarely)

2018-06-05 Thread Vitaliy Biryukov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-8699?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov updated IGNITE-8699:
-
Description: 
Affected tests:

testDisconnectOnServersLeft_1
testDisconnectOnServersLeft_2
testDisconnectOnServersLeft_3
testDisconnectOnServersLeft_4
testDisconnectOnServersLeft_5

> ZookeeperDiscoverySpiTest#testDisconnectOnServersLeft flaky fails (rarely)
> --
>
> Key: IGNITE-8699
> URL: https://issues.apache.org/jira/browse/IGNITE-8699
> Project: Ignite
>  Issue Type: Bug
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
>
> Affected tests:
> testDisconnectOnServersLeft_1
> testDisconnectOnServersLeft_2
> testDisconnectOnServersLeft_3
> testDisconnectOnServersLeft_4
> testDisconnectOnServersLeft_5



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Assigned] (IGNITE-8699) ZookeeperDiscoverySpiTest#testDisconnectOnServersLeft flaky fails (rarely)

2018-06-05 Thread Vitaliy Biryukov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-8699?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov reassigned IGNITE-8699:


Assignee: Vitaliy Biryukov
  Labels: MakeTeamcityGreenAgain  (was: )

> ZookeeperDiscoverySpiTest#testDisconnectOnServersLeft flaky fails (rarely)
> --
>
> Key: IGNITE-8699
> URL: https://issues.apache.org/jira/browse/IGNITE-8699
> Project: Ignite
>  Issue Type: Bug
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (IGNITE-8699) ZookeeperDiscoverySpiTest#testDisconnectOnServersLeft flaky fails (rarely)

2018-06-05 Thread Vitaliy Biryukov (JIRA)
Vitaliy Biryukov created IGNITE-8699:


 Summary: ZookeeperDiscoverySpiTest#testDisconnectOnServersLeft 
flaky fails (rarely)
 Key: IGNITE-8699
 URL: https://issues.apache.org/jira/browse/IGNITE-8699
 Project: Ignite
  Issue Type: Bug
Reporter: Vitaliy Biryukov






--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Assigned] (IGNITE-8182) ZookeeperDiscoverySpiTest#testRandomTopologyChanges_RestartZk fails on TC

2018-06-04 Thread Vitaliy Biryukov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-8182?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov reassigned IGNITE-8182:


Assignee: Vitaliy Biryukov

> ZookeeperDiscoverySpiTest#testRandomTopologyChanges_RestartZk fails on TC
> -
>
> Key: IGNITE-8182
> URL: https://issues.apache.org/jira/browse/IGNITE-8182
> Project: Ignite
>  Issue Type: Bug
>  Components: zookeeper
>Reporter: Sergey Chugunov
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
>
> Test fails on TC with the following stack trace in logs:
> {noformat}
> org.apache.ignite.IgniteCheckedException: Failed to start manager: 
> GridManagerAdapter [enabled=true, 
> name=org.apache.ignite.internal.managers.discovery.GridDiscoveryManager]
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.randomTopologyChanges(ZookeeperDiscoverySpiTest.java:3791)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.testRandomTopologyChanges_RestartZk(ZookeeperDiscoverySpiTest.java:2035)
> Caused by: org.apache.ignite.IgniteCheckedException: Failed to start SPI: 
> ZookeeperDiscoverySpi [zkRootPath=/apacheIgnite, 
> zkConnectionString=127.0.0.1:45427,127.0.0.1:38620,127.0.0.1:45472, 
> joinTimeout=0, sesTimeout=3, clientReconnectDisabled=false, 
> internalLsnr=null]
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.randomTopologyChanges(ZookeeperDiscoverySpiTest.java:3791)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.testRandomTopologyChanges_RestartZk(ZookeeperDiscoverySpiTest.java:2035)
> Caused by: org.apache.ignite.spi.IgniteSpiException: Failed to initialize 
> Zookeeper nodes
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.randomTopologyChanges(ZookeeperDiscoverySpiTest.java:3791)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.testRandomTopologyChanges_RestartZk(ZookeeperDiscoverySpiTest.java:2035)
> Caused by: 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperClientFailedException: 
> org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode 
> = ConnectionLoss for /apacheIgnite
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.randomTopologyChanges(ZookeeperDiscoverySpiTest.java:3791)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.testRandomTopologyChanges_RestartZk(ZookeeperDiscoverySpiTest.java:2035)
> Caused by: org.apache.zookeeper.KeeperException$ConnectionLossException: 
> KeeperErrorCode = ConnectionLoss for /apacheIgnite
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.randomTopologyChanges(ZookeeperDiscoverySpiTest.java:3791)
> at 
> org.apache.ignite.spi.discovery.zk.internal.ZookeeperDiscoverySpiTest.testRandomTopologyChanges_RestartZk(ZookeeperDiscoverySpiTest.java:2035)
> {noformat}
> Reproducible on TC, investigation of failure conditions is needed.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (IGNITE-8161) Suspend-resume TX test is flaky on TC (~5% fail rate)

2018-05-17 Thread Vitaliy Biryukov (JIRA)

[ 
https://issues.apache.org/jira/browse/IGNITE-8161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16478987#comment-16478987
 ] 

Vitaliy Biryukov commented on IGNITE-8161:
--

[~Alexey Kuznetsov] , LGTM.

> Suspend-resume TX test is flaky on TC (~5% fail rate)
> -
>
> Key: IGNITE-8161
> URL: https://issues.apache.org/jira/browse/IGNITE-8161
> Project: Ignite
>  Issue Type: Test
>  Components: cache
>Reporter: Dmitriy Pavlov
>Assignee: Alexey Kuznetsov
>Priority: Critical
>  Labels: MakeTeamcityGreenAgain
> Fix For: 2.6
>
>
> https://ci.ignite.apache.org/viewLog.html?buildId=1176294=buildResultsDiv=IgniteTests24Java8_Cache6#testNameId-7194341254453895210
> Causal chani java.lang.RuntimeException: javax.cache.CacheException: class 
> org.apache.ignite.transactions.TransactionTimeoutException: Cache transaction 
> timed out: GridNearTxLocal 
> First exception in log
> {noformat}
> validParts=null, state=MARKED_ROLLBACK, timedOut=true, 
> topVer=AffinityTopologyVersion [topVer=-1, minorTopVer=0], duration=172ms, 
> onePhaseCommit=false], size=0]]]
> at 
> org.apache.ignite.internal.processors.cache.distributed.IgniteOptimisticTxSuspendResumeTest$CI2Exc.apply(IgniteOptimisticTxSuspendResumeTest.java:759)
> at 
> org.apache.ignite.internal.processors.cache.distributed.IgniteOptimisticTxSuspendResumeTest.executeTestForAllCaches(IgniteOptimisticTxSuspendResumeTest.java:728)
> at 
> org.apache.ignite.internal.processors.cache.distributed.IgniteOptimisticTxSuspendResumeTest.testTxTimeoutOnResumed(IgniteOptimisticTxSuspendResumeTest.java:431)
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
> at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> at java.lang.reflect.Method.invoke(Method.java:498)
> at junit.framework.TestCase.runTest(TestCase.java:176)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.runTestInternal(GridAbstractTest.java:2018)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.access$000(GridAbstractTest.java:136)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest$5.run(GridAbstractTest.java:1933)
> at java.lang.Thread.run(Thread.java:745)
> {noformat}
> Test history
> https://ci.ignite.apache.org/project.html?projectId=IgniteTests24Java8=-7194341254453895210=testDetails



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (IGNITE-172) [Test] [Rare] GridTcpCommunicationSpiRecoveryAckSelfTest and IgniteTcpCommunicationRecoveryAckClosureSelfTest

2018-05-11 Thread Vitaliy Biryukov (JIRA)

[ 
https://issues.apache.org/jira/browse/IGNITE-172?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16471768#comment-16471768
 ] 

Vitaliy Biryukov commented on IGNITE-172:
-

[~NSAmelchev] , LGTM.

> [Test] [Rare] GridTcpCommunicationSpiRecoveryAckSelfTest and 
> IgniteTcpCommunicationRecoveryAckClosureSelfTest
> -
>
> Key: IGNITE-172
> URL: https://issues.apache.org/jira/browse/IGNITE-172
> Project: Ignite
>  Issue Type: Bug
>  Components: general
>Affects Versions: 1.5.0.final
>Reporter: Irina Vasilinets
>Assignee: Amelchev Nikita
>Priority: Major
>  Labels: MakeTeamcityGreenAgain, Muted_test
> Fix For: 2.6
>
>
> GridTcpCommunicationSpiRecoveryAckSelfTest.testQueueOverflow and 
> GridTcpCommunicationSpiTcpNoDelayOffSelfTest.testSendToManyNodes 
>  fail sometimes.
> IgniteTcpCommunicationRecoveryAckClosureSelfTest.testQueueOverflow - 1 from 10



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (IGNITE-7986) GridPartitionStateMap.entrySet() optimization.

2018-05-04 Thread Vitaliy Biryukov (JIRA)

[ 
https://issues.apache.org/jira/browse/IGNITE-7986?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16463793#comment-16463793
 ] 

Vitaliy Biryukov commented on IGNITE-7986:
--

[~dpavlov] , Done.

I've used *TreeMap* to test compatibility with other *Map* implementations, 
because of order.

> GridPartitionStateMap.entrySet() optimization.
> --
>
> Key: IGNITE-7986
> URL: https://issues.apache.org/jira/browse/IGNITE-7986
> Project: Ignite
>  Issue Type: Improvement
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
> Fix For: 2.6
>
> Attachments: GridPartitionStateMapBench.java, fullResult.txt
>
>
> GridPartitionStateMap based on BitSet. And the size of a BitSet depends on 
> the maximum key element, and not on the number of elements. 
> Just using the "BitSet.nextSetBit" method, will improve the performance of 
> the iterator for big clusters or caches with a large number of partitions.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (IGNITE-7986) GridPartitionStateMap.entrySet() optimization.

2018-05-03 Thread Vitaliy Biryukov (JIRA)

[ 
https://issues.apache.org/jira/browse/IGNITE-7986?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16462951#comment-16462951
 ] 

Vitaliy Biryukov commented on IGNITE-7986:
--

[~dpavlov] , added javadocs and case with hasNext() method to the test.

> GridPartitionStateMap.entrySet() optimization.
> --
>
> Key: IGNITE-7986
> URL: https://issues.apache.org/jira/browse/IGNITE-7986
> Project: Ignite
>  Issue Type: Improvement
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
> Fix For: 2.6
>
> Attachments: GridPartitionStateMapBench.java, fullResult.txt
>
>
> GridPartitionStateMap based on BitSet. And the size of a BitSet depends on 
> the maximum key element, and not on the number of elements. 
> Just using the "BitSet.nextSetBit" method, will improve the performance of 
> the iterator for big clusters or caches with a large number of partitions.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (IGNITE-7986) GridPartitionStateMap.entrySet() optimization.

2018-04-26 Thread Vitaliy Biryukov (JIRA)

[ 
https://issues.apache.org/jira/browse/IGNITE-7986?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16454321#comment-16454321
 ] 

Vitaliy Biryukov commented on IGNITE-7986:
--

[~dpavlov] Done.

> GridPartitionStateMap.entrySet() optimization.
> --
>
> Key: IGNITE-7986
> URL: https://issues.apache.org/jira/browse/IGNITE-7986
> Project: Ignite
>  Issue Type: Improvement
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
> Fix For: 2.6
>
> Attachments: GridPartitionStateMapBench.java, fullResult.txt
>
>
> GridPartitionStateMap based on BitSet. And the size of a BitSet depends on 
> the maximum key element, and not on the number of elements. 
> Just using the "BitSet.nextSetBit" method, will improve the performance of 
> the iterator for big clusters or caches with a large number of partitions.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (IGNITE-8088) Flaky assertion in testJoinClientStaticCacheConfigurationOnJoin for cache presence

2018-04-17 Thread Vitaliy Biryukov (JIRA)

[ 
https://issues.apache.org/jira/browse/IGNITE-8088?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16440808#comment-16440808
 ] 

Vitaliy Biryukov commented on IGNITE-8088:
--

[~xtern] , LGTM.

> Flaky assertion in testJoinClientStaticCacheConfigurationOnJoin for cache 
> presence
> --
>
> Key: IGNITE-8088
> URL: https://issues.apache.org/jira/browse/IGNITE-8088
> Project: Ignite
>  Issue Type: Test
>Reporter: Dmitriy Pavlov
>Assignee: Pavel Pereslegin
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
>
>  
> IgniteStandByClusterSuite: 
> JoinInActiveNodeToActiveCluster.testJoinClientStaticCacheConfigurationOnJoin 
> (master fail rate 12,8%) 
> IgniteStandByClusterSuite: 
> JoinActiveNodeToActiveCluster.testJoinClientStaticCacheConfigurationOnJoin 
> (master fail rate 10,0%) 
> Link to test histories:
> https://ci.ignite.apache.org/project.html?projectId=IgniteTests24Java8=-1780719797264285338=%3Cdefault%3E=testDetails
> https://ci.ignite.apache.org/project.html?projectId=IgniteTests24Java8=5703653634172546268=%3Cdefault%3E=testDetails
> {noformat}
> java.lang.AssertionError
> at org.junit.Assert.fail(Assert.java:86)
> at org.junit.Assert.assertTrue(Assert.java:41)
> at org.junit.Assert.assertNotNull(Assert.java:621)
> at org.junit.Assert.assertNotNull(Assert.java:631)
> at 
> org.apache.ignite.internal.processors.cache.persistence.standbycluster.AbstractNodeJoinTemplate$JoinNodeTestPlanBuilder$9.apply(AbstractNodeJoinTemplate.java:801)
> at 
> org.apache.ignite.internal.processors.cache.persistence.standbycluster.AbstractNodeJoinTemplate$JoinNodeTestPlanBuilder$9.apply(AbstractNodeJoinTemplate.java:791)
> at 
> org.apache.ignite.internal.processors.cache.persistence.standbycluster.AbstractNodeJoinTemplate$JoinNodeTestPlanBuilder$10.run(AbstractNodeJoinTemplate.java:824)
> at 
> org.apache.ignite.internal.processors.cache.persistence.standbycluster.AbstractNodeJoinTemplate$JoinNodeTestPlanBuilder.execute(AbstractNodeJoinTemplate.java:611)
> at 
> org.apache.ignite.internal.processors.cache.persistence.standbycluster.join.JoinInActiveNodeToActiveCluster.testJoinClientStaticCacheConfigurationOnJoin(JoinInActiveNodeToActiveCluster.java:228)
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
> at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> at java.lang.reflect.Method.invoke(Method.java:498)
> at junit.framework.TestCase.runTest(TestCase.java:176)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.runTestInternal(GridAbstractTest.java:2002)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest.access$000(GridAbstractTest.java:133)
> at 
> org.apache.ignite.testframework.junits.GridAbstractTest$5.run(GridAbstractTest.java:1917)
> at java.lang.Thread.run(Thread.java:745)
> {noformat}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (IGNITE-7844) Transaction incorrect state after client reconnected

2018-04-17 Thread Vitaliy Biryukov (JIRA)

[ 
https://issues.apache.org/jira/browse/IGNITE-7844?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16440693#comment-16440693
 ] 

Vitaliy Biryukov commented on IGNITE-7844:
--

[~Alexey Kuznetsov] , LGTM.

> Transaction incorrect state after client reconnected
> 
>
> Key: IGNITE-7844
> URL: https://issues.apache.org/jira/browse/IGNITE-7844
> Project: Ignite
>  Issue Type: Bug
>Affects Versions: 2.3
>Reporter: Alexey Kuznetsov
>Assignee: Alexey Kuznetsov
>Priority: Major
> Fix For: 2.6
>
>
> Transaction is started on client node.
>  Client reconnects, transaction rollbacks, but its state is left ACTIVE, 
> which is incorrect.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (IGNITE-8257) GridFutureAdapterSelfTest#testChaining flaky-fails on TC (rarely)

2018-04-13 Thread Vitaliy Biryukov (JIRA)
Vitaliy Biryukov created IGNITE-8257:


 Summary: GridFutureAdapterSelfTest#testChaining flaky-fails on TC 
(rarely)
 Key: IGNITE-8257
 URL: https://issues.apache.org/jira/browse/IGNITE-8257
 Project: Ignite
  Issue Type: Test
Reporter: Vitaliy Biryukov
Assignee: Vitaliy Biryukov
 Fix For: 2.6



{code:java}
class org.apache.ignite.internal.IgniteFutureTimeoutCheckedException: Timeout 
was reached before computation completed.
at 
org.apache.ignite.internal.util.future.GridFutureAdapter.get0(GridFutureAdapter.java:242)
at 
org.apache.ignite.internal.util.future.GridFutureAdapter.get(GridFutureAdapter.java:159)
at 
org.apache.ignite.internal.util.future.GridFutureAdapter.get(GridFutureAdapter.java:151)
at 
org.apache.ignite.internal.util.future.GridFutureAdapterSelfTest.checkChaining(GridFutureAdapterSelfTest.java:283)
at 
org.apache.ignite.internal.util.future.GridFutureAdapterSelfTest.testChaining(GridFutureAdapterSelfTest.java:237)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:498)
at junit.framework.TestCase.runTest(TestCase.java:176)
at 
org.apache.ignite.testframework.junits.GridAbstractTest.runTestInternal(GridAbstractTest.java:2080)
at 
org.apache.ignite.testframework.junits.GridAbstractTest.access$000(GridAbstractTest.java:140)
at 
org.apache.ignite.testframework.junits.GridAbstractTest$5.run(GridAbstractTest.java:1995)
at java.lang.Thread.run(Thread.java:745)
{code}




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (IGNITE-6224) Node stoping does not wait all transactions completion

2018-04-13 Thread Vitaliy Biryukov (JIRA)

 [ 
https://issues.apache.org/jira/browse/IGNITE-6224?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov updated IGNITE-6224:
-
Fix Version/s: 2.6

> Node stoping does not wait all transactions completion
> --
>
> Key: IGNITE-6224
> URL: https://issues.apache.org/jira/browse/IGNITE-6224
> Project: Ignite
>  Issue Type: Bug
>Affects Versions: 2.1
>Reporter: Vladislav Pyatkov
>Assignee: Vitaliy Biryukov
>Priority: Major
> Fix For: 2.6
>
> Attachments: TransactionBehindStopNodeTest.java
>
>
> I have started grid node and executing transaction over some cache. After I 
> stopped the node in the middle execution of transaction. I got transaction 
> execution exception:
> {noformat}
> java.lang.IllegalStateException: class 
> org.apache.ignite.internal.processors.cache.CacheStoppedException: Failed to 
> perform cache operation (cache is stopped): cache
>   at 
> org.apache.ignite.internal.processors.cache.GridCacheGateway.enter(GridCacheGateway.java:164)
>   at 
> org.apache.ignite.internal.processors.cache.GatewayProtectedCacheProxy.onEnter(GatewayProtectedCacheProxy.java:1656)
>   at 
> org.apache.ignite.internal.processors.cache.GatewayProtectedCacheProxy.put(GatewayProtectedCacheProxy.java:869)
>   at 
> org.apache.ignite.TransactionBehindStopNodeTest.testOneNode(TransactionBehindStopNodeTest.java:56)
>   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>   at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>   at java.lang.reflect.Method.invoke(Method.java:498)
>   at junit.framework.TestCase.runTest(TestCase.java:176)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest.runTestInternal(GridAbstractTest.java:2000)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest.access$000(GridAbstractTest.java:132)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest$5.run(GridAbstractTest.java:1915)
>   at java.lang.Thread.run(Thread.java:745)
> {noformat}
> although I stopped node with _false_ {{cancel}} flag.
> {code}
> G.stop(getTestIgniteInstanceName(0), false);
> {code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (IGNITE-6793) NPE in InitNewCoordinatorFuture leading to Cache3 suite hang

2018-04-11 Thread Vitaliy Biryukov (JIRA)

 [ 
https://issues.apache.org/jira/browse/IGNITE-6793?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov updated IGNITE-6793:
-
Fix Version/s: (was: 2.5)
   2.6

> NPE in InitNewCoordinatorFuture leading to Cache3 suite hang
> 
>
> Key: IGNITE-6793
> URL: https://issues.apache.org/jira/browse/IGNITE-6793
> Project: Ignite
>  Issue Type: Bug
>Affects Versions: 2.3
>Reporter: Alexey Goncharuk
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
> Fix For: 2.6
>
>
> Got the following exception in the IgniteCacheGroupsTest run:
> {code}
> [17:59:25]W:   [org.apache.ignite:ignite-core] [2017-10-30 
> 14:59:25,159][ERROR][sys-#35070%cache.IgniteCacheGroupsTest2%][GridCacheIoManager]
>  Failed processing message [senderId=9a523ce6-a252-457f-8175-7246b6c4, 
> msg=GridDhtPartitionsSingleMessage [parts={3181548=GridDhtPartitionMap 
> [moving=0, top=AffinityTopologyVersion [topVer=42, minorTopVer=2], 
> updateSeq=1098, size=191], -2100569601=GridDhtPartitionMap [moving=0, 
> top=AffinityTopologyVersion [topVer=42, minorTopVer=2], updateSeq=654, 
> size=100]}, 
> partCntrs={3181548=o.a.i.i.processors.cache.distributed.dht.preloader.CachePartitionPartialCountersMap@78b1774,
>  
> -2100569601=o.a.i.i.processors.cache.distributed.dht.preloader.CachePartitionPartialCountersMap@3cb0c48a},
>  partHistCntrs=null, err=null, client=false, compress=false, 
> finishMsg=GridDhtPartitionsFullMessage [parts=null, partCntrs=null, 
> partCntrs2=null, partHistSuppliers=null, partsToReload=null, 
> topVer=AffinityTopologyVersion [topVer=42, minorTopVer=2], errs=null, 
> compress=false, resTopVer=null, partCnt=0, 
> super=GridDhtPartitionsAbstractMessage [exchId=GridDhtPartitionExchangeId 
> [topVer=AffinityTopologyVersion [topVer=42, minorTopVer=2], discoEvt=null, 
> nodeId=33c058c3, evt=DISCOVERY_CUSTOM_EVT], lastVer=GridCacheVersion 
> [topVer=120855515, order=1509375572161, nodeOrder=32], super=GridCacheMessage 
> [msgId=4153599, depInfo=null, err=null, skipPrepare=false]]], 
> super=GridDhtPartitionsAbstractMessage [exchId=GridDhtPartitionExchangeId 
> [topVer=AffinityTopologyVersion [topVer=42, minorTopVer=2], discoEvt=null, 
> nodeId=33c058c3, evt=DISCOVERY_CUSTOM_EVT], lastVer=GridCacheVersion 
> [topVer=120855515, order=1509375572153, nodeOrder=38], super=GridCacheMessage 
> [msgId=4153605, depInfo=null, err=null, skipPrepare=false
> [17:59:25]W:   [org.apache.ignite:ignite-core] 
> java.lang.NullPointerException
> [17:59:25]W:   [org.apache.ignite:ignite-core]at 
> org.apache.ignite.internal.processors.cache.distributed.dht.preloader.InitNewCoordinatorFuture.onMessage(InitNewCoordinatorFuture.java:238)
> [17:59:25]W:   [org.apache.ignite:ignite-core]at 
> org.apache.ignite.internal.processors.cache.distributed.dht.preloader.GridDhtPartitionsExchangeFuture.onReceiveSingleMessage(GridDhtPartitionsExchangeFuture.java:1749)
> [17:59:25]W:   [org.apache.ignite:ignite-core]at 
> org.apache.ignite.internal.processors.cache.GridCachePartitionExchangeManager.processSinglePartitionUpdate(GridCachePartitionExchangeManager.java:1484)
> [17:59:25]W:   [org.apache.ignite:ignite-core]at 
> org.apache.ignite.internal.processors.cache.GridCachePartitionExchangeManager.access$1000(GridCachePartitionExchangeManager.java:131)
> [17:59:25]W:   [org.apache.ignite:ignite-core]at 
> org.apache.ignite.internal.processors.cache.GridCachePartitionExchangeManager$2.onMessage(GridCachePartitionExchangeManager.java:327)
> [17:59:25]W:   [org.apache.ignite:ignite-core]at 
> org.apache.ignite.internal.processors.cache.GridCachePartitionExchangeManager$2.onMessage(GridCachePartitionExchangeManager.java:307)
> [17:59:25]W:   [org.apache.ignite:ignite-core]at 
> org.apache.ignite.internal.processors.cache.GridCachePartitionExchangeManager$MessageHandler.apply(GridCachePartitionExchangeManager.java:2627)
> [17:59:25]W:   [org.apache.ignite:ignite-core]at 
> org.apache.ignite.internal.processors.cache.GridCachePartitionExchangeManager$MessageHandler.apply(GridCachePartitionExchangeManager.java:2606)
> [17:59:25]W:   [org.apache.ignite:ignite-core]at 
> org.apache.ignite.internal.processors.cache.GridCacheIoManager.processMessage(GridCacheIoManager.java:1060)
> [17:59:25]W:   [org.apache.ignite:ignite-core]at 
> org.apache.ignite.internal.processors.cache.GridCacheIoManager.onMessage0(GridCacheIoManager.java:579)
> [17:59:25]W:   [org.apache.ignite:ignite-core]at 
> org.apache.ignite.internal.processors.cache.GridCacheIoManager.handleMessage(GridCacheIoManager.java:378)

[jira] [Updated] (IGNITE-6445) IgniteTxManager.txLocksInfo method misses locks

2018-04-11 Thread Vitaliy Biryukov (JIRA)

 [ 
https://issues.apache.org/jira/browse/IGNITE-6445?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov updated IGNITE-6445:
-
Fix Version/s: (was: 2.5)
   2.6

> IgniteTxManager.txLocksInfo method misses locks
> ---
>
> Key: IGNITE-6445
> URL: https://issues.apache.org/jira/browse/IGNITE-6445
> Project: Ignite
>  Issue Type: Bug
>Affects Versions: 2.2
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
> Fix For: 2.6
>
>
> In some cases "IgniteTxManager.txLocksInfo" method (searches for locks) 
> misses locks.
> For example:
> # In case of a configuration with near cache, entries are created for the 
> near cache and for the ordinal cache. For each entry, their own MVCC 
> candidates are created.
> # For non-custom objects of type (Integer, etc.), the entry stored in 
> "GridNearTxLocal" is not associated with MVCC candidates with which the same 
> entity is associated in another format stored in "GridDhtTxLocal"



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (IGNITE-7986) GridPartitionStateMap.entrySet() optimization.

2018-04-11 Thread Vitaliy Biryukov (JIRA)

 [ 
https://issues.apache.org/jira/browse/IGNITE-7986?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov updated IGNITE-7986:
-
Fix Version/s: (was: 2.5)
   2.6

> GridPartitionStateMap.entrySet() optimization.
> --
>
> Key: IGNITE-7986
> URL: https://issues.apache.org/jira/browse/IGNITE-7986
> Project: Ignite
>  Issue Type: Improvement
>Reporter: Vitaliy Biryukov
>Assignee: Vitaliy Biryukov
>Priority: Major
> Fix For: 2.6
>
> Attachments: GridPartitionStateMapBench.java, fullResult.txt
>
>
> GridPartitionStateMap based on BitSet. And the size of a BitSet depends on 
> the maximum key element, and not on the number of elements. 
> Just using the "BitSet.nextSetBit" method, will improve the performance of 
> the iterator for big clusters or caches with a large number of partitions.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Resolved] (IGNITE-3464) Possible race between partition exchange and prepare/finish requests

2018-04-09 Thread Vitaliy Biryukov (JIRA)

 [ 
https://issues.apache.org/jira/browse/IGNITE-3464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov resolved IGNITE-3464.
--
Resolution: Duplicate

> Possible race between partition exchange and prepare/finish requests
> 
>
> Key: IGNITE-3464
> URL: https://issues.apache.org/jira/browse/IGNITE-3464
> Project: Ignite
>  Issue Type: Bug
>  Components: cache
>Affects Versions: ignite-1.4
>Reporter: Alexey Goncharuk
>Assignee: Vitaliy Biryukov
>Priority: Major
> Fix For: 2.6
>
>
> Consider the following scenario:
> Two nodes A (coordinator), B. Node C is joining the grid. Current topology 
> version is 2.
>  - Node A starts a transaction on version 2 and sends a prepare request to 
> node B
>  - Discovery event happens on node A. Exchange future is created, captures 
> the transaction and waits for this transaction to finish.
>  - Discovery event happens on node B. Exchange future is created, but since 
> there is no transaction on this node (the request has not been processed 
> yet), partition release future is completed and exchange waits for an ACK 
> from coordinator.
>  - Prepare request is processed on node B
>  - Node A commits the transaction locally, partition release future is 
> completed. Both finish request and exchange message are sent to the node B.
>  - Node B processes the exchange message first and completes exchange.
>  - Node C starts rebalancing from node B and acquires stale value of the key 
> which was supposed to be updated in the transaction.
>  - Node B processes finish request and commits the transaction.
> As a result, node B and C have different values stored in the cache.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Assigned] (IGNITE-6793) NPE in InitNewCoordinatorFuture leading to Cache3 suite hang

2018-04-05 Thread Vitaliy Biryukov (JIRA)

 [ 
https://issues.apache.org/jira/browse/IGNITE-6793?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Biryukov reassigned IGNITE-6793:


Assignee: Vitaliy Biryukov

> NPE in InitNewCoordinatorFuture leading to Cache3 suite hang
> 
>
> Key: IGNITE-6793
> URL: https://issues.apache.org/jira/browse/IGNITE-6793
> Project: Ignite
>  Issue Type: Bug
>Affects Versions: 2.3
>Reporter: Alexey Goncharuk
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
> Fix For: 2.5
>
>
> Got the following exception in the IgniteCacheGroupsTest run:
> {code}
> [17:59:25]W:   [org.apache.ignite:ignite-core] [2017-10-30 
> 14:59:25,159][ERROR][sys-#35070%cache.IgniteCacheGroupsTest2%][GridCacheIoManager]
>  Failed processing message [senderId=9a523ce6-a252-457f-8175-7246b6c4, 
> msg=GridDhtPartitionsSingleMessage [parts={3181548=GridDhtPartitionMap 
> [moving=0, top=AffinityTopologyVersion [topVer=42, minorTopVer=2], 
> updateSeq=1098, size=191], -2100569601=GridDhtPartitionMap [moving=0, 
> top=AffinityTopologyVersion [topVer=42, minorTopVer=2], updateSeq=654, 
> size=100]}, 
> partCntrs={3181548=o.a.i.i.processors.cache.distributed.dht.preloader.CachePartitionPartialCountersMap@78b1774,
>  
> -2100569601=o.a.i.i.processors.cache.distributed.dht.preloader.CachePartitionPartialCountersMap@3cb0c48a},
>  partHistCntrs=null, err=null, client=false, compress=false, 
> finishMsg=GridDhtPartitionsFullMessage [parts=null, partCntrs=null, 
> partCntrs2=null, partHistSuppliers=null, partsToReload=null, 
> topVer=AffinityTopologyVersion [topVer=42, minorTopVer=2], errs=null, 
> compress=false, resTopVer=null, partCnt=0, 
> super=GridDhtPartitionsAbstractMessage [exchId=GridDhtPartitionExchangeId 
> [topVer=AffinityTopologyVersion [topVer=42, minorTopVer=2], discoEvt=null, 
> nodeId=33c058c3, evt=DISCOVERY_CUSTOM_EVT], lastVer=GridCacheVersion 
> [topVer=120855515, order=1509375572161, nodeOrder=32], super=GridCacheMessage 
> [msgId=4153599, depInfo=null, err=null, skipPrepare=false]]], 
> super=GridDhtPartitionsAbstractMessage [exchId=GridDhtPartitionExchangeId 
> [topVer=AffinityTopologyVersion [topVer=42, minorTopVer=2], discoEvt=null, 
> nodeId=33c058c3, evt=DISCOVERY_CUSTOM_EVT], lastVer=GridCacheVersion 
> [topVer=120855515, order=1509375572153, nodeOrder=38], super=GridCacheMessage 
> [msgId=4153605, depInfo=null, err=null, skipPrepare=false
> [17:59:25]W:   [org.apache.ignite:ignite-core] 
> java.lang.NullPointerException
> [17:59:25]W:   [org.apache.ignite:ignite-core]at 
> org.apache.ignite.internal.processors.cache.distributed.dht.preloader.InitNewCoordinatorFuture.onMessage(InitNewCoordinatorFuture.java:238)
> [17:59:25]W:   [org.apache.ignite:ignite-core]at 
> org.apache.ignite.internal.processors.cache.distributed.dht.preloader.GridDhtPartitionsExchangeFuture.onReceiveSingleMessage(GridDhtPartitionsExchangeFuture.java:1749)
> [17:59:25]W:   [org.apache.ignite:ignite-core]at 
> org.apache.ignite.internal.processors.cache.GridCachePartitionExchangeManager.processSinglePartitionUpdate(GridCachePartitionExchangeManager.java:1484)
> [17:59:25]W:   [org.apache.ignite:ignite-core]at 
> org.apache.ignite.internal.processors.cache.GridCachePartitionExchangeManager.access$1000(GridCachePartitionExchangeManager.java:131)
> [17:59:25]W:   [org.apache.ignite:ignite-core]at 
> org.apache.ignite.internal.processors.cache.GridCachePartitionExchangeManager$2.onMessage(GridCachePartitionExchangeManager.java:327)
> [17:59:25]W:   [org.apache.ignite:ignite-core]at 
> org.apache.ignite.internal.processors.cache.GridCachePartitionExchangeManager$2.onMessage(GridCachePartitionExchangeManager.java:307)
> [17:59:25]W:   [org.apache.ignite:ignite-core]at 
> org.apache.ignite.internal.processors.cache.GridCachePartitionExchangeManager$MessageHandler.apply(GridCachePartitionExchangeManager.java:2627)
> [17:59:25]W:   [org.apache.ignite:ignite-core]at 
> org.apache.ignite.internal.processors.cache.GridCachePartitionExchangeManager$MessageHandler.apply(GridCachePartitionExchangeManager.java:2606)
> [17:59:25]W:   [org.apache.ignite:ignite-core]at 
> org.apache.ignite.internal.processors.cache.GridCacheIoManager.processMessage(GridCacheIoManager.java:1060)
> [17:59:25]W:   [org.apache.ignite:ignite-core]at 
> org.apache.ignite.internal.processors.cache.GridCacheIoManager.onMessage0(GridCacheIoManager.java:579)
> [17:59:25]W:   [org.apache.ignite:ignite-core]at 
> org.apache.ignite.internal.processors.cache.GridCacheIoManager.handleMessage(GridCacheIoManager.java:378)
> [17:59:25]W: 

[jira] [Comment Edited] (IGNITE-5978) [Test Failed] IgnitePartitionedCountDownLatchSelfTest.testLatchMultinode1

2018-04-04 Thread Vitaliy Biryukov (JIRA)

[ 
https://issues.apache.org/jira/browse/IGNITE-5978?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16425779#comment-16425779
 ] 

Vitaliy Biryukov edited comment on IGNITE-5978 at 4/4/18 4:33 PM:
--

[~NIzhikov] please take a look.

Latch creates with "autoDel=true", and there was a race between last 

"latch.countDown()" and resiving latch in threads.


was (Author: vitaliyb):
[~NIzhikov] please take a look.

> [Test Failed] IgnitePartitionedCountDownLatchSelfTest.testLatchMultinode1
> -
>
> Key: IGNITE-5978
> URL: https://issues.apache.org/jira/browse/IGNITE-5978
> Project: Ignite
>  Issue Type: Bug
>Affects Versions: 2.1
>Reporter: Eduard Shangareev
>Assignee: Vitaliy Biryukov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
> Fix For: 2.5
>
>
> Fails locally.
> Example of failing - 
> http://ci.ignite.apache.org/viewLog.html?buildId=759891=buildResultsDiv=Ignite20Tests_IgniteDataStrucutures#testNameId677264269171099154.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


  1   2   3   >