[jira] [Updated] (IGNITE-9235) Transitivity violation in GridMergeIndex Comparator

2018-08-10 Thread Evgenii Zagumennov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-9235?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Evgenii Zagumennov updated IGNITE-9235:
---
Fix Version/s: 2.5

> Transitivity violation in GridMergeIndex Comparator
> ---
>
> Key: IGNITE-9235
> URL: https://issues.apache.org/jira/browse/IGNITE-9235
> Project: Ignite
>  Issue Type: Bug
>Affects Versions: 2.5
>Reporter: Andrew Medvedev
>Assignee: Andrew Medvedev
>Priority: Major
> Fix For: 2.5
>
>
> Currently comparator in 
> org.apache.ignite.internal.processors.query.h2.twostep.GridMergeIndex is:
>  
> Private final Comparator streamCmp = new Comparator() {
>  @Override public int compare(RowStream o1, RowStream o2) {
>  // Nulls at the beginning.
>  if (o1 == null)
>  return -1;
>  if (o2 == null)
>  return 1;
>  return compareRows(o1.get(), o2.get());
>  }
> };
> --
>  
> This comparator violates transitivity when o1 and o2 are null. Thus we get 
> exception in JDK1.8:
>  
>  
> {color:#d04437}Caused by: java.lang.IllegalArgumentException: Comparison 
> method violates its general contract!{color}
> {color:#d04437}  at java.util.TimSort.mergeHi(TimSort.java:899){color}
> {color:#d04437}  at java.util.TimSort.mergeAt(TimSort.java:516){color}
> {color:#d04437}  at java.util.TimSort.mergeCollapse(TimSort.java:441){color}
> {color:#d04437}  at java.util.TimSort.sort(TimSort.java:245){color}
> {color:#d04437}  at java.util.Arrays.sort(Arrays.java:1438){color}
> {color:#d04437}  at 
> org.apache.ignite.internal.processors.query.h2.twostep.GridMergeIndexSorted$MergeStreamIterator.goFirst(GridMergeIndexSorted.java:248){color}
> {color:#d04437}  at 
> org.apache.ignite.internal.processors.query.h2.twostep.GridMergeIndexSorted$MergeStreamIterator.hasNext(GridMergeIndexSorted.java:270){color}
> {color:#d04437}  at 
> org.apache.ignite.internal.processors.query.h2.twostep.GridMergeIndex$FetchingCursor.fetchRows(GridMergeIndex.java:614){color}
> {color:#d04437}  at 
> org.apache.ignite.internal.processors.query.h2.twostep.GridMergeIndex$FetchingCursor.next(GridMergeIndex.java:658){color}
> {color:#d04437}  at org.h2.index.IndexCursor.next(IndexCursor.java:305){color}
> {color:#d04437}  at org.h2.table.TableFilter.next(TableFilter.java:499){color}
> {color:#d04437}  at 
> org.h2.command.dml.Select$LazyResultQueryFlat.fetchNextRow(Select.java:1452){color}
> {color:#d04437}  at 
> org.h2.result.LazyResult.hasNext(LazyResult.java:79){color}
> {color:#d04437}  at org.h2.result.LazyResult.next(LazyResult.java:59){color}
> {color:#d04437}  at 
> org.h2.command.dml.Select.queryFlat(Select.java:519){color}
> {color:#d04437}  at 
> org.h2.command.dml.Select.queryWithoutCache(Select.java:625){color}
> {color:#d04437}  at 
> org.h2.command.dml.Query.queryWithoutCacheLazyCheck(Query.java:114){color}
> {color:#d04437}  at org.h2.command.dml.Query.query(Query.java:352){color}
> {color:#d04437}  at org.h2.command.dml.Query.query(Query.java:333){color}
> {color:#d04437}  at 
> org.h2.command.CommandContainer.query(CommandContainer.java:113){color}
> {color:#d04437}  at 
> org.h2.command.Command.executeQuery(Command.java:201){color}
> {color:#d04437} ... 44 more{color}
>   
> WA: use -Djava.util.Arrays.useLegacyMergeSort=true
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (IGNITE-9144) A client node leaving a grid may trigger the wrong message about coordinator change in the logs

2018-08-01 Thread Evgenii Zagumennov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-9144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16565115#comment-16565115
 ] 

Evgenii Zagumennov commented on IGNITE-9144:


https://ci.ignite.apache.org/viewLog.html?buildId=1572242=buildResultsDiv=IgniteTests24Java8_RunAll

> A client node leaving a grid may trigger the wrong message about coordinator 
> change in the logs
> ---
>
> Key: IGNITE-9144
> URL: https://issues.apache.org/jira/browse/IGNITE-9144
> Project: Ignite
>  Issue Type: Bug
>Reporter: Ivan Artukhov
>Assignee: Evgenii Zagumennov
>Priority: Major
>
> The issue was introduced by https://issues.apache.org/jira/browse/IGNITE-8738.
> Suppose we have a grid with X server nodes and Y client nodes. Server nodes 
> are restarted periodically while client nodes are left untouched. In this 
> case *order* of current coordinator might be greater than *order* of any 
> client node. Then when some client node leaves the grid, we will erroneously 
> print the *Coordinator changed* message with *client* node being the previous 
> coordinator. E.g.:
> {noformat}
> [2018-07-19 14:55:28,897][INFO ][disco-event-worker-#61] Node left topology: 
> TcpDiscoveryNode [id=7240957f-a51b-452d-bfc8-420e8ef9ea68, addrs=[127.0.0.1, 
> 172.17.0.1, 172.25.1.15], sockAddrs=[/172.17.0.1:0, /127.0.0.1:0, 
> lab15.gridgain.local/172.25.1.15:0], discPort=0, order=16, intOrder=11, 
> lastExchangeTime=1532001260398, loc=false, ver=2.5.1#20180717-sha1:80e51c80, 
> isClient=true]
> [2018-07-19 14:55:28,899][INFO ][disco-event-worker-#61] Topology snapshot 
> [ver=27, servers=3, clients=4, CPUs=96, offheap=260.0GB, heap=56.0GB]
> [2018-07-19 14:55:28,899][INFO ][disco-event-worker-#61] Coordinator changed 
> [prev=TcpDiscoveryNode [id=7240957f-a51b-452d-bfc8-420e8ef9ea68, 
> addrs=[127.0.0.1, 172.17.0.1, 172.25.1.15], sockAddrs=[/172.17.0.1:0, 
> /127.0.0.1:0, lab15.gridgain.local/172.25.1.15:0], discPort=0, order=16, 
> intOrder=11, lastExchangeTime=1532001260398, loc=false, 
> ver=2.5.1#20180717-sha1:80e51c80, isClient=true], cur=TcpDiscoveryNode 
> [id=760fd8f2-b9d7-4953-aa86-3954c05c9feb, addrs=[127.0.0.1, 172.17.0.1, 
> 172.25.1.21], sockAddrs=[/172.17.0.1:47500, 
> lab21.gridgain.local/172.25.1.21:47500, /127.0.0.1:47500], discPort=47500, 
> order=21, intOrder=15, lastExchangeTime=1532001260428, loc=false, 
> ver=2.5.1#20180717-sha1:80e51c80, isClient=false]]
> [2018-07-19 14:55:28,899][INFO ][disco-event-worker-#61]   ^-- Node 
> [id=22B15E97-9944-48B5-A473-5C64E75A4D5A, clusterState=ACTIVE]
> [2018-07-19 14:55:28,899][INFO ][disco-event-worker-#61]   ^-- Baseline 
> [id=6, size=3, online=3, offline=0]
> [2018-07-19 14:55:28,899][INFO ][disco-event-worker-#61] Data Regions 
> Configured:
> [2018-07-19 14:55:28,900][INFO ][disco-event-worker-#61]   ^-- default 
> [initSize=256.0 MiB, maxSize=60.0 GiB, persistenceEnabled=true]
> {noformat}
> The *Coordinator changed* message should not be here because in fact the 
> coordinator was not changed.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Assigned] (IGNITE-9144) A client node leaving a grid may trigger the wrong message about coordinator change in the logs

2018-07-31 Thread Evgenii Zagumennov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-9144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Evgenii Zagumennov reassigned IGNITE-9144:
--

Assignee: Evgenii Zagumennov

> A client node leaving a grid may trigger the wrong message about coordinator 
> change in the logs
> ---
>
> Key: IGNITE-9144
> URL: https://issues.apache.org/jira/browse/IGNITE-9144
> Project: Ignite
>  Issue Type: Bug
>Reporter: Ivan Artukhov
>Assignee: Evgenii Zagumennov
>Priority: Major
>
> The issue was introduced by https://issues.apache.org/jira/browse/IGNITE-8738.
> Suppose we have a grid with X server nodes and Y client nodes. Server nodes 
> are restarted periodically while client nodes are left untouched. In this 
> case *order* of current coordinator might be greater than *order* of any 
> client node. Then when some client node leaves the grid, we will erroneously 
> print the *Coordinator changed* message with *client* node being the previous 
> coordinator. E.g.:
> {noformat}
> [2018-07-19 14:55:28,897][INFO ][disco-event-worker-#61] Node left topology: 
> TcpDiscoveryNode [id=7240957f-a51b-452d-bfc8-420e8ef9ea68, addrs=[127.0.0.1, 
> 172.17.0.1, 172.25.1.15], sockAddrs=[/172.17.0.1:0, /127.0.0.1:0, 
> lab15.gridgain.local/172.25.1.15:0], discPort=0, order=16, intOrder=11, 
> lastExchangeTime=1532001260398, loc=false, ver=2.5.1#20180717-sha1:80e51c80, 
> isClient=true]
> [2018-07-19 14:55:28,899][INFO ][disco-event-worker-#61] Topology snapshot 
> [ver=27, servers=3, clients=4, CPUs=96, offheap=260.0GB, heap=56.0GB]
> [2018-07-19 14:55:28,899][INFO ][disco-event-worker-#61] Coordinator changed 
> [prev=TcpDiscoveryNode [id=7240957f-a51b-452d-bfc8-420e8ef9ea68, 
> addrs=[127.0.0.1, 172.17.0.1, 172.25.1.15], sockAddrs=[/172.17.0.1:0, 
> /127.0.0.1:0, lab15.gridgain.local/172.25.1.15:0], discPort=0, order=16, 
> intOrder=11, lastExchangeTime=1532001260398, loc=false, 
> ver=2.5.1#20180717-sha1:80e51c80, isClient=true], cur=TcpDiscoveryNode 
> [id=760fd8f2-b9d7-4953-aa86-3954c05c9feb, addrs=[127.0.0.1, 172.17.0.1, 
> 172.25.1.21], sockAddrs=[/172.17.0.1:47500, 
> lab21.gridgain.local/172.25.1.21:47500, /127.0.0.1:47500], discPort=47500, 
> order=21, intOrder=15, lastExchangeTime=1532001260428, loc=false, 
> ver=2.5.1#20180717-sha1:80e51c80, isClient=false]]
> [2018-07-19 14:55:28,899][INFO ][disco-event-worker-#61]   ^-- Node 
> [id=22B15E97-9944-48B5-A473-5C64E75A4D5A, clusterState=ACTIVE]
> [2018-07-19 14:55:28,899][INFO ][disco-event-worker-#61]   ^-- Baseline 
> [id=6, size=3, online=3, offline=0]
> [2018-07-19 14:55:28,899][INFO ][disco-event-worker-#61] Data Regions 
> Configured:
> [2018-07-19 14:55:28,900][INFO ][disco-event-worker-#61]   ^-- default 
> [initSize=256.0 MiB, maxSize=60.0 GiB, persistenceEnabled=true]
> {noformat}
> The *Coordinator changed* message should not be here because in fact the 
> coordinator was not changed.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Resolved] (IGNITE-9117) Need to log GridReduceQueryExecutor#query long execution

2018-07-29 Thread Evgenii Zagumennov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-9117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Evgenii Zagumennov resolved IGNITE-9117.

Resolution: Duplicate

> Need to log GridReduceQueryExecutor#query long execution
> 
>
> Key: IGNITE-9117
> URL: https://issues.apache.org/jira/browse/IGNITE-9117
> Project: Ignite
>  Issue Type: Improvement
>  Components: compute, sql
>Reporter: Evgenii Zagumennov
>Priority: Major
>
> In case of long query execution *GridReduceQueryExecutor.query()* should log 
> the root cause and throw an exception by timeout.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (IGNITE-9117) Need to log GridReduceQueryExecutor#query long execution

2018-07-29 Thread Evgenii Zagumennov (JIRA)
Evgenii Zagumennov created IGNITE-9117:
--

 Summary: Need to log GridReduceQueryExecutor#query long execution
 Key: IGNITE-9117
 URL: https://issues.apache.org/jira/browse/IGNITE-9117
 Project: Ignite
  Issue Type: Improvement
  Components: compute, sql
Reporter: Evgenii Zagumennov


In case of long query execution *GridReduceQueryExecutor.query()* should log 
the root cause and throw an exception by timeout.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Assigned] (IGNITE-8939) Transaction string reprsentation unhandled exception

2018-07-29 Thread Evgenii Zagumennov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-8939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Evgenii Zagumennov reassigned IGNITE-8939:
--

Assignee: Stanilovsky Evgeny  (was: Evgenii Zagumennov)

> Transaction string reprsentation unhandled exception
> 
>
> Key: IGNITE-8939
> URL: https://issues.apache.org/jira/browse/IGNITE-8939
> Project: Ignite
>  Issue Type: Bug
>  Components: binary
>Reporter: Evgenii Zagumennov
>Assignee: Stanilovsky Evgeny
>Priority: Major
>
> IgniteTxHandler.finishDhtLocal() (IgniteTxHandler.java:957)
> {code:java}
> U.error(log, "Failed completing transaction [commit=" + req.commit() + ", 
> tx=" + *tx* + ']', e);{code}
> tx.toString() can lead to excepion (in GridToStringBuilder.toStringImpl()), 
> and original exception in transaction will be lost. We need to log original 
> exception and catch probable tx.toString() exception.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Assigned] (IGNITE-8939) Transaction string reprsentation unhandled exception

2018-07-26 Thread Evgenii Zagumennov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-8939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Evgenii Zagumennov reassigned IGNITE-8939:
--

Assignee: Evgenii Zagumennov

> Transaction string reprsentation unhandled exception
> 
>
> Key: IGNITE-8939
> URL: https://issues.apache.org/jira/browse/IGNITE-8939
> Project: Ignite
>  Issue Type: Bug
>  Components: binary
>Reporter: Evgenii Zagumennov
>Assignee: Evgenii Zagumennov
>Priority: Major
>
> IgniteTxHandler.finishDhtLocal() (IgniteTxHandler.java:957)
> {code:java}
> U.error(log, "Failed completing transaction [commit=" + req.commit() + ", 
> tx=" + *tx* + ']', e);{code}
> tx.toString() can lead to excepion (in GridToStringBuilder.toStringImpl()), 
> and original exception in transaction will be lost. We need to log original 
> exception and catch probable tx.toString() exception.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (IGNITE-8866) Need attempt to upload class until node leave or fail topology by discovery SPI

2018-07-25 Thread Evgenii Zagumennov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-8866?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Evgenii Zagumennov updated IGNITE-8866:
---
Description: 
After one fail attempt to upload a class, client code getting exception:
{noformat}
10:04:46,253 INFO  [stdout] (Thread-732) java.lang.NoClassDefFoundError: 
com/company/deposit_pf_api/core/utils/DplUtils
10:04:46,253 INFO  [stdout] (Thread-732)   at 
com.company.deposit_pf_api.comparators.CommonPredicate.nodeIdIgnite(CommonPredicate.java:225)
10:04:46,253 INFO  [stdout] (Thread-732)   at 
com.company.deposit_pf_api.comparators.CommonPredicate.cacheEntities(CommonPredicate.java:191)
10:04:46,253 INFO  [stdout] (Thread-732)   at 
com.company.deposit_pf_api.comparators.CommonPredicate.(CommonPredicate.java:116)
{noformat}
And log contains some related warnings:
{noformat}
018-06-19 10:04:18.459 [WARN 
][pub-#3308%DPL_GRID%DplGridNodeName%][o.a.i.i.m.d.GridDeploymentCommunication] 
Failed to receive peer response from node within duration 
[node=5861d763-a552-463e-817a-0742f7aad114, duration=5008]
2018-06-19 10:04:18.459 [WARN 
][pub-#3308%DPL_GRID%DplGridNodeName%][o.a.i.i.m.d.GridDeploymentPerVersionStore]
 Failed to send class-loading request to node (is node alive?) 
[node=5861d763-a552-463e-817a-0742f7aad114, 
clsName=com.company.deposit_pf_api.core.utils.DplUtils, 
clsPath=com/company/deposit_pf_api/core/utils/DplUtils.class, 
clsLdrId=370f1361461-5861d763-a552-463e-817a-0742f7aad114, 
parentClsLdr=com.company.dpl.gridgain.ignite.NodeClassLoader@1ce4a752]
{noformat}
I think should to upload class through p2p until node present in topology.

Look at the [^P2PClassDeploymentDelay.java] reproducer.

  was:
After one fail attempt to upload a class, client code getting exception:

{noformat}
10:04:46,253 INFO  [stdout] (Thread-732) java.lang.NoClassDefFoundError: 
ru/sbt/deposit_pf_api/core/utils/DplUtils
10:04:46,253 INFO  [stdout] (Thread-732)   at 
ru.sbt.deposit_pf_api.comparators.CommonPredicate.nodeIdIgnite(CommonPredicate.java:225)
10:04:46,253 INFO  [stdout] (Thread-732)   at 
ru.sbt.deposit_pf_api.comparators.CommonPredicate.cacheEntities(CommonPredicate.java:191)
10:04:46,253 INFO  [stdout] (Thread-732)   at 
ru.sbt.deposit_pf_api.comparators.CommonPredicate.(CommonPredicate.java:116)
{noformat}

And log contains some related warnings:
{noformat}
018-06-19 10:04:18.459 [WARN 
][pub-#3308%DPL_GRID%DplGridNodeName%][o.a.i.i.m.d.GridDeploymentCommunication] 
Failed to receive peer response from node within duration 
[node=5861d763-a552-463e-817a-0742f7aad114, duration=5008]
2018-06-19 10:04:18.459 [WARN 
][pub-#3308%DPL_GRID%DplGridNodeName%][o.a.i.i.m.d.GridDeploymentPerVersionStore]
 Failed to send class-loading request to node (is node alive?) 
[node=5861d763-a552-463e-817a-0742f7aad114, 
clsName=ru.sbt.deposit_pf_api.core.utils.DplUtils, 
clsPath=ru/sbt/deposit_pf_api/core/utils/DplUtils.class, 
clsLdrId=370f1361461-5861d763-a552-463e-817a-0742f7aad114, 
parentClsLdr=com.sbt.dpl.gridgain.ignite.NodeClassLoader@1ce4a752]
{noformat}

I think should to upload class through p2p until node present in topology.

Look at the  [^P2PClassDeploymentDelay.java] reproducer.



> Need attempt to upload class until node leave or fail topology by discovery 
> SPI
> ---
>
> Key: IGNITE-8866
> URL: https://issues.apache.org/jira/browse/IGNITE-8866
> Project: Ignite
>  Issue Type: Bug
>Reporter: Vladislav Pyatkov
>Assignee: Evgenii Zagumennov
>Priority: Major
> Attachments: P2PClassDeploymentDelay.java
>
>
> After one fail attempt to upload a class, client code getting exception:
> {noformat}
> 10:04:46,253 INFO  [stdout] (Thread-732) java.lang.NoClassDefFoundError: 
> com/company/deposit_pf_api/core/utils/DplUtils
> 10:04:46,253 INFO  [stdout] (Thread-732)   at 
> com.company.deposit_pf_api.comparators.CommonPredicate.nodeIdIgnite(CommonPredicate.java:225)
> 10:04:46,253 INFO  [stdout] (Thread-732)   at 
> com.company.deposit_pf_api.comparators.CommonPredicate.cacheEntities(CommonPredicate.java:191)
> 10:04:46,253 INFO  [stdout] (Thread-732)   at 
> com.company.deposit_pf_api.comparators.CommonPredicate.(CommonPredicate.java:116)
> {noformat}
> And log contains some related warnings:
> {noformat}
> 018-06-19 10:04:18.459 [WARN 
> ][pub-#3308%DPL_GRID%DplGridNodeName%][o.a.i.i.m.d.GridDeploymentCommunication]
>  Failed to receive peer response from node within duration 
> [node=5861d763-a552-463e-817a-0742f7aad114, duration=5008]
> 2018-06-19 10:04:18.459 [WARN 
> ][pub-#3308%DPL_GRID%DplGridNodeName%][o.a.i.i.m.d.GridDeploymentPerVersionStore]
>  Failed to send class-loading request to node (is node alive?) 
> [node=5861d763-a552-463e-817a-0742f7aad114, 
> clsName=com.company.deposit_pf_api.core.utils.DplUtils, 
> 

[jira] [Commented] (IGNITE-8866) Need attempt to upload class until node leave or fail topology by discovery SPI

2018-07-24 Thread Evgenii Zagumennov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-8866?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16554252#comment-16554252
 ] 

Evgenii Zagumennov commented on IGNITE-8866:


https://ci.ignite.apache.org/viewLog.html?buildId=1534268=buildResultsDiv=IgniteTests24Java8_RunAll

> Need attempt to upload class until node leave or fail topology by discovery 
> SPI
> ---
>
> Key: IGNITE-8866
> URL: https://issues.apache.org/jira/browse/IGNITE-8866
> Project: Ignite
>  Issue Type: Bug
>Reporter: Vladislav Pyatkov
>Assignee: Evgenii Zagumennov
>Priority: Major
> Attachments: P2PClassDeploymentDelay.java
>
>
> After one fail attempt to upload a class, client code getting exception:
> {noformat}
> 10:04:46,253 INFO  [stdout] (Thread-732) java.lang.NoClassDefFoundError: 
> ru/sbt/deposit_pf_api/core/utils/DplUtils
> 10:04:46,253 INFO  [stdout] (Thread-732)   at 
> ru.sbt.deposit_pf_api.comparators.CommonPredicate.nodeIdIgnite(CommonPredicate.java:225)
> 10:04:46,253 INFO  [stdout] (Thread-732)   at 
> ru.sbt.deposit_pf_api.comparators.CommonPredicate.cacheEntities(CommonPredicate.java:191)
> 10:04:46,253 INFO  [stdout] (Thread-732)   at 
> ru.sbt.deposit_pf_api.comparators.CommonPredicate.(CommonPredicate.java:116)
> {noformat}
> And log contains some related warnings:
> {noformat}
> 018-06-19 10:04:18.459 [WARN 
> ][pub-#3308%DPL_GRID%DplGridNodeName%][o.a.i.i.m.d.GridDeploymentCommunication]
>  Failed to receive peer response from node within duration 
> [node=5861d763-a552-463e-817a-0742f7aad114, duration=5008]
> 2018-06-19 10:04:18.459 [WARN 
> ][pub-#3308%DPL_GRID%DplGridNodeName%][o.a.i.i.m.d.GridDeploymentPerVersionStore]
>  Failed to send class-loading request to node (is node alive?) 
> [node=5861d763-a552-463e-817a-0742f7aad114, 
> clsName=ru.sbt.deposit_pf_api.core.utils.DplUtils, 
> clsPath=ru/sbt/deposit_pf_api/core/utils/DplUtils.class, 
> clsLdrId=370f1361461-5861d763-a552-463e-817a-0742f7aad114, 
> parentClsLdr=com.sbt.dpl.gridgain.ignite.NodeClassLoader@1ce4a752]
> {noformat}
> I think should to upload class through p2p until node present in topology.
> Look at the  [^P2PClassDeploymentDelay.java] reproducer.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (IGNITE-9048) Local SqlFieldsQuery on client leads to exception

2018-07-21 Thread Evgenii Zagumennov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-9048?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Evgenii Zagumennov updated IGNITE-9048:
---
Description: 
SqlFieldsQuery with setLocal(true) on client node throws exception

 
{code:java}
Caused by: java.lang.NullPointerException
    at 
org.apache.ignite.internal.processors.query.h2.database.H2TreeIndex.segmentsCount(H2TreeIndex.java:187)
    at 
org.apache.ignite.internal.processors.query.h2.opt.GridH2IndexBase.threadLocalSegment(GridH2IndexBase.java:182)
    at 
org.apache.ignite.internal.processors.query.h2.database.H2TreeIndex.find(H2TreeIndex.java:195)
    at org.h2.index.BaseIndex.find(BaseIndex.java:128)
    at org.h2.index.IndexCursor.find(IndexCursor.java:169)
    at org.h2.table.TableFilter.next(TableFilter.java:468)
    at 
org.h2.command.dml.Select$LazyResultQueryFlat.fetchNextRow(Select.java:1452)
    at org.h2.result.LazyResult.hasNext(LazyResult.java:79)
    at org.h2.result.LazyResult.next(LazyResult.java:59)
    at org.h2.command.dml.Select.queryFlat(Select.java:519)
    at org.h2.command.dml.Select.queryWithoutCache(Select.java:625)
    at org.h2.command.dml.Query.queryWithoutCacheLazyCheck(Query.java:114)
    at org.h2.command.dml.Query.query(Query.java:352)
    at org.h2.command.dml.Query.query(Query.java:333)
    at org.h2.command.CommandContainer.query(CommandContainer.java:113)
    at org.h2.command.Command.executeQuery(Command.java:201)
    ... 18 more{code}
 

*Reproducer "LocalSqlFieldsQueryOnClientFailTest.java" is in the attachement*

  was:
SqlFieldsQuery with setLocal(true) on client node throws exception

 
{code:java}
Caused by: java.lang.NullPointerException
    at 
org.apache.ignite.internal.processors.query.h2.database.H2TreeIndex.segmentsCount(H2TreeIndex.java:187)
    at 
org.apache.ignite.internal.processors.query.h2.opt.GridH2IndexBase.threadLocalSegment(GridH2IndexBase.java:182)
    at 
org.apache.ignite.internal.processors.query.h2.database.H2TreeIndex.find(H2TreeIndex.java:195)
    at org.h2.index.BaseIndex.find(BaseIndex.java:128)
    at org.h2.index.IndexCursor.find(IndexCursor.java:169)
    at org.h2.table.TableFilter.next(TableFilter.java:468)
    at 
org.h2.command.dml.Select$LazyResultQueryFlat.fetchNextRow(Select.java:1452)
    at org.h2.result.LazyResult.hasNext(LazyResult.java:79)
    at org.h2.result.LazyResult.next(LazyResult.java:59)
    at org.h2.command.dml.Select.queryFlat(Select.java:519)
    at org.h2.command.dml.Select.queryWithoutCache(Select.java:625)
    at org.h2.command.dml.Query.queryWithoutCacheLazyCheck(Query.java:114)
    at org.h2.command.dml.Query.query(Query.java:352)
    at org.h2.command.dml.Query.query(Query.java:333)
    at org.h2.command.CommandContainer.query(CommandContainer.java:113)
    at org.h2.command.Command.executeQuery(Command.java:201)
    ... 18 more{code}
 

Reproducer "LocalSqlFieldsQueryOnClientFailTest.java" is in the attachement


> Local SqlFieldsQuery on client leads to exception
> -
>
> Key: IGNITE-9048
> URL: https://issues.apache.org/jira/browse/IGNITE-9048
> Project: Ignite
>  Issue Type: Bug
>  Components: cache, clients, sql
>Reporter: Evgenii Zagumennov
>Priority: Major
> Attachments: LocalSqlFieldsQueryOnClientFailTest.java
>
>
> SqlFieldsQuery with setLocal(true) on client node throws exception
>  
> {code:java}
> Caused by: java.lang.NullPointerException
>     at 
> org.apache.ignite.internal.processors.query.h2.database.H2TreeIndex.segmentsCount(H2TreeIndex.java:187)
>     at 
> org.apache.ignite.internal.processors.query.h2.opt.GridH2IndexBase.threadLocalSegment(GridH2IndexBase.java:182)
>     at 
> org.apache.ignite.internal.processors.query.h2.database.H2TreeIndex.find(H2TreeIndex.java:195)
>     at org.h2.index.BaseIndex.find(BaseIndex.java:128)
>     at org.h2.index.IndexCursor.find(IndexCursor.java:169)
>     at org.h2.table.TableFilter.next(TableFilter.java:468)
>     at 
> org.h2.command.dml.Select$LazyResultQueryFlat.fetchNextRow(Select.java:1452)
>     at org.h2.result.LazyResult.hasNext(LazyResult.java:79)
>     at org.h2.result.LazyResult.next(LazyResult.java:59)
>     at org.h2.command.dml.Select.queryFlat(Select.java:519)
>     at org.h2.command.dml.Select.queryWithoutCache(Select.java:625)
>     at org.h2.command.dml.Query.queryWithoutCacheLazyCheck(Query.java:114)
>     at org.h2.command.dml.Query.query(Query.java:352)
>     at org.h2.command.dml.Query.query(Query.java:333)
>     at org.h2.command.CommandContainer.query(CommandContainer.java:113)
>     at org.h2.command.Command.executeQuery(Command.java:201)
>     ... 18 more{code}
>  
> *Reproducer "LocalSqlFieldsQueryOnClientFailTest.java" is in the attachement*



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (IGNITE-9048) Local SqlFieldsQuery on client leads to exception

2018-07-21 Thread Evgenii Zagumennov (JIRA)
Evgenii Zagumennov created IGNITE-9048:
--

 Summary: Local SqlFieldsQuery on client leads to exception
 Key: IGNITE-9048
 URL: https://issues.apache.org/jira/browse/IGNITE-9048
 Project: Ignite
  Issue Type: Bug
  Components: cache, clients, sql
Reporter: Evgenii Zagumennov
 Attachments: LocalSqlFieldsQueryOnClientFailTest.java

SqlFieldsQuery with setLocal(true) on client node throws exception

 
{code:java}
Caused by: java.lang.NullPointerException
    at 
org.apache.ignite.internal.processors.query.h2.database.H2TreeIndex.segmentsCount(H2TreeIndex.java:187)
    at 
org.apache.ignite.internal.processors.query.h2.opt.GridH2IndexBase.threadLocalSegment(GridH2IndexBase.java:182)
    at 
org.apache.ignite.internal.processors.query.h2.database.H2TreeIndex.find(H2TreeIndex.java:195)
    at org.h2.index.BaseIndex.find(BaseIndex.java:128)
    at org.h2.index.IndexCursor.find(IndexCursor.java:169)
    at org.h2.table.TableFilter.next(TableFilter.java:468)
    at 
org.h2.command.dml.Select$LazyResultQueryFlat.fetchNextRow(Select.java:1452)
    at org.h2.result.LazyResult.hasNext(LazyResult.java:79)
    at org.h2.result.LazyResult.next(LazyResult.java:59)
    at org.h2.command.dml.Select.queryFlat(Select.java:519)
    at org.h2.command.dml.Select.queryWithoutCache(Select.java:625)
    at org.h2.command.dml.Query.queryWithoutCacheLazyCheck(Query.java:114)
    at org.h2.command.dml.Query.query(Query.java:352)
    at org.h2.command.dml.Query.query(Query.java:333)
    at org.h2.command.CommandContainer.query(CommandContainer.java:113)
    at org.h2.command.Command.executeQuery(Command.java:201)
    ... 18 more{code}
 

Reproducer "LocalSqlFieldsQueryOnClientFailTest.java" is in the attachement



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (IGNITE-8866) Need attempt to upload class until node leave or fail topology by discovery SPI

2018-07-20 Thread Evgenii Zagumennov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-8866?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16550744#comment-16550744
 ] 

Evgenii Zagumennov commented on IGNITE-8866:


I guess, sending requests untill a node leave the topology is a wrong way. 
Resource node may not respond because of GC-pause or another reason, but number 
of requests will grow. I've added to 
GridDeploymentCommunication#sendResourceRequest ability to retry to send 
request. Number of retries is definable.

> Need attempt to upload class until node leave or fail topology by discovery 
> SPI
> ---
>
> Key: IGNITE-8866
> URL: https://issues.apache.org/jira/browse/IGNITE-8866
> Project: Ignite
>  Issue Type: Bug
>Reporter: Vladislav Pyatkov
>Assignee: Evgenii Zagumennov
>Priority: Major
> Attachments: P2PClassDeploymentDelay.java
>
>
> After one fail attempt to upload a class, client code getting exception:
> {noformat}
> 10:04:46,253 INFO  [stdout] (Thread-732) java.lang.NoClassDefFoundError: 
> ru/sbt/deposit_pf_api/core/utils/DplUtils
> 10:04:46,253 INFO  [stdout] (Thread-732)   at 
> ru.sbt.deposit_pf_api.comparators.CommonPredicate.nodeIdIgnite(CommonPredicate.java:225)
> 10:04:46,253 INFO  [stdout] (Thread-732)   at 
> ru.sbt.deposit_pf_api.comparators.CommonPredicate.cacheEntities(CommonPredicate.java:191)
> 10:04:46,253 INFO  [stdout] (Thread-732)   at 
> ru.sbt.deposit_pf_api.comparators.CommonPredicate.(CommonPredicate.java:116)
> {noformat}
> And log contains some related warnings:
> {noformat}
> 018-06-19 10:04:18.459 [WARN 
> ][pub-#3308%DPL_GRID%DplGridNodeName%][o.a.i.i.m.d.GridDeploymentCommunication]
>  Failed to receive peer response from node within duration 
> [node=5861d763-a552-463e-817a-0742f7aad114, duration=5008]
> 2018-06-19 10:04:18.459 [WARN 
> ][pub-#3308%DPL_GRID%DplGridNodeName%][o.a.i.i.m.d.GridDeploymentPerVersionStore]
>  Failed to send class-loading request to node (is node alive?) 
> [node=5861d763-a552-463e-817a-0742f7aad114, 
> clsName=ru.sbt.deposit_pf_api.core.utils.DplUtils, 
> clsPath=ru/sbt/deposit_pf_api/core/utils/DplUtils.class, 
> clsLdrId=370f1361461-5861d763-a552-463e-817a-0742f7aad114, 
> parentClsLdr=com.sbt.dpl.gridgain.ignite.NodeClassLoader@1ce4a752]
> {noformat}
> I think should to upload class through p2p until node present in topology.
> Look at the  [^P2PClassDeploymentDelay.java] reproducer.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (IGNITE-8738) Improve coordinator change information

2018-07-09 Thread Evgenii Zagumennov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-8738?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16536602#comment-16536602
 ] 

Evgenii Zagumennov commented on IGNITE-8738:


[~agoncharuk], I've implemented your suggestion. 

> Improve coordinator change information
> --
>
> Key: IGNITE-8738
> URL: https://issues.apache.org/jira/browse/IGNITE-8738
> Project: Ignite
>  Issue Type: Improvement
>Reporter: Alexey Goncharuk
>Assignee: Evgenii Zagumennov
>Priority: Major
>  Time Spent: 16h
>  Remaining Estimate: 0h
>
> When topology changes and coordinator is also changed, we need to print out 
> this alongside with topology information.
> An example of such message:
> {{Coordinator changed [prev=node.tostring(), cur=node.tostr()]}}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (IGNITE-8939) Transaction string reprsentation unhandled exception

2018-07-05 Thread Evgenii Zagumennov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-8939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Evgenii Zagumennov updated IGNITE-8939:
---
Description: 
IgniteTxHandler.finishDhtLocal() (IgniteTxHandler.java:957)
{code:java}
U.error(log, "Failed completing transaction [commit=" + req.commit() + ", tx=" 
+ *tx* + ']', e);{code}
tx.toString() can lead to excepion (in GridToStringBuilder.toStringImpl()), and 
original exception in transaction will be lost. We need to log original 
exception and catch probable tx.toString() exception.

  was:
IgniteTxHandler.finishDhtLocal() (IgniteTxHandler.java:957)

U.error(log, "Failed completing transaction [commit=" + req.commit() + ", tx=" 
+ *tx* + ']', e);

tx.toString() can lead to excepion (because of errors in metadata), and 
original exception in transaction will be lost. We need to log original 
exception and catch probable tx.toString() exception.


> Transaction string reprsentation unhandled exception
> 
>
> Key: IGNITE-8939
> URL: https://issues.apache.org/jira/browse/IGNITE-8939
> Project: Ignite
>  Issue Type: Bug
>  Components: binary
>Reporter: Evgenii Zagumennov
>Priority: Major
>
> IgniteTxHandler.finishDhtLocal() (IgniteTxHandler.java:957)
> {code:java}
> U.error(log, "Failed completing transaction [commit=" + req.commit() + ", 
> tx=" + *tx* + ']', e);{code}
> tx.toString() can lead to excepion (in GridToStringBuilder.toStringImpl()), 
> and original exception in transaction will be lost. We need to log original 
> exception and catch probable tx.toString() exception.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (IGNITE-8939) Transaction string reprsentation unhandled exception

2018-07-05 Thread Evgenii Zagumennov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-8939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Evgenii Zagumennov updated IGNITE-8939:
---
Description: 
IgniteTxHandler.finishDhtLocal() (IgniteTxHandler.java:957)

U.error(log, "Failed completing transaction [commit=" + req.commit() + ", tx=" 
+ *tx* + ']', e);

tx.toString() can lead to excepion (because of errors in metadata), and 
original exception in transaction will be lost. We need to log original 
exception and catch probable tx.toString() exception.

> Transaction string reprsentation unhandled exception
> 
>
> Key: IGNITE-8939
> URL: https://issues.apache.org/jira/browse/IGNITE-8939
> Project: Ignite
>  Issue Type: Bug
>  Components: binary
>Reporter: Evgenii Zagumennov
>Priority: Major
>
> IgniteTxHandler.finishDhtLocal() (IgniteTxHandler.java:957)
> U.error(log, "Failed completing transaction [commit=" + req.commit() + ", 
> tx=" + *tx* + ']', e);
> tx.toString() can lead to excepion (because of errors in metadata), and 
> original exception in transaction will be lost. We need to log original 
> exception and catch probable tx.toString() exception.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (IGNITE-8939) Transaction string reprsentation unhandled exception

2018-07-05 Thread Evgenii Zagumennov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-8939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Evgenii Zagumennov updated IGNITE-8939:
---
Summary: Transaction string reprsentation unhandled exception  (was: Error 
in binary meta data after RESTORE with wal_compaction)

> Transaction string reprsentation unhandled exception
> 
>
> Key: IGNITE-8939
> URL: https://issues.apache.org/jira/browse/IGNITE-8939
> Project: Ignite
>  Issue Type: Bug
>  Components: binary
>Reporter: Evgenii Zagumennov
>Priority: Major
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (IGNITE-8939) Error in binary meta data after RESTORE with wal_compaction

2018-07-05 Thread Evgenii Zagumennov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-8939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Evgenii Zagumennov updated IGNITE-8939:
---
Description: (was: Steps to reproduce:
 # CREATE snapshot
 # RESTORE from snapshot
 # CHECK snapshot
 # Restart client nodes
 # Run some jobs on grid

Node fails with exception:

 
 org.apache.ignite.binary.BinaryObjectException: Cannot find schema for object 
with compact footer [typeName=...]
     at 
org.apache.ignite.internal.binary.BinaryReaderExImpl.getOrCreateSchema(BinaryReaderExImpl.java:2033)
     at 
org.apache.ignite.internal.binary.BinaryReaderExImpl.(BinaryReaderExImpl.java:286)
     at 
org.apache.ignite.internal.binary.BinaryReaderExImpl.(BinaryReaderExImpl.java:185)
     at 
org.apache.ignite.internal.binary.BinaryObjectImpl.reader(BinaryObjectImpl.java:832)
     at 
org.apache.ignite.internal.binary.BinaryObjectImpl.reader(BinaryObjectImpl.java:846)
     at 
org.apache.ignite.internal.binary.BinaryObjectImpl.field(BinaryObjectImpl.java:626)
     at 
org.apache.ignite.internal.binary.BinaryObjectExImpl.toString(BinaryObjectExImpl.java:225)
     at 
org.apache.ignite.internal.binary.BinaryObjectExImpl.appendValue(BinaryObjectExImpl.java:280)
     at 
org.apache.ignite.internal.binary.BinaryObjectExImpl.toString(BinaryObjectExImpl.java:229)
     at 
org.apache.ignite.internal.binary.BinaryObjectExImpl.toString(BinaryObjectExImpl.java:186)

 

*wlaModeEnabled = true*)

> Error in binary meta data after RESTORE with wal_compaction
> ---
>
> Key: IGNITE-8939
> URL: https://issues.apache.org/jira/browse/IGNITE-8939
> Project: Ignite
>  Issue Type: Bug
>  Components: binary
>Reporter: Evgenii Zagumennov
>Priority: Major
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (IGNITE-8939) Error in binary meta data after RESTORE with wal_compaction

2018-07-05 Thread Evgenii Zagumennov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-8939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Evgenii Zagumennov updated IGNITE-8939:
---
Description: 
Steps to reproduce:
 # CREATE snapshot
 # RESTORE from snapshot
 # CHECK snapshot
 # Restart client nodes
 # Run some jobs on grid

Node fails with exception:

 
 org.apache.ignite.binary.BinaryObjectException: Cannot find schema for object 
with compact footer [typeName=...]
     at 
org.apache.ignite.internal.binary.BinaryReaderExImpl.getOrCreateSchema(BinaryReaderExImpl.java:2033)
     at 
org.apache.ignite.internal.binary.BinaryReaderExImpl.(BinaryReaderExImpl.java:286)
     at 
org.apache.ignite.internal.binary.BinaryReaderExImpl.(BinaryReaderExImpl.java:185)
     at 
org.apache.ignite.internal.binary.BinaryObjectImpl.reader(BinaryObjectImpl.java:832)
     at 
org.apache.ignite.internal.binary.BinaryObjectImpl.reader(BinaryObjectImpl.java:846)
     at 
org.apache.ignite.internal.binary.BinaryObjectImpl.field(BinaryObjectImpl.java:626)
     at 
org.apache.ignite.internal.binary.BinaryObjectExImpl.toString(BinaryObjectExImpl.java:225)
     at 
org.apache.ignite.internal.binary.BinaryObjectExImpl.appendValue(BinaryObjectExImpl.java:280)
     at 
org.apache.ignite.internal.binary.BinaryObjectExImpl.toString(BinaryObjectExImpl.java:229)
     at 
org.apache.ignite.internal.binary.BinaryObjectExImpl.toString(BinaryObjectExImpl.java:186)

 

*wlaModeEnabled = true*

  was:
Steps to reproduce:
 # CREATE snapshot
 # RESTORE from snapshot
 # CHECK snapshot
 # Restart client nodes
 # Run some jobs on grid

Node fails with exception:

 
org.apache.ignite.binary.BinaryObjectException: Cannot find schema for object 
with compact footer [typeName=...]
    at 
org.apache.ignite.internal.binary.BinaryReaderExImpl.getOrCreateSchema(BinaryReaderExImpl.java:2033)
    at 
org.apache.ignite.internal.binary.BinaryReaderExImpl.(BinaryReaderExImpl.java:286)
    at 
org.apache.ignite.internal.binary.BinaryReaderExImpl.(BinaryReaderExImpl.java:185)
    at 
org.apache.ignite.internal.binary.BinaryObjectImpl.reader(BinaryObjectImpl.java:832)
    at 
org.apache.ignite.internal.binary.BinaryObjectImpl.reader(BinaryObjectImpl.java:846)
    at 
org.apache.ignite.internal.binary.BinaryObjectImpl.field(BinaryObjectImpl.java:626)
    at 
org.apache.ignite.internal.binary.BinaryObjectExImpl.toString(BinaryObjectExImpl.java:225)
    at 
org.apache.ignite.internal.binary.BinaryObjectExImpl.appendValue(BinaryObjectExImpl.java:280)
    at 
org.apache.ignite.internal.binary.BinaryObjectExImpl.toString(BinaryObjectExImpl.java:229)
    at 
org.apache.ignite.internal.binary.BinaryObjectExImpl.toString(BinaryObjectExImpl.java:186)


> Error in binary meta data after RESTORE with wal_compaction
> ---
>
> Key: IGNITE-8939
> URL: https://issues.apache.org/jira/browse/IGNITE-8939
> Project: Ignite
>  Issue Type: Bug
>  Components: binary
>Reporter: Evgenii Zagumennov
>Priority: Major
>
> Steps to reproduce:
>  # CREATE snapshot
>  # RESTORE from snapshot
>  # CHECK snapshot
>  # Restart client nodes
>  # Run some jobs on grid
> Node fails with exception:
>  
>  org.apache.ignite.binary.BinaryObjectException: Cannot find schema for 
> object with compact footer [typeName=...]
>      at 
> org.apache.ignite.internal.binary.BinaryReaderExImpl.getOrCreateSchema(BinaryReaderExImpl.java:2033)
>      at 
> org.apache.ignite.internal.binary.BinaryReaderExImpl.(BinaryReaderExImpl.java:286)
>      at 
> org.apache.ignite.internal.binary.BinaryReaderExImpl.(BinaryReaderExImpl.java:185)
>      at 
> org.apache.ignite.internal.binary.BinaryObjectImpl.reader(BinaryObjectImpl.java:832)
>      at 
> org.apache.ignite.internal.binary.BinaryObjectImpl.reader(BinaryObjectImpl.java:846)
>      at 
> org.apache.ignite.internal.binary.BinaryObjectImpl.field(BinaryObjectImpl.java:626)
>      at 
> org.apache.ignite.internal.binary.BinaryObjectExImpl.toString(BinaryObjectExImpl.java:225)
>      at 
> org.apache.ignite.internal.binary.BinaryObjectExImpl.appendValue(BinaryObjectExImpl.java:280)
>      at 
> org.apache.ignite.internal.binary.BinaryObjectExImpl.toString(BinaryObjectExImpl.java:229)
>      at 
> org.apache.ignite.internal.binary.BinaryObjectExImpl.toString(BinaryObjectExImpl.java:186)
>  
> *wlaModeEnabled = true*



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (IGNITE-8939) Error in binary meta data after RESTORE with wal_compaction

2018-07-05 Thread Evgenii Zagumennov (JIRA)
Evgenii Zagumennov created IGNITE-8939:
--

 Summary: Error in binary meta data after RESTORE with 
wal_compaction
 Key: IGNITE-8939
 URL: https://issues.apache.org/jira/browse/IGNITE-8939
 Project: Ignite
  Issue Type: Bug
  Components: binary
Reporter: Evgenii Zagumennov


Steps to reproduce:
 # CREATE snapshot
 # RESTORE from snapshot
 # CHECK snapshot
 # Restart client nodes
 # Run some jobs on grid

Node fails with exception:

 
org.apache.ignite.binary.BinaryObjectException: Cannot find schema for object 
with compact footer [typeName=...]
    at 
org.apache.ignite.internal.binary.BinaryReaderExImpl.getOrCreateSchema(BinaryReaderExImpl.java:2033)
    at 
org.apache.ignite.internal.binary.BinaryReaderExImpl.(BinaryReaderExImpl.java:286)
    at 
org.apache.ignite.internal.binary.BinaryReaderExImpl.(BinaryReaderExImpl.java:185)
    at 
org.apache.ignite.internal.binary.BinaryObjectImpl.reader(BinaryObjectImpl.java:832)
    at 
org.apache.ignite.internal.binary.BinaryObjectImpl.reader(BinaryObjectImpl.java:846)
    at 
org.apache.ignite.internal.binary.BinaryObjectImpl.field(BinaryObjectImpl.java:626)
    at 
org.apache.ignite.internal.binary.BinaryObjectExImpl.toString(BinaryObjectExImpl.java:225)
    at 
org.apache.ignite.internal.binary.BinaryObjectExImpl.appendValue(BinaryObjectExImpl.java:280)
    at 
org.apache.ignite.internal.binary.BinaryObjectExImpl.toString(BinaryObjectExImpl.java:229)
    at 
org.apache.ignite.internal.binary.BinaryObjectExImpl.toString(BinaryObjectExImpl.java:186)



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Assigned] (IGNITE-8866) Need attempt to upload class until node leave or fail topology by discovery SPI

2018-07-05 Thread Evgenii Zagumennov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-8866?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Evgenii Zagumennov reassigned IGNITE-8866:
--

Assignee: Evgenii Zagumennov

> Need attempt to upload class until node leave or fail topology by discovery 
> SPI
> ---
>
> Key: IGNITE-8866
> URL: https://issues.apache.org/jira/browse/IGNITE-8866
> Project: Ignite
>  Issue Type: Bug
>Reporter: Vladislav Pyatkov
>Assignee: Evgenii Zagumennov
>Priority: Major
> Attachments: P2PClassDeploymentDelay.java
>
>
> After one fail attempt to upload a class, client code getting exception:
> {noformat}
> 10:04:46,253 INFO  [stdout] (Thread-732) java.lang.NoClassDefFoundError: 
> ru/sbt/deposit_pf_api/core/utils/DplUtils
> 10:04:46,253 INFO  [stdout] (Thread-732)   at 
> ru.sbt.deposit_pf_api.comparators.CommonPredicate.nodeIdIgnite(CommonPredicate.java:225)
> 10:04:46,253 INFO  [stdout] (Thread-732)   at 
> ru.sbt.deposit_pf_api.comparators.CommonPredicate.cacheEntities(CommonPredicate.java:191)
> 10:04:46,253 INFO  [stdout] (Thread-732)   at 
> ru.sbt.deposit_pf_api.comparators.CommonPredicate.(CommonPredicate.java:116)
> {noformat}
> And log contains some related warnings:
> {noformat}
> 018-06-19 10:04:18.459 [WARN 
> ][pub-#3308%DPL_GRID%DplGridNodeName%][o.a.i.i.m.d.GridDeploymentCommunication]
>  Failed to receive peer response from node within duration 
> [node=5861d763-a552-463e-817a-0742f7aad114, duration=5008]
> 2018-06-19 10:04:18.459 [WARN 
> ][pub-#3308%DPL_GRID%DplGridNodeName%][o.a.i.i.m.d.GridDeploymentPerVersionStore]
>  Failed to send class-loading request to node (is node alive?) 
> [node=5861d763-a552-463e-817a-0742f7aad114, 
> clsName=ru.sbt.deposit_pf_api.core.utils.DplUtils, 
> clsPath=ru/sbt/deposit_pf_api/core/utils/DplUtils.class, 
> clsLdrId=370f1361461-5861d763-a552-463e-817a-0742f7aad114, 
> parentClsLdr=com.sbt.dpl.gridgain.ignite.NodeClassLoader@1ce4a752]
> {noformat}
> I think should to upload class through p2p until node present in topology.
> Look at the  [^P2PClassDeploymentDelay.java] reproducer.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (IGNITE-8738) Improve coordinator change information

2018-06-27 Thread Evgenii Zagumennov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-8738?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16524987#comment-16524987
 ] 

Evgenii Zagumennov commented on IGNITE-8738:


After discussion with [~ivandasch], removed filed currCrd and used 
DiscoCache#oldestAliveServerNode. That should prevent race conditions.

> Improve coordinator change information
> --
>
> Key: IGNITE-8738
> URL: https://issues.apache.org/jira/browse/IGNITE-8738
> Project: Ignite
>  Issue Type: Improvement
>Reporter: Alexey Goncharuk
>Assignee: Evgenii Zagumennov
>Priority: Major
>
> When topology changes and coordinator is also changed, we need to print out 
> this alongside with topology information.
> An example of such message:
> {{Coordinator changed [prev=node.tostring(), cur=node.tostr()]}}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (IGNITE-8798) Move transaction recovery logging to INFO level

2018-06-15 Thread Evgenii Zagumennov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-8798?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16514113#comment-16514113
 ] 

Evgenii Zagumennov commented on IGNITE-8798:


[~ascherbakov],

I've implemented your remarks and squashed commits.

> Move transaction recovery logging to INFO level
> ---
>
> Key: IGNITE-8798
> URL: https://issues.apache.org/jira/browse/IGNITE-8798
> Project: Ignite
>  Issue Type: Improvement
>Reporter: Alexey Goncharuk
>Assignee: Evgenii Zagumennov
>Priority: Major
>
> Currently we log transaction recovery state changes to {{DEBUG}}, however, 
> this information is critically important for production deployment and 
> incident analysis. I suggest to move corresponding logging 
> ({{GridCacheTxRecoveryFuture}} and surrounding code) to {{INFO}} level.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Assigned] (IGNITE-8798) Move transaction recovery logging to INFO level

2018-06-15 Thread Evgenii Zagumennov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-8798?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Evgenii Zagumennov reassigned IGNITE-8798:
--

Assignee: Evgenii Zagumennov

> Move transaction recovery logging to INFO level
> ---
>
> Key: IGNITE-8798
> URL: https://issues.apache.org/jira/browse/IGNITE-8798
> Project: Ignite
>  Issue Type: Improvement
>Reporter: Alexey Goncharuk
>Assignee: Evgenii Zagumennov
>Priority: Major
>
> Currently we log transaction recovery state changes to {{DEBUG}}, however, 
> this information is critically important for production deployment and 
> incident analysis. I suggest to move corresponding logging 
> ({{GridCacheTxRecoveryFuture}} and surrounding code) to {{INFO}} level.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Assigned] (IGNITE-8738) Improve coordinator change information

2018-06-15 Thread Evgenii Zagumennov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-8738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Evgenii Zagumennov reassigned IGNITE-8738:
--

Assignee: Evgenii Zagumennov

> Improve coordinator change information
> --
>
> Key: IGNITE-8738
> URL: https://issues.apache.org/jira/browse/IGNITE-8738
> Project: Ignite
>  Issue Type: Improvement
>Reporter: Alexey Goncharuk
>Assignee: Evgenii Zagumennov
>Priority: Major
>
> When topology changes and coordinator is also changed, we need to print out 
> this alongside with topology information.
> An example of such message:
> {{Coordinator changed [prev=node.tostring(), cur=node.tostr()]}}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Assigned] (IGNITE-8745) Add ability to monitor TCP discovery ring information

2018-06-08 Thread Evgenii Zagumennov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-8745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Evgenii Zagumennov reassigned IGNITE-8745:
--

Assignee: Evgenii Zagumennov

> Add ability to monitor TCP discovery ring information
> -
>
> Key: IGNITE-8745
> URL: https://issues.apache.org/jira/browse/IGNITE-8745
> Project: Ignite
>  Issue Type: Improvement
>Reporter: Alexey Goncharuk
>Assignee: Evgenii Zagumennov
>Priority: Major
>
> We should add the following modifications:
> 1) Add a method on TCP discovery MBean to dump the ring structure on local 
> node and on all nodes in the grid
> 2) Make tcp-disco-worker thread name reflect the node to which the local node 
> is connected
> 3) Add a method on TCP discovery MBean to return current topology version



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Comment Edited] (IGNITE-7766) Ignite Queries 2: Test always failed IgniteCacheQueryNodeRestartTxSelfTest.testRestarts

2018-06-01 Thread Evgenii Zagumennov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-7766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16497766#comment-16497766
 ] 

Evgenii Zagumennov edited comment on IGNITE-7766 at 6/1/18 9:15 AM:


[~ilantukh], [~dpavlov], i've implemented fix. Now, if the first cache in the 
cacheIds collection is not partitioned, we find it (if it is in cacheIds) and 
move to the index 0. 


was (Author: ezagumennov):
[~ilantukh], i've implemented your advice. Now, if the first cache in the 
cacheIds collection is not partitioned, we find it (if it is in cacheIds) and 
move to the index 0. 

> Ignite Queries 2: Test always failed 
> IgniteCacheQueryNodeRestartTxSelfTest.testRestarts
> ---
>
> Key: IGNITE-7766
> URL: https://issues.apache.org/jira/browse/IGNITE-7766
> Project: Ignite
>  Issue Type: Task
>  Components: sql
>Reporter: Dmitriy Pavlov
>Assignee: Evgenii Zagumennov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
>
> Ignite Queries 2 
>  IgniteBinaryCacheQueryTestSuite2: 
> IgniteCacheQueryNodeRestartTxSelfTest.testRestarts (fail rate 76,1%)
> IgniteCacheQueryNodeRestartTxSelfTest.testRestarts 
>  Current failure: refs/heads/master #345 No changes 11 Feb 18 03:03
> junit.framework.AssertionFailedError: On large page size must retry.
> Last runs fails with 100% probability.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (IGNITE-7766) Ignite Queries 2: Test always failed IgniteCacheQueryNodeRestartTxSelfTest.testRestarts

2018-06-01 Thread Evgenii Zagumennov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-7766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16497766#comment-16497766
 ] 

Evgenii Zagumennov commented on IGNITE-7766:


[~ilantukh], i've implemented your advice. Now, if the first cache in the 
cacheIds collection is not partitioned, we find it (if it is in cacheIds) and 
move to the index 0. 

> Ignite Queries 2: Test always failed 
> IgniteCacheQueryNodeRestartTxSelfTest.testRestarts
> ---
>
> Key: IGNITE-7766
> URL: https://issues.apache.org/jira/browse/IGNITE-7766
> Project: Ignite
>  Issue Type: Task
>  Components: sql
>Reporter: Dmitriy Pavlov
>Assignee: Evgenii Zagumennov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
>
> Ignite Queries 2 
>  IgniteBinaryCacheQueryTestSuite2: 
> IgniteCacheQueryNodeRestartTxSelfTest.testRestarts (fail rate 76,1%)
> IgniteCacheQueryNodeRestartTxSelfTest.testRestarts 
>  Current failure: refs/heads/master #345 No changes 11 Feb 18 03:03
> junit.framework.AssertionFailedError: On large page size must retry.
> Last runs fails with 100% probability.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (IGNITE-7766) Ignite Queries 2: Test always failed IgniteCacheQueryNodeRestartTxSelfTest.testRestarts

2018-05-29 Thread Evgenii Zagumennov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-7766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16493235#comment-16493235
 ] 

Evgenii Zagumennov commented on IGNITE-7766:


Hi, [~dpavlov]. Ok, got it. Moved to *Patch Available*

> Ignite Queries 2: Test always failed 
> IgniteCacheQueryNodeRestartTxSelfTest.testRestarts
> ---
>
> Key: IGNITE-7766
> URL: https://issues.apache.org/jira/browse/IGNITE-7766
> Project: Ignite
>  Issue Type: Task
>  Components: sql
>Reporter: Dmitriy Pavlov
>Assignee: Evgenii Zagumennov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
>
>Ignite Queries 2  
>  IgniteBinaryCacheQueryTestSuite2: 
> IgniteCacheQueryNodeRestartTxSelfTest.testRestarts (fail rate 76,1%) 
> IgniteCacheQueryNodeRestartTxSelfTest.testRestarts
>   Current failure:refs/heads/master   #345No changes  
> 11 Feb 18 03:03
>
> junit.framework.AssertionFailedError: On large page size must retry.
>  
> Last runs fails with 100% probability.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Reopened] (IGNITE-7766) Ignite Queries 2: Test always failed IgniteCacheQueryNodeRestartTxSelfTest.testRestarts

2018-05-29 Thread Evgenii Zagumennov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-7766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Evgenii Zagumennov reopened IGNITE-7766:


> Ignite Queries 2: Test always failed 
> IgniteCacheQueryNodeRestartTxSelfTest.testRestarts
> ---
>
> Key: IGNITE-7766
> URL: https://issues.apache.org/jira/browse/IGNITE-7766
> Project: Ignite
>  Issue Type: Task
>  Components: sql
>Reporter: Dmitriy Pavlov
>Assignee: Evgenii Zagumennov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
>
>Ignite Queries 2  
>  IgniteBinaryCacheQueryTestSuite2: 
> IgniteCacheQueryNodeRestartTxSelfTest.testRestarts (fail rate 76,1%) 
> IgniteCacheQueryNodeRestartTxSelfTest.testRestarts
>   Current failure:refs/heads/master   #345No changes  
> 11 Feb 18 03:03
>
> junit.framework.AssertionFailedError: On large page size must retry.
>  
> Last runs fails with 100% probability.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Issue Comment Deleted] (IGNITE-7766) Ignite Queries 2: Test always failed IgniteCacheQueryNodeRestartTxSelfTest.testRestarts

2018-05-16 Thread Evgenii Zagumennov (JIRA)

 [ 
https://issues.apache.org/jira/browse/IGNITE-7766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Evgenii Zagumennov updated IGNITE-7766:
---
Comment: was deleted

(was: I've investigated the issue (with "git bisect"). This test started to 
fail after [IGNITE-7195|https://issues.apache.org/jira/browse/IGNITE-7195] was 
fixed. I think, the subject test 
(IgniteCacheQueryNodeRestartTxSelfTest.testRestarts) is not actual anymore.)

> Ignite Queries 2: Test always failed 
> IgniteCacheQueryNodeRestartTxSelfTest.testRestarts
> ---
>
> Key: IGNITE-7766
> URL: https://issues.apache.org/jira/browse/IGNITE-7766
> Project: Ignite
>  Issue Type: Task
>  Components: sql
>Reporter: Dmitriy Pavlov
>Assignee: Evgenii Zagumennov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
>
>Ignite Queries 2  
>  IgniteBinaryCacheQueryTestSuite2: 
> IgniteCacheQueryNodeRestartTxSelfTest.testRestarts (fail rate 76,1%) 
> IgniteCacheQueryNodeRestartTxSelfTest.testRestarts
>   Current failure:refs/heads/master   #345No changes  
> 11 Feb 18 03:03
>
> junit.framework.AssertionFailedError: On large page size must retry.
>  
> Last runs fails with 100% probability.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (IGNITE-7766) Ignite Queries 2: Test always failed IgniteCacheQueryNodeRestartTxSelfTest.testRestarts

2018-05-06 Thread Evgenii Zagumennov (JIRA)

[ 
https://issues.apache.org/jira/browse/IGNITE-7766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16465066#comment-16465066
 ] 

Evgenii Zagumennov commented on IGNITE-7766:


I've investigated the issue (with "git bisect"). This test started to fail 
after [IGNITE-7195|https://issues.apache.org/jira/browse/IGNITE-7195] was 
fixed. I think, the subject test 
(IgniteCacheQueryNodeRestartTxSelfTest.testRestarts) is not actual anymore.

> Ignite Queries 2: Test always failed 
> IgniteCacheQueryNodeRestartTxSelfTest.testRestarts
> ---
>
> Key: IGNITE-7766
> URL: https://issues.apache.org/jira/browse/IGNITE-7766
> Project: Ignite
>  Issue Type: Task
>  Components: sql
>Reporter: Dmitriy Pavlov
>Assignee: Evgenii Zagumennov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
>
>Ignite Queries 2  
>  IgniteBinaryCacheQueryTestSuite2: 
> IgniteCacheQueryNodeRestartTxSelfTest.testRestarts (fail rate 76,1%) 
> IgniteCacheQueryNodeRestartTxSelfTest.testRestarts
>   Current failure:refs/heads/master   #345No changes  
> 11 Feb 18 03:03
>
> junit.framework.AssertionFailedError: On large page size must retry.
>  
> Last runs fails with 100% probability.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Assigned] (IGNITE-7766) Ignite Queries 2: Test always failed IgniteCacheQueryNodeRestartTxSelfTest.testRestarts

2018-04-20 Thread Evgenii Zagumennov (JIRA)

 [ 
https://issues.apache.org/jira/browse/IGNITE-7766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Evgenii Zagumennov reassigned IGNITE-7766:
--

Assignee: Evgenii Zagumennov  (was: Alexei Scherbakov)

> Ignite Queries 2: Test always failed 
> IgniteCacheQueryNodeRestartTxSelfTest.testRestarts
> ---
>
> Key: IGNITE-7766
> URL: https://issues.apache.org/jira/browse/IGNITE-7766
> Project: Ignite
>  Issue Type: Task
>  Components: sql
>Reporter: Dmitriy Pavlov
>Assignee: Evgenii Zagumennov
>Priority: Major
>  Labels: MakeTeamcityGreenAgain
>
>Ignite Queries 2  
>  IgniteBinaryCacheQueryTestSuite2: 
> IgniteCacheQueryNodeRestartTxSelfTest.testRestarts (fail rate 76,1%) 
> IgniteCacheQueryNodeRestartTxSelfTest.testRestarts
>   Current failure:refs/heads/master   #345No changes  
> 11 Feb 18 03:03
>
> junit.framework.AssertionFailedError: On large page size must retry.
>  
> Last runs fails with 100% probability.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Assigned] (IGNITE-8185) There is deadlock when client tries to reconnect and node stop was initiated simultaneously

2018-04-12 Thread Evgenii Zagumennov (JIRA)

 [ 
https://issues.apache.org/jira/browse/IGNITE-8185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Evgenii Zagumennov reassigned IGNITE-8185:
--

Assignee: Evgenii Zagumennov

> There is deadlock when client tries to reconnect and node stop was initiated 
> simultaneously
> ---
>
> Key: IGNITE-8185
> URL: https://issues.apache.org/jira/browse/IGNITE-8185
> Project: Ignite
>  Issue Type: Bug
>Reporter: Eduard Shangareev
>Assignee: Evgenii Zagumennov
>Priority: Critical
>  Labels: MakeTeamcityGreenAgain
>
> Stack traces of these 2 activities which deadlock:
> {code}
> "tcp-client-disco-msg-worker-#493%client%" #7712 prio=5 os_prio=0 
> tid=0x7f5e06731000 nid=0x520f waiting for monitor entry 
> [0x7f5c7adeb000]
>java.lang.Thread.State: BLOCKED (on object monitor)
>   at 
> org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.stop0(IgnitionEx.java:2519)
>   - waiting to lock <0x0007a5fe6138> (a 
> org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance)
>   at 
> org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.stop(IgnitionEx.java:2511)
>   at org.apache.ignite.internal.IgnitionEx.stop(IgnitionEx.java:368)
>   at org.apache.ignite.Ignition.stop(Ignition.java:224)
>   at org.apache.ignite.internal.IgniteKernal.close(IgniteKernal.java:3439)
>   at 
> org.apache.ignite.internal.IgniteKernal.onReconnected(IgniteKernal.java:3926)
>   at 
> org.apache.ignite.internal.managers.discovery.GridDiscoveryManager$4.onDiscovery0(GridDiscoveryManager.java:812)
>   at 
> org.apache.ignite.internal.managers.discovery.GridDiscoveryManager$4.onDiscovery(GridDiscoveryManager.java:578)
>   - locked <0x0007a5fe6358> (a java.lang.Object)
>   at 
> org.apache.ignite.spi.discovery.tcp.ClientImpl$MessageWorker.notifyDiscovery(ClientImpl.java:2425)
>   at 
> org.apache.ignite.spi.discovery.tcp.ClientImpl$MessageWorker.notifyDiscovery(ClientImpl.java:2404)
>   at 
> org.apache.ignite.spi.discovery.tcp.ClientImpl$MessageWorker.processNodeAddFinishedMessage(ClientImpl.java:2049)
>   at 
> org.apache.ignite.spi.discovery.tcp.ClientImpl$MessageWorker.processDiscoveryMessage(ClientImpl.java:1898)
>   at 
> org.apache.ignite.spi.discovery.tcp.ClientImpl$MessageWorker.body(ClientImpl.java:1792)
>   at org.apache.ignite.spi.IgniteSpiThread.run(IgniteSpiThread.java:62)
> {code}
> {code}
> "main" #1 prio=5 os_prio=0 tid=0x7f5e0400e800 nid=0x340c in Object.wait() 
> [0x7f5e09fbb000]
>java.lang.Thread.State: WAITING (on object monitor)
>   at java.lang.Object.wait(Native Method)
>   at java.lang.Thread.join(Thread.java:1252)
>   - locked <0x0007a5fe5f98> (a 
> org.apache.ignite.spi.discovery.tcp.ClientImpl$MessageWorker)
>   at java.lang.Thread.join(Thread.java:1326)
>   at 
> org.apache.ignite.internal.util.IgniteUtils.join(IgniteUtils.java:4603)
>   at 
> org.apache.ignite.spi.discovery.tcp.ClientImpl.spiStop(ClientImpl.java:313)
>   at 
> org.apache.ignite.spi.discovery.tcp.TcpDiscoverySpi.spiStop(TcpDiscoverySpi.java:2061)
>   at 
> org.apache.ignite.internal.managers.GridManagerAdapter.stopSpi(GridManagerAdapter.java:330)
>   at 
> org.apache.ignite.internal.managers.discovery.GridDiscoveryManager.stop(GridDiscoveryManager.java:1612)
>   at org.apache.ignite.internal.IgniteKernal.stop0(IgniteKernal.java:2238)
>   at org.apache.ignite.internal.IgniteKernal.stop(IgniteKernal.java:2116)
>   at 
> org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.stop0(IgnitionEx.java:2548)
>   - locked <0x0007a5fe6138> (a 
> org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance)
>   at 
> org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance.stop(IgnitionEx.java:2511)
>   at org.apache.ignite.internal.IgnitionEx.stop(IgnitionEx.java:368)
>   at org.apache.ignite.Ignition.stop(Ignition.java:224)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest.stopGrid(GridAbstractTest.java:1027)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest.stopAllGrids(GridAbstractTest.java:1067)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest.stopAllGrids(GridAbstractTest.java:1048)
>   at 
> org.gridgain.grid.internal.processors.cache.database.IgniteDbSnapshotNotStableTopologiesTest.afterTest(IgniteDbSnapshotNotStableTopologiesTest.java:241)
>   at 
> org.gridgain.grid.internal.processors.cache.database.IgniteDbSnapshotNotStableTopologiesMultiNodeTest.afterTest(IgniteDbSnapshotNotStableTopologiesMultiNodeTest.java:83)
>   at 
> org.apache.ignite.testframework.junits.GridAbstractTest.tearDown(GridAbstractTest.java:1624)
>   at 
>