[ 
https://issues.apache.org/jira/browse/IGNITE-8497?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ivan Artukhov updated IGNITE-8497:
----------------------------------
    Description: 
*Steps*
Start Ignite server node with enabled PDS (see the attached  
[^example-cache.xml] config file)
Activate the cluster with _./bin/control.sh --activate_
Put some data into cluster (with _CachePutGetExample.java_ for example)
Stop Ignite server node with SIGINT

*Actual result*
Ignite server node invokes the shutdown hook, checkpoint procedure starts, but 
Ignite node *does not wait for checkpoint to finish* and terminates the node.

An excerpt from  [^srv.1.log] :
{noformat}
[2018-05-15 15:20:59,976][INFO ][Thread-3][G] Invoking shutdown hook...
[2018-05-15 15:20:59,979][INFO ][Thread-3][GridTcpRestProtocol] Command 
protocol successfully stopped: TCP binary
[2018-05-15 15:20:59,998][INFO 
][db-checkpoint-thread-#50][GridCacheDatabaseSharedManager] Checkpoint started 
[checkpointId=f0dde95a-6027-40dd-b3f3-4311aa8508c3, startPtr=FileWALPointer 
[idx=0, fileOff=460751, len=40871], checkpointLockWait=0ms, 
checkpointLockHoldTime=6ms, pages=167, reason='timeout']
[2018-05-15 15:21:00,011][INFO ][Thread-3][GridCacheProcessor] Stopped cache 
[cacheName=default]
[2018-05-15 15:21:00,011][INFO ][Thread-3][GridCacheProcessor] Stopped cache 
[cacheName=ignite-sys-cache]
[2018-05-15 15:21:00,012][INFO ][Thread-3][GridCacheProcessor] Stopped cache 
[cacheName=CachePutGetExample]
[2018-05-15 15:21:00,049][INFO ][Thread-3][IgniteKernal] 

>>> +-----------------------------------------------------+
>>> Ignite ver. 2.4.0-SNAPSHOT#19700101-sha1:DEV stopped OK
>>> +-----------------------------------------------------+
>>> Grid uptime: 00:00:36.228
{noformat}

When one starts the node again, the following warning appears in the log ( 
[^srv.2.log] ):
{noformat}
[2018-05-15 15:21:39,848][WARN ][main][GridCacheDatabaseSharedManager] Ignite 
node stopped in the middle of checkpoint. Will restore memory state and finish 
checkpoint on node start.
{noformat}

*Note*
Same behavior when stopping a server node via "Ignition.stop(true)" call.

  was:
*Steps*
Start Ignite server node with enabled PDS (see the attached  
[^example-cache.xml] config file)
Activate the cluster with _./bin/control.sh --activate_
Put some data into cluster (with _CachePutGetExample.java_ for example)
Stop Ignite server node with SIGINT

*Actual result*
Ignite server node invokes the shutdown hook, checkpoint procedure starts, but 
Ignite node *does not wait for checkpoint to finish* and terminates the node.

An excerpt from  [^srv.1.log] :
{noformat}
[2018-05-15 15:20:59,976][INFO ][Thread-3][G] Invoking shutdown hook...
[2018-05-15 15:20:59,979][INFO ][Thread-3][GridTcpRestProtocol] Command 
protocol successfully stopped: TCP binary
[2018-05-15 15:20:59,998][INFO 
][db-checkpoint-thread-#50][GridCacheDatabaseSharedManager] Checkpoint started 
[checkpointId=f0dde95a-6027-40dd-b3f3-4311aa8508c3, startPtr=FileWALPointer 
[idx=0, fileOff=460751, len=40871], checkpointLockWait=0ms, 
checkpointLockHoldTime=6ms, pages=167, reason='timeout']
[2018-05-15 15:21:00,011][INFO ][Thread-3][GridCacheProcessor] Stopped cache 
[cacheName=default]
[2018-05-15 15:21:00,011][INFO ][Thread-3][GridCacheProcessor] Stopped cache 
[cacheName=ignite-sys-cache]
[2018-05-15 15:21:00,012][INFO ][Thread-3][GridCacheProcessor] Stopped cache 
[cacheName=CachePutGetExample]
[2018-05-15 15:21:00,049][INFO ][Thread-3][IgniteKernal] 

>>> +-----------------------------------------------------+
>>> Ignite ver. 2.4.0-SNAPSHOT#19700101-sha1:DEV stopped OK
>>> +-----------------------------------------------------+
>>> Grid uptime: 00:00:36.228
{noformat}

When one starts the node again, the following warning appears in the log ( 
[^srv.2.log] ):
{noformat}
[2018-05-15 15:21:39,848][WARN ][main][GridCacheDatabaseSharedManager] Ignite 
node stopped in the middle of checkpoint. Will restore memory state and finish 
checkpoint on node start.
{noformat}


> Ignite always stops the node in the middle of checkpointing upon receiving a 
> SIGINT
> -----------------------------------------------------------------------------------
>
>                 Key: IGNITE-8497
>                 URL: https://issues.apache.org/jira/browse/IGNITE-8497
>             Project: Ignite
>          Issue Type: Bug
>          Components: persistence
>    Affects Versions: 2.4
>         Environment: Ubuntu 17.10
>            Reporter: Ivan Artukhov
>            Priority: Minor
>         Attachments: example-cache.xml, srv.1.log, srv.2.log
>
>
> *Steps*
> Start Ignite server node with enabled PDS (see the attached  
> [^example-cache.xml] config file)
> Activate the cluster with _./bin/control.sh --activate_
> Put some data into cluster (with _CachePutGetExample.java_ for example)
> Stop Ignite server node with SIGINT
> *Actual result*
> Ignite server node invokes the shutdown hook, checkpoint procedure starts, 
> but Ignite node *does not wait for checkpoint to finish* and terminates the 
> node.
> An excerpt from  [^srv.1.log] :
> {noformat}
> [2018-05-15 15:20:59,976][INFO ][Thread-3][G] Invoking shutdown hook...
> [2018-05-15 15:20:59,979][INFO ][Thread-3][GridTcpRestProtocol] Command 
> protocol successfully stopped: TCP binary
> [2018-05-15 15:20:59,998][INFO 
> ][db-checkpoint-thread-#50][GridCacheDatabaseSharedManager] Checkpoint 
> started [checkpointId=f0dde95a-6027-40dd-b3f3-4311aa8508c3, 
> startPtr=FileWALPointer [idx=0, fileOff=460751, len=40871], 
> checkpointLockWait=0ms, checkpointLockHoldTime=6ms, pages=167, 
> reason='timeout']
> [2018-05-15 15:21:00,011][INFO ][Thread-3][GridCacheProcessor] Stopped cache 
> [cacheName=default]
> [2018-05-15 15:21:00,011][INFO ][Thread-3][GridCacheProcessor] Stopped cache 
> [cacheName=ignite-sys-cache]
> [2018-05-15 15:21:00,012][INFO ][Thread-3][GridCacheProcessor] Stopped cache 
> [cacheName=CachePutGetExample]
> [2018-05-15 15:21:00,049][INFO ][Thread-3][IgniteKernal] 
> >>> +-----------------------------------------------------+
> >>> Ignite ver. 2.4.0-SNAPSHOT#19700101-sha1:DEV stopped OK
> >>> +-----------------------------------------------------+
> >>> Grid uptime: 00:00:36.228
> {noformat}
> When one starts the node again, the following warning appears in the log ( 
> [^srv.2.log] ):
> {noformat}
> [2018-05-15 15:21:39,848][WARN ][main][GridCacheDatabaseSharedManager] Ignite 
> node stopped in the middle of checkpoint. Will restore memory state and 
> finish checkpoint on node start.
> {noformat}
> *Note*
> Same behavior when stopping a server node via "Ignition.stop(true)" call.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to