Vladislav Pyatkov created IGNITE-8710:
-----------------------------------------
Summary: Applying WAL works long time or fail at all, when *.wal
files been removed
Key: IGNITE-8710
URL: https://issues.apache.org/jira/browse/IGNITE-8710
Project: Ignite
Issue Type: Bug
Reporter: Vladislav Pyatkov
In specific cases when removed *.wal files or unmounted wal directories we got
some warning message on start:
{noformat}
2018-06-02 12:10:06.127[INFO
][Thread-100][o.a.i.i.p.c.p.GridCacheDatabaseSharedManager] Checking memory
state [lastValidPos=FileWALPointer [idx=0, fileOff=0, len=0],
lastMarked=FileWALPointer [idx=0, fileOff=0, len=0],
lastCheckpointId=00000000-0000-0000-0000-000000000000]
2018-06-02 12:10:06.546[WARN
][Thread-100][o.a.i.i.p.c.p.GridCacheDatabaseSharedManager] Found unexpected
checkpoint marker, skipping [cpId=94b5ce03-87b7-489e-b08b-b4c5dc522bd5,
expCpId=00000000-0000-0000-0000-000000000000, pos=FileWALPointer [idx=0,
fileOff=44266869, len=977]]
2018-06-02 12:10:57.860[WARN
][Thread-100][o.a.i.i.p.c.p.GridCacheDatabaseSharedManager] Found unexpected
checkpoint marker, skipping [cpId=3f6ab238-23f7-4924-b4ef-0cb68d914a04,
expCpId=00000000-0000-0000-0000-000000000000, pos=FileWALPointer [idx=7,
fileOff=872888269, len=460112]]
2018-06-02 12:11:46.600[INFO
][Thread-100][o.a.i.i.p.c.p.w.FileWriteAheadLogManager] Stopping WAL iteration
due to an exception: EOF at position [1073741824] expected to read [1] bytes,
ptr=FileWALPointer [idx=15, fileOff=1073741824, len=0]
2018-06-02 12:12:21.181[WARN
][Thread-100][o.a.i.i.p.c.p.GridCacheDatabaseSharedManager] Found unexpected
checkpoint marker, skipping [cpId=3fe33806-ee11-49b7-8c47-648cd1adacbc,
expCpId=00000000-0000-0000-0000-000000000000, pos=FileWALPointer [idx=23,
fileOff=693360866, len=460112]]
{noformat}
And trying to recovery from WAL hangs a long try without success.
Should to stop the node and print message about not found necessary wal-files.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)