[ 
https://issues.apache.org/jira/browse/IGNITE-13912?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17269490#comment-17269490
 ] 

Kirill Tkalenko commented on IGNITE-13912:
------------------------------------------

Hi, [~shm]! 

>From such messages, I see that the archive is being cleaned, for example, is 
>there *000000000000009.wal* segment after the test?
{noformat}
[2021-01-21T13:49:18,110][DEBUG][wal-file-cleaner%null-#82][FileWriteAheadLogManager]
 Last truncated WAL segment: 9
[2021-01-21T13:49:18,111][INFO 
][wal-file-cleaner%null-#82][FileWriteAheadLogManager] Finish clean WAL archive 
[cleanCnt=1, currSize=3.0 GB, maxSize=10.0 GB]
{noformat}

>From the attached stack, I see that a historical rebalance is taking place in 
>the test, which does not allow clearing segments while there are changes in 
>the topology.

{noformat}
[2021-01-21T13:47:37,493][DEBUG][sys-#310][FileWriteAheadLogManager] Reserved 
WAL pointer: WALPointer [idx=11, fileOff=540780430, len=9572]
[2021-01-21T13:47:37,493][WARN ][sys-#310][FileWriteAheadLogManager] Reserved 
WAL stack
        at 
org.apache.ignite.internal.processors.cache.persistence.wal.FileWriteAheadLogManager.reserve(FileWriteAheadLogManager.java:1015)
 [ignite-core-2.11.0-SNAPSHOT.jar:2.11.0-SNAPSHOT]
{noformat}



> Incorrect calculation of WAL segments that should be deleted from WAL archive
> -----------------------------------------------------------------------------
>
>                 Key: IGNITE-13912
>                 URL: https://issues.apache.org/jira/browse/IGNITE-13912
>             Project: Ignite
>          Issue Type: Bug
>          Components: persistence
>            Reporter: Kirill Tkalenko
>            Assignee: Kirill Tkalenko
>            Priority: Critical
>             Fix For: 2.10
>
>         Attachments: server1-full-wal-checkpoint.log, wal-checkpoint-logs, 
> wal_dir_contents, wal_grows_from_peak.PNG, wal_issue_reproduced.PNG, 
> wal_usage.PNG, wal_usage_dec12.PNG, wal_usage_dec22nd_binary.PNG
>
>          Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> Now there is an incorrect calculation of WAL segments that should be deleted 
> from WAL archive. Since we delete only those segments whose total size should 
> not exceed *DataStorageConfiguration#maxWalArchiveSize * 
> IGNITE_THRESHOLD_WAL_ARCHIVE_SIZE_PERCENTAGE*, but should be up to  
> DataStorageConfiguration#maxWalArchiveSize * 
> IGNITE_THRESHOLD_WAL_ARCHIVE_SIZE_PERCENTAGE*. Therefore, an excess of 
> *DataStorageConfiguration#maxWalArchiveSize* occurs.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to