[jira] [Commented] (IGNITE-13912) Incorrect calculation of WAL segments that should be deleted from WAL archive

shivakumar (Jira) Thu, 21 Jan 2021 10:11:06 -0800


    [ 
https://issues.apache.org/jira/browse/IGNITE-13912?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17269501#comment-17269501
 ]


shivakumar commented on IGNITE-13912:
-------------------------------------

Hi [[email protected]]

The last such message in [^server1-full-wal-checkpoint.log]

is
{code:java}
[2021-01-21T13:49:18,110][DEBUG][wal-file-cleaner%null-#82][FileWriteAheadLogManager]
 Add current WAL archive size: -1073741408.0 B
[2021-01-21T13:49:18,110][DEBUG][wal-file-cleaner%null-#82][FileWriteAheadLogManager]
 Last truncated WAL segment: 9{code}
 But after that I see messages like this
{code:java}
[2021-01-21T13:53:39,924][DEBUG][wal-file-cleaner%null-#82][FileWriteAheadLogManager]
 Available truncate WAL segments: 0{code}
As part of my testing I just connect/disconnect few times to visor to monitor 
cache(last time also we observed this issue when we connect to visor). Apart 
from this only one client program which ingest data (java program connects to 
ignite over JDBC) is invoked in a loop. 

My client program makes connection like this 

 // Open Connection to Ignite
 System.out.println("Connecting to Ignite database...");
 String url = IGNITE_DB_URL + ";" + IGNITE_DB_URL_PARAMETERS;
 conn = DriverManager.getConnection(url, IGNITE_USER, IGNITE_PASS); 

And ingest 10 million records and disconnect from database. 

This connect–> ingest–>disconnect is in loop.

Not sure what is causing topology change but for sure there are no restarts of 
server nodes.

> Incorrect calculation of WAL segments that should be deleted from WAL archive
> -----------------------------------------------------------------------------
>
>                 Key: IGNITE-13912
>                 URL: https://issues.apache.org/jira/browse/IGNITE-13912
>             Project: Ignite
>          Issue Type: Bug
>          Components: persistence
>            Reporter: Kirill Tkalenko
>            Assignee: Kirill Tkalenko
>            Priority: Critical
>             Fix For: 2.10
>
>         Attachments: server1-full-wal-checkpoint.log, wal-checkpoint-logs, 
> wal_dir_contents, wal_grows_from_peak.PNG, wal_issue_reproduced.PNG, 
> wal_usage.PNG, wal_usage_dec12.PNG, wal_usage_dec22nd_binary.PNG
>
>          Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> Now there is an incorrect calculation of WAL segments that should be deleted 
> from WAL archive. Since we delete only those segments whose total size should 
> not exceed *DataStorageConfiguration#maxWalArchiveSize * 
> IGNITE_THRESHOLD_WAL_ARCHIVE_SIZE_PERCENTAGE*, but should be up to  
> DataStorageConfiguration#maxWalArchiveSize * 
> IGNITE_THRESHOLD_WAL_ARCHIVE_SIZE_PERCENTAGE*. Therefore, an excess of 
> *DataStorageConfiguration#maxWalArchiveSize* occurs.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

[jira] [Commented] (IGNITE-13912) Incorrect calculation of WAL segments that should be deleted from WAL archive

Reply via email to