Re: Move WAL archive cleanup from checkpoint to rollover

2020-12-15 Thread Andrey Gura
Kirill,

thanks for adding motivation to the issue description.

On Wed, Dec 9, 2020 at 5:24 PM Andrey Gura  wrote:
>
> Kiriill,
>
> Issue description contains the following:
>
> > At the moment, WAL archive is cleared at the end of the checkpoint, which 
> > does not seem correct and needs to be moved
>
> Could you please explain why existing behavior is not correct. It
> seems that it is not enough motivation for change.
>
> On Wed, Dec 9, 2020 at 5:05 PM vbm  wrote:
> >
> > Hi Kirill Tkalenko,
> >
> > Is there any relation to rate of ingestion of data to ignite ?
> >
> > We had seen the issue of WAL growing infinitely recently in our K8s cluster.
> > We were ingesting data at around 2Mbps.
> > In other clusters where we did not have such a fast ingestion of data, this
> > issue was not observed.
> >
> >
> > Regards,
> > Vishwas
> >
> >
> >
> > --
> > Sent from: http://apache-ignite-developers.2346864.n4.nabble.com/


Re: Move WAL archive cleanup from checkpoint to rollover

2020-12-11 Thread Данилов Семён
Hello! I also have an issue in progress regarding WAL archive: 
https://issues.apache.org/jira/browse/IGNITE-12892. In this ticket I tried 
clarifying WAL archive size configuration, removing usage of the deprecated 
walHistSize property. Also, IGNITE_PDS_MAX_CHECKPOINT_MEMORY_HISTORY_SIZE is 
now only used when archive is managed externally and you can set 
maxWalArchiveSize to -1 so WAL truncation is disabled. PR is already submitted 
and approved https://github.com/apache/ignite/pull/8550

Regards,
Semyon.

09.12.2020, 19:17, "ткаленко кирилл" :
> Hi, Andrey!
>
> Users expect DataStorageConfiguration#maxWalArchiveSize to mean that WAL 
> archive will not exceed this value, but it is not.
> It seems that to reduce the chance of getting into a situation when we exceed 
> WAL archive, it will be lowed when we clean it when switching to a new 
> segment than at the end of the checkpoint.
> After that, we can think about and make a hard limit on WAL archive, but for 
> this will need to solve a few more problems.
>
> 09.12.2020, 17:24, "Andrey Gura" :
>>  Kiriill,
>>
>>  Issue description contains the following:
>>
>>>   At the moment, WAL archive is cleared at the end of the checkpoint, which 
>>> does not seem correct and needs to be moved
>>
>>  Could you please explain why existing behavior is not correct. It
>>  seems that it is not enough motivation for change.
>>
>>  On Wed, Dec 9, 2020 at 5:05 PM vbm  wrote:
>>>   Hi Kirill Tkalenko,
>>>
>>>   Is there any relation to rate of ingestion of data to ignite ?
>>>
>>>   We had seen the issue of WAL growing infinitely recently in our K8s 
>>> cluster.
>>>   We were ingesting data at around 2Mbps.
>>>   In other clusters where we did not have such a fast ingestion of data, 
>>> this
>>>   issue was not observed.
>>>
>>>   Regards,
>>>   Vishwas
>>>
>>>   --
>>>   Sent from: http://apache-ignite-developers.2346864.n4.nabble.com/


Re: Move WAL archive cleanup from checkpoint to rollover

2020-12-09 Thread ткаленко кирилл
Hi, Andrey!

Users expect DataStorageConfiguration#maxWalArchiveSize to mean that WAL 
archive will not exceed this value, but it is not. 
It seems that to reduce the chance of getting into a situation when we exceed 
WAL archive, it will be lowed when we clean it when switching to a new segment 
than at the end of the checkpoint.
After that, we can think about and make a hard limit on WAL archive, but for 
this will need to solve a few more problems.

09.12.2020, 17:24, "Andrey Gura" :
> Kiriill,
>
> Issue description contains the following:
>
>>  At the moment, WAL archive is cleared at the end of the checkpoint, which 
>> does not seem correct and needs to be moved
>
> Could you please explain why existing behavior is not correct. It
> seems that it is not enough motivation for change.
>
> On Wed, Dec 9, 2020 at 5:05 PM vbm  wrote:
>>  Hi Kirill Tkalenko,
>>
>>  Is there any relation to rate of ingestion of data to ignite ?
>>
>>  We had seen the issue of WAL growing infinitely recently in our K8s cluster.
>>  We were ingesting data at around 2Mbps.
>>  In other clusters where we did not have such a fast ingestion of data, this
>>  issue was not observed.
>>
>>  Regards,
>>  Vishwas
>>
>>  --
>>  Sent from: http://apache-ignite-developers.2346864.n4.nabble.com/


Re: Move WAL archive cleanup from checkpoint to rollover

2020-12-09 Thread ткаленко кирилл
Hi, Vishwas!

Speed of uploading data directly associated with the growth of WAL archive.

09.12.2020, 17:05, "vbm" :
> Hi Kirill Tkalenko,
>
> Is there any relation to rate of ingestion of data to ignite ?
>
> We had seen the issue of WAL growing infinitely recently in our K8s cluster.
> We were ingesting data at around 2Mbps.
> In other clusters where we did not have such a fast ingestion of data, this
> issue was not observed.
>
> Regards,
> Vishwas
>
> --
> Sent from: http://apache-ignite-developers.2346864.n4.nabble.com/


Re: Move WAL archive cleanup from checkpoint to rollover

2020-12-09 Thread Andrey Gura
Kiriill,

Issue description contains the following:

> At the moment, WAL archive is cleared at the end of the checkpoint, which 
> does not seem correct and needs to be moved

Could you please explain why existing behavior is not correct. It
seems that it is not enough motivation for change.

On Wed, Dec 9, 2020 at 5:05 PM vbm  wrote:
>
> Hi Kirill Tkalenko,
>
> Is there any relation to rate of ingestion of data to ignite ?
>
> We had seen the issue of WAL growing infinitely recently in our K8s cluster.
> We were ingesting data at around 2Mbps.
> In other clusters where we did not have such a fast ingestion of data, this
> issue was not observed.
>
>
> Regards,
> Vishwas
>
>
>
> --
> Sent from: http://apache-ignite-developers.2346864.n4.nabble.com/


Re: Move WAL archive cleanup from checkpoint to rollover

2020-12-09 Thread vbm
Hi Kirill Tkalenko,

Is there any relation to rate of ingestion of data to ignite ?

We had seen the issue of WAL growing infinitely recently in our K8s cluster.
We were ingesting data at around 2Mbps. 
In other clusters where we did not have such a fast ingestion of data, this
issue was not observed.
 

Regards,
Vishwas



--
Sent from: http://apache-ignite-developers.2346864.n4.nabble.com/


Re: Move WAL archive cleanup from checkpoint to rollover

2020-12-09 Thread ткаленко кирилл
And this property IGNITE_THRESHOLD_WAL_ARCHIVE_SIZE_PERCENTAGE


09.12.2020, 16:28, "ткаленко кирилл" :
> And this property IGNITE_THRESHOLD_WAL_ARCHIVE_SIZE_PERCENTAGE
>
> 09.12.2020, 16:25, "ткаленко кирилл" :
>> Hi, Shiva!
>>
>> I am not aware of such tickets yet. Later I think I can deal with this issue.
>> Now you can try to increase the frequency of checkpoints, the maximum WAL 
>> archive size and try to change the system property 
>> IGNITE_CHECKPOINT_TRIGGER_ARCHIVE_SIZE_PERCENTAGE.
>> This will not solve the problem, but it may help to delay it.
>>
>> 09.12.2020, 16:15, "shm" :
>>>  Hi Kirill Tkalenko,
>>>  Thanks for your reply!
>>>  Do you know any known issue related to problem I'm facing ? or any ticket
>>>  related to infinite WAL growing issue which can be correlated to my issue?
>>>
>>>  Regards,
>>>  Shiva
>>>
>>>  --
>>>  Sent from: http://apache-ignite-developers.2346864.n4.nabble.com/


Re: Move WAL archive cleanup from checkpoint to rollover

2020-12-09 Thread ткаленко кирилл
Hi, Shiva!

I am not aware of such tickets yet. Later I think I can deal with this issue. 
Now you can try to increase the frequency of checkpoints, the maximum WAL 
archive size and try to change the system property 
IGNITE_CHECKPOINT_TRIGGER_ARCHIVE_SIZE_PERCENTAGE.
This will not solve the problem, but it may help to delay it.

09.12.2020, 16:15, "shm" :
> Hi Kirill Tkalenko,
> Thanks for your reply!
> Do you know any known issue related to problem I'm facing ? or any ticket
> related to infinite WAL growing issue which can be correlated to my issue?
>
> Regards,
> Shiva
>
> --
> Sent from: http://apache-ignite-developers.2346864.n4.nabble.com/


Re: Move WAL archive cleanup from checkpoint to rollover

2020-12-09 Thread shm
Hi Kirill Tkalenko,
Thanks for your reply!
Do you know any known issue related to problem I'm facing ? or any ticket
related to infinite WAL growing issue which can be correlated to my issue?

Regards,
Shiva 




--
Sent from: http://apache-ignite-developers.2346864.n4.nabble.com/


Re: Move WAL archive cleanup from checkpoint to rollover

2020-12-09 Thread ткаленко кирилл
Hi, Shiva!

Yes, this ticket will only be about moving WAL archive cleanup.
I think further it is possible to solve the problem of WAL archive overflow, 
but before that we need to solve several problems and deal with heuristics.

09.12.2020, 15:02, "shm" :
> Hi Kirill Tkalenko
>
> Is this ticket is just for moving WAL archive clean-up logic from "post
> checkpoint" to FileWriteAheadLogManager#rollOver ? Or is there any plan to
> change the logic which considers Archived WAL segments for clean-up ?
>
> I'm facing a Major issue related to WAL usage and WAL usage growing
> infinitely with heavy write load even maxWalArchiveSize is set.
>
> Ref:
> http://apache-ignite-users.70518.x6.nabble.com/Could-not-clear-historyMap-due-to-WAL-reservation-on-cp-td34779.html
>
> http://apache-ignite-users.70518.x6.nabble.com/connecting-to-visor-shell-permanently-stops-unwanted-WAL-clean-up-td34737.html
>
> Shiva
>
> --
> Sent from: http://apache-ignite-developers.2346864.n4.nabble.com/


Re: Move WAL archive cleanup from checkpoint to rollover

2020-12-09 Thread shm
Hi Kirill Tkalenko

Is this ticket is just for moving WAL archive clean-up logic from "post
checkpoint" to FileWriteAheadLogManager#rollOver  ? Or is there any plan to
change the logic which considers Archived WAL segments for clean-up ?

I'm facing a Major issue related to WAL usage and WAL usage growing
infinitely with heavy write load even maxWalArchiveSize is set.

Ref:
http://apache-ignite-users.70518.x6.nabble.com/Could-not-clear-historyMap-due-to-WAL-reservation-on-cp-td34779.html

http://apache-ignite-users.70518.x6.nabble.com/connecting-to-visor-shell-permanently-stops-unwanted-WAL-clean-up-td34737.html

Shiva



--
Sent from: http://apache-ignite-developers.2346864.n4.nabble.com/


Move WAL archive cleanup from checkpoint to rollover

2020-12-09 Thread ткаленко кирилл
Hello to all!

At the moment, the archive is cleared at the end of the checkpoint, it seems it 
should happen in FileWriteAheadLogManager. 
I suggest moving it into the FileWriteAheadLogManager#rollOver when the 
DataStorageConfiguration#maxWalArchiveSize is reached.

To do this, I created a ticket 
https://issues.apache.org/jira/browse/IGNITE-13831


[jira] [Created] (IGNITE-13831) Move WAL archive cleanup from checkpoint to rollover

2020-12-09 Thread Kirill Tkalenko (Jira)
Kirill Tkalenko created IGNITE-13831:


 Summary: Move WAL archive cleanup from checkpoint to rollover
 Key: IGNITE-13831
 URL: https://issues.apache.org/jira/browse/IGNITE-13831
 Project: Ignite
  Issue Type: Improvement
  Components: persistence
Reporter: Kirill Tkalenko
Assignee: Kirill Tkalenko
 Fix For: 2.10


At the moment, WAL archive is cleared at the end of the checkpoint, which does 
not seem correct and needs to be moved to the *FileWriteAheadLogManager*. 

It is suggested to clear WAL archive in *FileWriteAheadLogManager#rollOver* 
when the *DataStorageConfiguration#maxWalArchiveSize* is reached.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)