Hi

I deployed 5 node Ignite 2.9.0 on k8s with below configuration
Total RAM per instance 64 GB 
JVM 32 GB
Default data region 12 GB
Persistence storage 500GB volume
WAL + WAL archive 30 GB volume
After this I started ingesting data to 3 tables created, the data ingestion
is using basic JDBC batch insertions.
After around 14 hours it generated around 100GB of persistence data on each
node in 3 tables (each having backup of 1).
But suddenly 2 PODs crashed and when I check the logs, *there was errors
which says no space left on the storage volume* configured for WAL+WAL
archive.
I'm not sure what exactly caused this issue, but i couldn't recover from
this POD crash on K8S as i cannot expand the volume attached to ignite PODS.
The only operation I did when pods crashed was select count(*) from table; 
and there were around 21 crore records in that table.

Does WAL archive is needed ? how I can avoid these kind of issues, which end
up cluster in unusable state.

Your help is greatly appreciated 

Thank you 
Shiva




--
Sent from: http://apache-ignite-users.70518.x6.nabble.com/

Reply via email to