Something is going wrong.  –storage.tsdb.retention.size controls the 
*total* storage retained, not the WAL - and the WAL is only about 2 hours 
worth of data.

What filesystem is /apps/prometheus/prometheus_data stored on?

What do you see inside the wal directory? e.g. on a system here:

# *ls -lhR wal*
wal:
total 501M
-rw-r--r-- 1 prometheus prometheus 128M Dec  1 17:45 00031696
-rw-r--r-- 1 prometheus prometheus 128M Dec  1 18:30 00031697
-rw-r--r-- 1 prometheus prometheus  85M Dec  1 19:00 00031698
-rw-r--r-- 1 prometheus prometheus 128M Dec  1 19:45 00031699
-rw-r--r-- 1 prometheus prometheus  32M Dec  1 19:56 00031700
drwxr-xr-x 1 prometheus prometheus   16 Dec  1 19:00 checkpoint.00031695

wal/checkpoint.00031695:
total 7.6M
-rw-r--r-- 1 prometheus prometheus 7.6M Dec  1 19:00 00000000

Do you have a massive churn in timeseries?  Go to the web interface, select 
Status > TSDB Status, and what does the Head Stats section say? e.g.

Number of Series
Number of Chunks
Number of Label Pairs
Current Min Time
Current Max Time
=>
499826
499826
13719
2021-12-01T18:00:00.410Z (1638381600410)
2021-12-01T19:58:08.988Z (1638388688988)


-- 
You received this message because you are subscribed to the Google Groups 
"Prometheus Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/prometheus-users/ea067387-0dc7-41fa-bed1-8d4cefa1bfb8n%40googlegroups.com.

Reply via email to