GitHub user yatriks edited a discussion: Memory leak and confusing logs

Hi all,

I've been really enjoying my use of `kvrocks` so far, and everyone has done a 
fantastic job on it.

I'm running into some issues now that my database is in the ~100s of GBs. I'm 
using the default kvrocks.conf, except that `repl-namespace-enabled` is set to 
yes, a leftover from when I was testing whether to add a read replica.
(I am running 2.12.1.)

1) The container seems to run out of memory at seemingly random times, though 
looking at my Prometheus stats, a compaction appears to be pending right before 
the memory starts to rise. This is not quite like 
https://github.com/apache/kvrocks/discussions/2784, where the memory reported 
by kvrocks differs from the memory visible to the OS: I have the docker 
container limit at 12GB, and the container's memory usage always agrees (very 
closely...) with the kvrocks reported memory. In any case, the MALLOC changes 
in that thread didn't make a difference.

This is hard to describe, so please let me know what information I can get that 
would help. (I am also trying to recreate this in a self-contained docker 
compose file with the default container, but haven't been able to yet, which 
may mean I'm doing something wrong in my setup...) I've attached screenshots of 
seemingly relevant metrics from `kvrocks-exporter`, and I can get whatever 
other information is needed.

![image](https://github.com/user-attachments/assets/9b1c1502-5f58-49f4-92fa-5985c35a7c38)
![image](https://github.com/user-attachments/assets/119aa1e6-ad3b-4234-ba3a-d175e6020100)
This last one was taken during the period of the memory spike.

2) The only logs I see lately are: 
```
kv-1  | E20250520 06:16:23.964952 140052344587968 compaction_checker.cc:88] [compaction checker] Parse total_keys error: out of range of integer type
```

That error hasn't appeared since I recently restarted the container, but it 
showed up consistently before (though it didn't seem to affect anything...).

Again, it's a lot of weirdness, so please let me know what other information I 
can get. I'm still trying to reproduce it reliably, but when I saw that memory 
had spiked while I wasn't testing anything at all, I figured I needed some help.

Thanks again.



GitHub link: https://github.com/apache/kvrocks/discussions/2984
