[ceph-users] Re: osd_pglog memory hoarding - another case

2020-12-22 Thread Kalle Happonen
van der Ster" , "ceph-users" > > Sent: Monday, 14 December, 2020 10:25:32 > Subject: [ceph-users] Re: osd_pglog memory hoarding - another case > Hi all, > Ok, so I have some updates on this. > > We noticed that we had a bucket with tons of RGW garbage collectio

[ceph-users] Re: osd_pglog memory hoarding - another case

2020-12-14 Thread Kalle Happonen
> Cc: "ceph-users" > Sent: Tuesday, 1 December, 2020 16:53:50 > Subject: Re: [ceph-users] Re: osd_pglog memory hoarding - another case > Hi Kalle, > > Thanks for the update. Unfortunately I haven't made any progress on > understanding the root cause of this issue. &g

[ceph-users] Re: osd_pglog memory hoarding - another case

2020-12-01 Thread Dan van der Ster
lle > > - Original Message - > > From: "Kalle Happonen" > > To: "Dan van der Ster" > > Cc: "ceph-users" > > Sent: Tuesday, 1 December, 2020 15:09:37 > > Subject: [ceph-users] Re: osd_pglog memory hoarding - another case >

[ceph-users] Re: osd_pglog memory hoarding - another case

2020-12-01 Thread Kalle Happonen
Happonen" > To: "Dan van der Ster" > Cc: "ceph-users" > Sent: Tuesday, 1 December, 2020 15:09:37 > Subject: [ceph-users] Re: osd_pglog memory hoarding - another case > Hi All, > back to this. Dan, it seems we're following exactly in your footsteps. > >

[ceph-users] Re: osd_pglog memory hoarding - another case

2020-12-01 Thread Kalle Happonen
To: "Dan van der Ster" > Cc: "ceph-users" > Sent: Thursday, 19 November, 2020 13:56:37 > Subject: [ceph-users] Re: osd_pglog memory hoarding - another case > Hello, > I thought I'd post an update. > > Setting the pg_log size to 500, and running the offlin

[ceph-users] Re: osd_pglog memory hoarding - another case

2020-11-19 Thread Kalle Happonen
. Cheers, Kalle - Original Message - > From: "Kalle Happonen" > To: "Dan van der Ster" > Cc: "ceph-users" > Sent: Tuesday, 17 November, 2020 16:07:03 > Subject: [ceph-users] Re: osd_pglog memory hoarding - another case > Hi, > >

[ceph-users] Re: osd_pglog memory hoarding - another case

2020-11-17 Thread Kalle Happonen
t;> >> >> - Original Message - >> > From: "Kalle Happonen" >> > To: "Dan van der Ster" >> > Cc: "ceph-users" >> > Sent: Tuesday, 17 November, 2020 12:45:25 >> > Subject: [ceph-users] Re: osd_pglo

[ceph-users] Re: osd_pglog memory hoarding - another case

2020-11-17 Thread Mark Nelson
t; Sent: Tuesday, 17 November, 2020 12:45:25 Subject: [ceph-users] Re: osd_pglog memory hoarding - another case Hi Dan @ co., Thanks for the support (moral and technical). That sounds like a good guess, but it seems like there is nothing alarming here. In all our pools, some pgs are a bit over 31

[ceph-users] Re: osd_pglog memory hoarding - another case

2020-11-17 Thread Dan van der Ster
> From: "Kalle Happonen" > > To: "Dan van der Ster" > > Cc: "ceph-users" > > Sent: Tuesday, 17 November, 2020 12:45:25 > > Subject: [ceph-users] Re: osd_pglog memory hoarding - another case > > > Hi Dan @ co., > > Thanks for the

[ceph-users] Re: osd_pglog memory hoarding - another case

2020-11-17 Thread Kalle Happonen
issues with memory. Cheers, Kalle - Original Message - > From: "Kalle Happonen" > To: "Dan van der Ster" > Cc: "ceph-users" > Sent: Tuesday, 17 November, 2020 12:45:25 > Subject: [ceph-users] Re: osd_pglog memory hoarding - another case &g

[ceph-users] Re: osd_pglog memory hoarding - another case

2020-11-17 Thread Dan van der Ster
On Tue, Nov 17, 2020 at 11:45 AM Kalle Happonen wrote: > > Hi Dan @ co., > Thanks for the support (moral and technical). > > That sounds like a good guess, but it seems like there is nothing alarming > here. In all our pools, some pgs are a bit over 3100, but not at any > exceptional values. >

[ceph-users] Re: osd_pglog memory hoarding - another case

2020-11-17 Thread Kalle Happonen
Hi Dan @ co., Thanks for the support (moral and technical). That sounds like a good guess, but it seems like there is nothing alarming here. In all our pools, some pgs are a bit over 3100, but not at any exceptional values. cat pgdumpfull.txt | jq '.pg_map.pg_stats[] | select(.ondisk_log_size

[ceph-users] Re: osd_pglog memory hoarding - another case

2020-11-17 Thread Dan van der Ster
Hi Kalle, Do you have active PGs now with huge pglogs? You can do something like this to find them: ceph pg dump -f json | jq '.pg_map.pg_stats[] | select(.ondisk_log_size > 3000)' If you find some, could you increase to debug_osd = 10 then share the osd log. I am interested in the debug

[ceph-users] Re: osd_pglog memory hoarding - another case

2020-11-17 Thread Dan van der Ster
Hi Xie, On Tue, Nov 17, 2020 at 11:14 AM wrote: > > Hi Dan, > > > > Given that it adds a case where the pg_log is not trimmed, I wonder if > > there could be an unforeseen condition where `last_update_ondisk` > > isn't being updated correctly, and therefore the osd stops trimming > > the pg_log

[ceph-users] Re: osd_pglog memory hoarding - another case

2020-11-17 Thread xie.xingguo
___ ceph-users mailing list -- ceph-users@ceph.io To unsubscribe send an email to ceph-users-le...@ceph.io

[ceph-users] Re: osd_pglog memory hoarding - another case

2020-11-17 Thread Dan van der Ster
Hi Kalle, Strangely and luckily, in our case the memory explosion didn't reoccur after that incident. So I can mostly only offer moral support. But if this bug indeed appeared between 14.2.8 and 14.2.13, then I think this is suspicious: b670715eb4 osd/PeeringState: do not trim pg log past