Re: [PATCH] mm: cma: fix corruption cma_sysfs_alloc_pages_count
On Wed, Mar 24, 2021 at 11:02:47PM +0300, Dmitry Osipenko wrote: > 24.03.2021 22:57, Minchan Kim пишет: > > On Wed, Mar 24, 2021 at 10:49:58PM +0300, Dmitry Osipenko wrote: > >> 24.03.2021 22:43, Dmitry Osipenko пишет: > >>> 24.03.2021 22:20, Minchan Kim пишет: > static int __init cma_sysfs_init(void) > { > -int i = 0; > +struct kobject *cma_kobj_root; > +struct cma_kobject *cma_kobj; > struct cma *cma; > +unsigned int i; > >>> > while (--i >= 0) { > >>> > >>> Do you realize that this doesn't work anymore? > >>> > cma = _areas[i]; > -kobject_put(>stat->kobj); > -} > > -kfree(cma_stats); > -kobject_put(cma_kobj); > +kobject_put(>cma_kobj->kobj); > +kfree(cma->cma_kobj); > >>> > >>> Freeing a null pointer? > >>> > +cma->cma_kobj = NULL; > +} > +kobject_put(cma_kobj_root); > >>> > >> > >> Please try to simulate the errors and check that error path is working > >> properly in the next version. > >> > >> Alternatively, we could remove the cma_kobj_release entirely, like Greg > >> suggested previously, and then don't care about cleaning up at all. > > > > Does he suggested it to remove cma_kobj_release?(Initially, I did but > > was rejected from Greg) > > > > Alright, I haven't followed the previous threads fully and only saw the > reply where he suggested to removed it. No problem. I just posted it new version. Hopefully, it tastes good for you. ;-) Thanks for the review!
Re: [PATCH] mm: cma: fix corruption cma_sysfs_alloc_pages_count
24.03.2021 22:57, Minchan Kim пишет: > On Wed, Mar 24, 2021 at 10:49:58PM +0300, Dmitry Osipenko wrote: >> 24.03.2021 22:43, Dmitry Osipenko пишет: >>> 24.03.2021 22:20, Minchan Kim пишет: static int __init cma_sysfs_init(void) { - int i = 0; + struct kobject *cma_kobj_root; + struct cma_kobject *cma_kobj; struct cma *cma; + unsigned int i; >>> while (--i >= 0) { >>> >>> Do you realize that this doesn't work anymore? >>> cma = _areas[i]; - kobject_put(>stat->kobj); - } - kfree(cma_stats); - kobject_put(cma_kobj); + kobject_put(>cma_kobj->kobj); + kfree(cma->cma_kobj); >>> >>> Freeing a null pointer? >>> + cma->cma_kobj = NULL; + } + kobject_put(cma_kobj_root); >>> >> >> Please try to simulate the errors and check that error path is working >> properly in the next version. >> >> Alternatively, we could remove the cma_kobj_release entirely, like Greg >> suggested previously, and then don't care about cleaning up at all. > > Does he suggested it to remove cma_kobj_release?(Initially, I did but > was rejected from Greg) > Alright, I haven't followed the previous threads fully and only saw the reply where he suggested to removed it.
Re: [PATCH] mm: cma: fix corruption cma_sysfs_alloc_pages_count
On Wed, Mar 24, 2021 at 10:49:58PM +0300, Dmitry Osipenko wrote: > 24.03.2021 22:43, Dmitry Osipenko пишет: > > 24.03.2021 22:20, Minchan Kim пишет: > >> static int __init cma_sysfs_init(void) > >> { > >> - int i = 0; > >> + struct kobject *cma_kobj_root; > >> + struct cma_kobject *cma_kobj; > >>struct cma *cma; > >> + unsigned int i; > > > >>while (--i >= 0) { > > > > Do you realize that this doesn't work anymore? > > > >>cma = _areas[i]; > >> - kobject_put(>stat->kobj); > >> - } > >> > >> - kfree(cma_stats); > >> - kobject_put(cma_kobj); > >> + kobject_put(>cma_kobj->kobj); > >> + kfree(cma->cma_kobj); > > > > Freeing a null pointer? > > > >> + cma->cma_kobj = NULL; > >> + } > >> + kobject_put(cma_kobj_root); > > > > Please try to simulate the errors and check that error path is working > properly in the next version. > > Alternatively, we could remove the cma_kobj_release entirely, like Greg > suggested previously, and then don't care about cleaning up at all. Does he suggested it to remove cma_kobj_release?(Initially, I did but was rejected from Greg)
Re: [PATCH] mm: cma: fix corruption cma_sysfs_alloc_pages_count
On Wed, Mar 24, 2021 at 08:53:26PM +0100, David Hildenbrand wrote: > On 24.03.21 20:45, John Hubbard wrote: > > On 3/24/21 12:20 PM, Minchan Kim wrote: > > > struct cma_stat's lifespan for cma_sysfs is different with > > > struct cma because kobject for sysfs requires dynamic object > > > while CMA is static object[1]. When CMA is initialized, > > > it couldn't use slab to allocate cma_stat since slab was not > > > initialized yet. Thus, it allocates the dynamic object > > > in subsys_initcall. > > > > > > However, the cma allocation can happens before subsys_initcall > > > then, it goes crash. > > > > > > Dmitry reported[2]: > > > > > > .. > > > [1.226190] [] (cma_sysfs_alloc_pages_count) from > > > [] (cma_alloc+0x153/0x274) > > > [1.226720] [] (cma_alloc) from [] > > > (__alloc_from_contiguous+0x37/0x8c) > > > [1.227272] [] (__alloc_from_contiguous) from [] > > > (atomic_pool_init+0x7b/0x126) > > > [1.233596] [] (atomic_pool_init) from [] > > > (do_one_initcall+0x45/0x1e4) > > > [1.234188] [] (do_one_initcall) from [] > > > (kernel_init_freeable+0x157/0x1a6) > > > [1.234741] [] (kernel_init_freeable) from [] > > > (kernel_init+0xd/0xe0) > > > [1.235289] [] (kernel_init) from [] > > > (ret_from_fork+0x11/0x1c) > > > > > > This patch moves those statistic fields of cma_stat into struct cma > > > and introduces cma_kobject wrapper to follow kobject's rule. > > > > > > At the same time, it fixes other routines based on suggestions[3][4]. > > > > > > [1] https://lore.kernel.org/linux-mm/ycoamxqt6dzkc...@kroah.com/ > > > [2] > > > https://lore.kernel.org/linux-mm/fead70a2-4330-79ff-e79a-d8511eab1...@gmail.com/ > > > [3] > > > https://lore.kernel.org/linux-mm/20210323195050.2577017-1-minc...@kernel.org/ > > > [4] > > > https://lore.kernel.org/linux-mm/20210324010547.4134370-1-minc...@kernel.org/ > > > > > > Reported-by: Dmitry Osipenko > > > Tested-by: Dmitry Osipenko > > > Suggested-by: Dmitry Osipenko > > > Suggested-by: John Hubbard > > > Suggested-by: Matthew Wilcox > > > Signed-off-by: Minchan Kim > > > --- > > > I belive it's worth to have separate patch rather than replacing > > > original patch. It will also help to merge without conflict > > > since we already filed other patch based on it. > > > Strictly speaking, separating fix part and readbility part > > > in this patch would be better but it's gray to separate them > > > since most code in this patch was done while we were fixing > > > the bug. Since we don't release it yet, I hope it will work. > > > Otherwise, I can send a replacement patch inclucing all of > > > changes happend until now with gathering SoB. > > > > If we still have a choice, we should not merge a patch that has a known > > serious problem, such as a crash. That's only done if the broken problematic > > patch has already been committed to a tree that doesn't allow rebasing, > > such as of course the main linux.git. > > > > Here, I *think* it's just in linux-next and mmotm, so we still are allowed > > to fix the original patch. > > Yes, that's what we should do in case it's not upstream yet. Clean resend + > re-apply. Okay, let me replace the original one including all other patches.
Re: [PATCH] mm: cma: fix corruption cma_sysfs_alloc_pages_count
On 24.03.21 20:45, John Hubbard wrote: On 3/24/21 12:20 PM, Minchan Kim wrote: struct cma_stat's lifespan for cma_sysfs is different with struct cma because kobject for sysfs requires dynamic object while CMA is static object[1]. When CMA is initialized, it couldn't use slab to allocate cma_stat since slab was not initialized yet. Thus, it allocates the dynamic object in subsys_initcall. However, the cma allocation can happens before subsys_initcall then, it goes crash. Dmitry reported[2]: .. [1.226190] [] (cma_sysfs_alloc_pages_count) from [] (cma_alloc+0x153/0x274) [1.226720] [] (cma_alloc) from [] (__alloc_from_contiguous+0x37/0x8c) [1.227272] [] (__alloc_from_contiguous) from [] (atomic_pool_init+0x7b/0x126) [1.233596] [] (atomic_pool_init) from [] (do_one_initcall+0x45/0x1e4) [1.234188] [] (do_one_initcall) from [] (kernel_init_freeable+0x157/0x1a6) [1.234741] [] (kernel_init_freeable) from [] (kernel_init+0xd/0xe0) [1.235289] [] (kernel_init) from [] (ret_from_fork+0x11/0x1c) This patch moves those statistic fields of cma_stat into struct cma and introduces cma_kobject wrapper to follow kobject's rule. At the same time, it fixes other routines based on suggestions[3][4]. [1] https://lore.kernel.org/linux-mm/ycoamxqt6dzkc...@kroah.com/ [2] https://lore.kernel.org/linux-mm/fead70a2-4330-79ff-e79a-d8511eab1...@gmail.com/ [3] https://lore.kernel.org/linux-mm/20210323195050.2577017-1-minc...@kernel.org/ [4] https://lore.kernel.org/linux-mm/20210324010547.4134370-1-minc...@kernel.org/ Reported-by: Dmitry Osipenko Tested-by: Dmitry Osipenko Suggested-by: Dmitry Osipenko Suggested-by: John Hubbard Suggested-by: Matthew Wilcox Signed-off-by: Minchan Kim --- I belive it's worth to have separate patch rather than replacing original patch. It will also help to merge without conflict since we already filed other patch based on it. Strictly speaking, separating fix part and readbility part in this patch would be better but it's gray to separate them since most code in this patch was done while we were fixing the bug. Since we don't release it yet, I hope it will work. Otherwise, I can send a replacement patch inclucing all of changes happend until now with gathering SoB. If we still have a choice, we should not merge a patch that has a known serious problem, such as a crash. That's only done if the broken problematic patch has already been committed to a tree that doesn't allow rebasing, such as of course the main linux.git. Here, I *think* it's just in linux-next and mmotm, so we still are allowed to fix the original patch. Yes, that's what we should do in case it's not upstream yet. Clean resend + re-apply. -- Thanks, David / dhildenb
Re: [PATCH] mm: cma: fix corruption cma_sysfs_alloc_pages_count
24.03.2021 22:43, Dmitry Osipenko пишет: > 24.03.2021 22:20, Minchan Kim пишет: >> static int __init cma_sysfs_init(void) >> { >> -int i = 0; >> +struct kobject *cma_kobj_root; >> +struct cma_kobject *cma_kobj; >> struct cma *cma; >> +unsigned int i; > >> while (--i >= 0) { > > Do you realize that this doesn't work anymore? > >> cma = _areas[i]; >> -kobject_put(>stat->kobj); >> -} >> >> -kfree(cma_stats); >> -kobject_put(cma_kobj); >> +kobject_put(>cma_kobj->kobj); >> +kfree(cma->cma_kobj); > > Freeing a null pointer? > >> +cma->cma_kobj = NULL; >> +} >> +kobject_put(cma_kobj_root); > Please try to simulate the errors and check that error path is working properly in the next version. Alternatively, we could remove the cma_kobj_release entirely, like Greg suggested previously, and then don't care about cleaning up at all.
Re: [PATCH] mm: cma: fix corruption cma_sysfs_alloc_pages_count
On Wed, Mar 24, 2021 at 10:43:49PM +0300, Dmitry Osipenko wrote: > 24.03.2021 22:20, Minchan Kim пишет: > > static int __init cma_sysfs_init(void) > > { > > - int i = 0; > > + struct kobject *cma_kobj_root; > > + struct cma_kobject *cma_kobj; > > struct cma *cma; > > + unsigned int i; > > > while (--i >= 0) { > > Do you realize that this doesn't work anymore? > > > cma = _areas[i]; > > - kobject_put(>stat->kobj); > > - } > > > > - kfree(cma_stats); > > - kobject_put(cma_kobj); > > + kobject_put(>cma_kobj->kobj); > > + kfree(cma->cma_kobj); > > Freeing a null pointer? Need coffee. diff --git a/mm/cma_sysfs.c b/mm/cma_sysfs.c index a670a80aad6f..73463be08df7 100644 --- a/mm/cma_sysfs.c +++ b/mm/cma_sysfs.c @@ -79,8 +79,7 @@ static int __init cma_sysfs_init(void) struct kobject *cma_kobj_root; struct cma_kobject *cma_kobj; struct cma *cma; - unsigned int i; - int err; + int i, err; cma_kobj_root = kobject_create_and_add("cma", mm_kobj); if (!cma_kobj_root) @@ -108,10 +107,7 @@ static int __init cma_sysfs_init(void) out: while (--i >= 0) { cma = _areas[i]; - kobject_put(>cma_kobj->kobj); - kfree(cma->cma_kobj); - cma->cma_kobj = NULL; } kobject_put(cma_kobj_root);
Re: [PATCH] mm: cma: fix corruption cma_sysfs_alloc_pages_count
On 3/24/21 12:20 PM, Minchan Kim wrote: struct cma_stat's lifespan for cma_sysfs is different with struct cma because kobject for sysfs requires dynamic object while CMA is static object[1]. When CMA is initialized, it couldn't use slab to allocate cma_stat since slab was not initialized yet. Thus, it allocates the dynamic object in subsys_initcall. However, the cma allocation can happens before subsys_initcall then, it goes crash. Dmitry reported[2]: .. [1.226190] [] (cma_sysfs_alloc_pages_count) from [] (cma_alloc+0x153/0x274) [1.226720] [] (cma_alloc) from [] (__alloc_from_contiguous+0x37/0x8c) [1.227272] [] (__alloc_from_contiguous) from [] (atomic_pool_init+0x7b/0x126) [1.233596] [] (atomic_pool_init) from [] (do_one_initcall+0x45/0x1e4) [1.234188] [] (do_one_initcall) from [] (kernel_init_freeable+0x157/0x1a6) [1.234741] [] (kernel_init_freeable) from [] (kernel_init+0xd/0xe0) [1.235289] [] (kernel_init) from [] (ret_from_fork+0x11/0x1c) This patch moves those statistic fields of cma_stat into struct cma and introduces cma_kobject wrapper to follow kobject's rule. At the same time, it fixes other routines based on suggestions[3][4]. [1] https://lore.kernel.org/linux-mm/ycoamxqt6dzkc...@kroah.com/ [2] https://lore.kernel.org/linux-mm/fead70a2-4330-79ff-e79a-d8511eab1...@gmail.com/ [3] https://lore.kernel.org/linux-mm/20210323195050.2577017-1-minc...@kernel.org/ [4] https://lore.kernel.org/linux-mm/20210324010547.4134370-1-minc...@kernel.org/ Reported-by: Dmitry Osipenko Tested-by: Dmitry Osipenko Suggested-by: Dmitry Osipenko Suggested-by: John Hubbard Suggested-by: Matthew Wilcox Signed-off-by: Minchan Kim --- I belive it's worth to have separate patch rather than replacing original patch. It will also help to merge without conflict since we already filed other patch based on it. Strictly speaking, separating fix part and readbility part in this patch would be better but it's gray to separate them since most code in this patch was done while we were fixing the bug. Since we don't release it yet, I hope it will work. Otherwise, I can send a replacement patch inclucing all of changes happend until now with gathering SoB. If we still have a choice, we should not merge a patch that has a known serious problem, such as a crash. That's only done if the broken problematic patch has already been committed to a tree that doesn't allow rebasing, such as of course the main linux.git. Here, I *think* it's just in linux-next and mmotm, so we still are allowed to fix the original patch. thanks, -- John Hubbard NVIDIA
Re: [PATCH] mm: cma: fix corruption cma_sysfs_alloc_pages_count
24.03.2021 22:20, Minchan Kim пишет: > static int __init cma_sysfs_init(void) > { > - int i = 0; > + struct kobject *cma_kobj_root; > + struct cma_kobject *cma_kobj; > struct cma *cma; > + unsigned int i; > while (--i >= 0) { Do you realize that this doesn't work anymore? > cma = _areas[i]; > - kobject_put(>stat->kobj); > - } > > - kfree(cma_stats); > - kobject_put(cma_kobj); > + kobject_put(>cma_kobj->kobj); > + kfree(cma->cma_kobj); Freeing a null pointer? > + cma->cma_kobj = NULL; > + } > + kobject_put(cma_kobj_root);
[PATCH] mm: cma: fix corruption cma_sysfs_alloc_pages_count
struct cma_stat's lifespan for cma_sysfs is different with struct cma because kobject for sysfs requires dynamic object while CMA is static object[1]. When CMA is initialized, it couldn't use slab to allocate cma_stat since slab was not initialized yet. Thus, it allocates the dynamic object in subsys_initcall. However, the cma allocation can happens before subsys_initcall then, it goes crash. Dmitry reported[2]: .. [1.226190] [] (cma_sysfs_alloc_pages_count) from [] (cma_alloc+0x153/0x274) [1.226720] [] (cma_alloc) from [] (__alloc_from_contiguous+0x37/0x8c) [1.227272] [] (__alloc_from_contiguous) from [] (atomic_pool_init+0x7b/0x126) [1.233596] [] (atomic_pool_init) from [] (do_one_initcall+0x45/0x1e4) [1.234188] [] (do_one_initcall) from [] (kernel_init_freeable+0x157/0x1a6) [1.234741] [] (kernel_init_freeable) from [] (kernel_init+0xd/0xe0) [1.235289] [] (kernel_init) from [] (ret_from_fork+0x11/0x1c) This patch moves those statistic fields of cma_stat into struct cma and introduces cma_kobject wrapper to follow kobject's rule. At the same time, it fixes other routines based on suggestions[3][4]. [1] https://lore.kernel.org/linux-mm/ycoamxqt6dzkc...@kroah.com/ [2] https://lore.kernel.org/linux-mm/fead70a2-4330-79ff-e79a-d8511eab1...@gmail.com/ [3] https://lore.kernel.org/linux-mm/20210323195050.2577017-1-minc...@kernel.org/ [4] https://lore.kernel.org/linux-mm/20210324010547.4134370-1-minc...@kernel.org/ Reported-by: Dmitry Osipenko Tested-by: Dmitry Osipenko Suggested-by: Dmitry Osipenko Suggested-by: John Hubbard Suggested-by: Matthew Wilcox Signed-off-by: Minchan Kim --- I belive it's worth to have separate patch rather than replacing original patch. It will also help to merge without conflict since we already filed other patch based on it. Strictly speaking, separating fix part and readbility part in this patch would be better but it's gray to separate them since most code in this patch was done while we were fixing the bug. Since we don't release it yet, I hope it will work. Otherwise, I can send a replacement patch inclucing all of changes happend until now with gathering SoB. mm/cma.c | 4 +-- mm/cma.h | 25 +++--- mm/cma_sysfs.c | 88 -- 3 files changed, 65 insertions(+), 52 deletions(-) diff --git a/mm/cma.c b/mm/cma.c index 90e27458ddb7..08c45157911a 100644 --- a/mm/cma.c +++ b/mm/cma.c @@ -509,11 +509,11 @@ struct page *cma_alloc(struct cma *cma, size_t count, unsigned int align, out: if (page) { count_vm_event(CMA_ALLOC_SUCCESS); - cma_sysfs_alloc_pages_count(cma, count); + cma_sysfs_account_success_pages(cma, count); } else { count_vm_event(CMA_ALLOC_FAIL); if (cma) - cma_sysfs_fail_pages_count(cma, count); + cma_sysfs_account_fail_pages(cma, count); } return page; diff --git a/mm/cma.h b/mm/cma.h index 95d1aa2d808a..37b9b7858c8e 100644 --- a/mm/cma.h +++ b/mm/cma.h @@ -5,12 +5,8 @@ #include #include -struct cma_stat { - spinlock_t lock; - /* the number of CMA page successful allocations */ - unsigned long nr_pages_succeeded; - /* the number of CMA page allocation failures */ - unsigned long nr_pages_failed; +struct cma_kobject { + struct cma *cma; struct kobject kobj; }; @@ -27,7 +23,12 @@ struct cma { #endif char name[CMA_MAX_NAME]; #ifdef CONFIG_CMA_SYSFS - struct cma_stat *stat; + /* the number of CMA page successful allocations */ + atomic64_t nr_pages_succeeded; + /* the number of CMA page allocation failures */ + atomic64_t nr_pages_failed; + /* kobject requires dynamic object */ + struct cma_kobject *cma_kobj; #endif }; @@ -40,10 +41,12 @@ static inline unsigned long cma_bitmap_maxno(struct cma *cma) } #ifdef CONFIG_CMA_SYSFS -void cma_sysfs_alloc_pages_count(struct cma *cma, size_t count); -void cma_sysfs_fail_pages_count(struct cma *cma, size_t count); +void cma_sysfs_account_success_pages(struct cma *cma, unsigned long nr_pages); +void cma_sysfs_account_fail_pages(struct cma *cma, unsigned long nr_pages); #else -static inline void cma_sysfs_alloc_pages_count(struct cma *cma, size_t count) {}; -static inline void cma_sysfs_fail_pages_count(struct cma *cma, size_t count) {}; +static inline void cma_sysfs_account_success_pages(struct cma *cma, + unsigned long nr_pages) {}; +static inline void cma_sysfs_account_fail_pages(struct cma *cma, + unsigned long nr_pages) {}; #endif #endif diff --git a/mm/cma_sysfs.c b/mm/cma_sysfs.c index 3134b2b3a96d..a670a80aad6f 100644 --- a/mm/cma_sysfs.c +++ b/mm/cma_sysfs.c @@ -11,50 +11,54 @@ #include "cma.h" -static struct cma_stat *cma_stats; - -void