On Tue, Jul 02, 2019 at 02:31:51PM +0900, Eiichi Tsukata wrote:
> Put the boundary check before it accesses user space to prevent unnecessary
> access which might crash the machine.
> 
> Especially, ftrace preemptirq/irq_disable event with user stack trace
> option can trigger SEGV in pid 1 which leads to panic.
> 
> Reproducer:
> 
>   CONFIG_PREEMPTIRQ_TRACEPOINTS=y
>   # echo 1 > events/preemptirq/enable
>   # echo userstacktrace > trace_options
> 
> Output:
> 
>   Kernel panic - not syncing: Attempted to kill init! exitcode=0x0000000b
>   CPU: 1 PID: 1 Comm: systemd Not tainted 5.2.0-rc7+ #10

Killing systemd is a feature :-)

>   Hardware name: QEMU Standard PC (i440FX + PIIX, 1996)
>   Call Trace:
>    dump_stack+0x67/0x90
>    panic+0x100/0x2c6
>    do_exit.cold+0x4e/0x101
>    do_group_exit+0x3a/0xa0
>    get_signal+0x14a/0x8e0
>    do_signal+0x36/0x650
>    exit_to_usermode_loop+0x92/0xb0
>    prepare_exit_to_usermode+0x6f/0xb0
>    retint_user+0x8/0x18
>   RIP: 0033:0x55be7ad1c89f
>   Code: Bad RIP value.

^^^ that's weird, no amount of unwinding should affect regs->ip.

>   RSP: 002b:00007ffe329a4b00 EFLAGS: 00010202
>   RAX: 0000000000000768 RBX: 00007ffe329a4ba0 RCX: 00007ff0063aa469
>   RDX: 00007ff0066761de RSI: 00007ffe329a4b20 RDI: 0000000000000768
>   RBP: 000000000000000b R08: 0000000000000000 R09: 00007ffe329a4e2f
>   R10: 0000000000000000 R11: 0000000000000246 R12: 0000000000000768
>   R13: 0000000000000000 R14: 0000000000000004 R15: 000055be7b3d3560
>   Kernel Offset: 0x2a000000 from 0xffffffff81000000 (relocation range: 
> 0xffffffff80000000-0xffffffffbfffffff)
> 
> Fixes: 02b67518e2b1 ("tracing: add support for userspace stacktraces in 
> tracing/iter_ctrl")
> Signed-off-by: Eiichi Tsukata <de...@etsukata.com>
> ---
>  arch/x86/kernel/stacktrace.c | 4 ++--
>  1 file changed, 2 insertions(+), 2 deletions(-)
> 
> diff --git a/arch/x86/kernel/stacktrace.c b/arch/x86/kernel/stacktrace.c
> index 2abf27d7df6b..6d0c608ffe34 100644
> --- a/arch/x86/kernel/stacktrace.c
> +++ b/arch/x86/kernel/stacktrace.c
> @@ -123,12 +123,12 @@ void arch_stack_walk_user(stack_trace_consume_fn 
> consume_entry, void *cookie,
>       while (1) {
>               struct stack_frame_user frame;
>  
> +             if ((unsigned long)fp < regs->sp)
> +                     break;
>               frame.next_fp = NULL;
>               frame.ret_addr = 0;
>               if (!copy_stack_frame(fp, &frame))
>                       break;
> -             if ((unsigned long)fp < regs->sp)
> -                     break;

Aside of which, that doesn't make sense, even if copy_stack_frame() was
fed utter garbage it should never result in the user process being
affected.

It does: "pagefault_disable(); __copy_from_user_inatomic()", which
should take the fault and catch it in an extable and have it return
-EFAULT.

Something is really fishy here, maybe Josh has an idea?

>               if (frame.ret_addr) {
>                       if (!consume_entry(cookie, frame.ret_addr, false))
>                               return;
> -- 
> 2.21.0
> 

Reply via email to