On 2024-11-07 07:25, Marco Elver wrote:
prctl() is a complex syscall which multiplexes its functionality based
on a large set of PR_* options. Currently we count 64 such options. The
return value of unknown options is -EINVAL, and doesn't distinguish from
known options that were passed invalid args that also return -EINVAL.

To understand if programs are attempting to use prctl() options not yet
available on the running kernel, provide the task_prctl_unknown
tracepoint.

Note, this tracepoint is in an unlikely cold path, and would therefore
be suitable for continuous monitoring (e.g. via perf_event_open).

While the above is likely the simplest usecase, additionally this
tracepoint can help unlock some testing scenarios (where probing
sys_enter or sys_exit causes undesirable performance overheads):

   a. unprivileged triggering of a test module: test modules may register a
      probe to be called back on task_prctl_unknown, and pick a very large
      unknown prctl() option upon which they perform a test function for an
      unprivileged user;

   b. unprivileged triggering of an eBPF program function: similar
      as idea (a).

Example trace_pipe output:

   test-484     [000] .....   631.748104: task_prctl_unknown: comm=test 
option=1234 arg2=101 arg3=102 arg4=103 arg5=104


My concern is that we start adding tons of special-case
tracepoints to the implementation of system calls which
are redundant with the sys_enter/exit tracepoints.

Why favor this approach rather than hooking on sys_enter/exit ?

Thanks,

Mathieu

Signed-off-by: Marco Elver <el...@google.com>
---
v2:
* Remove "pid" in trace output (suggested by Steven).
---
  include/trace/events/task.h | 41 +++++++++++++++++++++++++++++++++++++
  kernel/sys.c                |  3 +++
  2 files changed, 44 insertions(+)

diff --git a/include/trace/events/task.h b/include/trace/events/task.h
index 47b527464d1a..9202cb2524c4 100644
--- a/include/trace/events/task.h
+++ b/include/trace/events/task.h
@@ -56,6 +56,47 @@ TRACE_EVENT(task_rename,
                __entry->newcomm, __entry->oom_score_adj)
  );
+/**
+ * task_prctl_unknown - called on unknown prctl() option
+ * @task:      pointer to the current task
+ * @option:    option passed
+ * @arg2:      arg2 passed
+ * @arg3:      arg3 passed
+ * @arg4:      arg4 passed
+ * @arg5:      arg5 passed
+ *
+ * Called on an unknown prctl() option.
+ */
+TRACE_EVENT(task_prctl_unknown,
+
+       TP_PROTO(struct task_struct *task, int option, unsigned long arg2, 
unsigned long arg3,
+                unsigned long arg4, unsigned long arg5),
+
+       TP_ARGS(task, option, arg2, arg3, arg4, arg5),
+
+       TP_STRUCT__entry(
+               __string(       comm,           task->comm   )
+               __field(        int,            option)
+               __field(        unsigned long,  arg2)
+               __field(        unsigned long,  arg3)
+               __field(        unsigned long,  arg4)
+               __field(        unsigned long,  arg5)
+       ),
+
+       TP_fast_assign(
+               __assign_str(comm);
+               __entry->option = option;
+               __entry->arg2 = arg2;
+               __entry->arg3 = arg3;
+               __entry->arg4 = arg4;
+               __entry->arg5 = arg5;
+       ),
+
+       TP_printk("comm=%s option=%d arg2=%ld arg3=%ld arg4=%ld arg5=%ld",
+                 __get_str(comm), __entry->option,
+                 __entry->arg2, __entry->arg3, __entry->arg4, __entry->arg5)
+);
+
  #endif
/* This part must be outside protection */
diff --git a/kernel/sys.c b/kernel/sys.c
index 4da31f28fda8..dd0a71b68558 100644
--- a/kernel/sys.c
+++ b/kernel/sys.c
@@ -75,6 +75,8 @@
  #include <asm/io.h>
  #include <asm/unistd.h>
+#include <trace/events/task.h>
+
  #include "uid16.h"
#ifndef SET_UNALIGN_CTL
@@ -2785,6 +2787,7 @@ SYSCALL_DEFINE5(prctl, int, option, unsigned long, arg2, 
unsigned long, arg3,
                error = RISCV_SET_ICACHE_FLUSH_CTX(arg2, arg3);
                break;
        default:
+               trace_task_prctl_unknown(me, option, arg2, arg3, arg4, arg5);
                error = -EINVAL;
                break;
        }

--
Mathieu Desnoyers
EfficiOS Inc.
https://www.efficios.com


Reply via email to