Hey,

> While testing a mechanism to trigger a modem restart based on port timeout 
> (and disposing of modem object in MM) I managed to segfault ModemManager.
> I'm using vanilla MM 1.12.4 built for armhf target.
>
> What I've found, is that a race condition is possible on 
> serial_port_timed_out_cb, which is called one more time than it should, 
> causing a use-after-free.
> I've found out that by the fact I used default value of max_timeouts (10) and 
> the method got called with n_consecutive_timeouts = 11, but according to the 
> condition there, the cancellable was triggered on previous call with 
> n_consecutive_timeouts = 10.
>
> The crash happened when calling g_dbus_object_get_object_path (G_DBUS_OBJECT 
> (self) when evaluating arguments for the error log in 
> serial_port_timed_out_cb :
>
> Stack trace:
> #0  g_dbus_object_get_object_path (object=0x68f1c0) at 
> /usr/src/debug/glib-2.0/1_2.48.2-r0/glib-2.48.2/gio/gdbusobject.c:110
> #1  0x0005844c in serial_port_timed_out_cb (port=<optimized out>, 
> n_consecutive_timeouts=11, self=0x68f1c0)
>     at 
> /usr/src/debug/modemmanager/1.12+gitrAUTOINC+cef03ce291-r6/git/src/mm-base-modem.c:130
> #2  0x7699ce48 in g_cclosure_marshal_VOID__FLAGSv (closure=<optimized out>, 
> return_value=<optimized out>, instance=<optimized out>, args=..., 
> marshal_data=0x0, n_params=1,
>     param_types=0x670fa0) at 
> /usr/src/debug/glib-2.0/1_2.48.2-r0/glib-2.48.2/gobject/gmarshal.c:1570
> #3  0x7699aaec in _g_closure_invoke_va (closure=closure@entry=0x66e548, 
> return_value=return_value@entry=0x0, instance=0x60ea18, 
> instance@entry=0x6ac628, args=args@entry=..., n_params=1,
>     param_types=0x670fa0) at 
> /usr/src/debug/glib-2.0/1_2.48.2-r0/glib-2.48.2/gobject/gclosure.c:867
> #4  0x769b5f44 in g_signal_emit_valist (instance=instance@entry=0x6ac628, 
> signal_id=<optimized out>, detail=detail@entry=0, var_args=..., 
> var_args@entry=...)
>     at /usr/src/debug/glib-2.0/1_2.48.2-r0/glib-2.48.2/gobject/gsignal.c:3294
> #5  0x769b6580 in g_signal_emit (instance=instance@entry=0x6ac628, 
> signal_id=<optimized out>, detail=0) at 
> /usr/src/debug/glib-2.0/1_2.48.2-r0/glib-2.48.2/gobject/gsignal.c:3441
> #6  0x000d225c in port_serial_timed_out (data=<optimized out>) at 
> /usr/src/debug/modemmanager/1.12+gitrAUTOINC+cef03ce291-r6/git/src/mm-port-serial.c:797
> #7  0x768d4758 in g_timeout_dispatch (source=0x682280, callback=0xd21d8 
> <port_serial_timed_out>, user_data=<optimized out>)
>     at /usr/src/debug/glib-2.0/1_2.48.2-r0/glib-2.48.2/glib/gmain.c:4577
> #8  0x768d3b1c in g_main_dispatch (context=0x614bf8) at 
> /usr/src/debug/glib-2.0/1_2.48.2-r0/glib-2.48.2/glib/gmain.c:3154
> #9  g_main_context_dispatch (context=context@entry=0x614bf8) at 
> /usr/src/debug/glib-2.0/1_2.48.2-r0/glib-2.48.2/glib/gmain.c:3769
> #10 0x768d3eb4 in g_main_context_iterate (context=0x614bf8, 
> block=block@entry=1, dispatch=dispatch@entry=1, self=<optimized out>)
>     at /usr/src/debug/glib-2.0/1_2.48.2-r0/glib-2.48.2/glib/gmain.c:3840
> #11 0x768d4210 in g_main_loop_run (loop=0x61ef10) at 
> /usr/src/debug/glib-2.0/1_2.48.2-r0/glib-2.48.2/glib/gmain.c:4034
> #12 0x00044714 in main (argc=<optimized out>, argv=<optimized out>) at 
> /usr/src/debug/modemmanager/1.12+gitrAUTOINC+cef03ce291-r6/git/src/main.c:206
>
> Frame #1:
> (gdb) info frame
> Stack level 0, frame at 0x7e9fca90:
>  pc = 0x76ae816c in g_dbus_object_get_object_path 
> (/usr/src/debug/glib-2.0/1_2.48.2-r0/glib-2.48.2/gio/gdbusobject.c:110); 
> saved pc = 0x5844c
>  called by frame at 0x7e9fcac0
>  source language c.
>  Arglist at 0x7e9fca88, args: object=0x68f1c0
>  Locals at 0x7e9fca88, Previous frame's sp is 0x7e9fca90
>  Saved registers:
>   r4 at 0x7e9fca88, lr at 0x7e9fca8c
> object = 0x68f1c0
>
> (gdb) info args
> port = <optimized out>
> n_consecutive_timeouts = 11
>
> (gdb) print *self
> $2 = {parent = {parent_instance = {parent_instance = {g_type_instance = 
> {g_class = 0x0}, ref_count = 0, qdata = 0x0}, priv = 0x68f1b0}, priv = 0x0}, 
> priv = 0x68f148}
> (gdb) print *self->priv
> $4 = {connection = 0x0, cancellable = 0x0, invalid_if_cancelled = 61, device 
> = 0x6b22f0 "XEb", drivers = 0x6abda0, plugin = 0x653fc0 "\230\275j", 
> vendor_id = 7111, product_id = 33,
>   hotplugged = 0, valid = 0, reprobe = 0, max_timeouts = 10, authp = 0x0, 
> authp_cancellable = 0x0, ports = 0x0, primary = 0x0, secondary = 0x0, qcdm = 
> 0x0, data = 0x0, gps_control = 0x0,
>   gps = 0x0, audio = 0x0, enable_tasks = 0x0, disable_tasks = 0x0, qmi = 0x0, 
> mbim = 0x0}
>
> I have a full session log and a core dump, so pretty much anything could be 
> extracted if needed. So far it only reproduced once. I'm also analyzing 
> myself how to fix it, but I'd be glad if you could take a look at this.
>

The error makes sense; the port object is living for longer than the
modem object, and so by the time the timed-out signal is emitted, the
modem object is already unrefed. The fix for this issue is to keep
track of all signal ids returned by g_signal_connect(port,
"timed-out",...) and make sure they are disconnected when the modem
object is unref-ed. Let me try to patch that.

-- 
Aleksander
https://aleksander.es
_______________________________________________
ModemManager-devel mailing list
ModemManager-devel@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/modemmanager-devel

Reply via email to