Re: CVS (STABLE_0_3) works for Madwifi/Atheros, but starts too late
Quoting Dan Williams [EMAIL PROTECTED]:

> In GDB, when you get here, can you do:
>
>   print dhcp_resp
>   print dhcp_resp.dhcp_msg
>   print dhcp_resp.dhcp_msg.options
>
> And send along the results? That should give me the actual packet the
> DHCP server passed back.

It looks like the DHCP client is _thinking_ that it got data, but really it didn't. It also feels to me like it's a race condition, because it is much harder to reproduce since I installed the debuginfo package (whereas it happened really quick without it).

-derek

NetworkManager: DHCP: Got some data of length 66.
NetworkManager: DHCP: Reply message was not UDP (ip_hdr->protocol = 6, IPPROTO_UDP = 17), won't use it.
NetworkManager: DHCP: Got some data of length 52.
NetworkManager: DHCP: Reply message was not UDP (ip_hdr->protocol = 6, IPPROTO_UDP = 17), won't use it.
NetworkManager: DHCP: Got some data of length 58.
NetworkManager: DHCP: Reply message was not UDP (ip_hdr->protocol = 6, IPPROTO_UDP = 17), won't use it.
NetworkManager: DHCP: Got some data of length 52.
NetworkManager: DHCP: Reply message was not UDP (ip_hdr->protocol = 6, IPPROTO_UDP = 17), won't use it.
NetworkManager: DHCP: Got some data of length 72.
NetworkManager: DHCP: Reply message was not UDP (ip_hdr->protocol = 6, IPPROTO_UDP = 17), won't use it.
NetworkManager: DHCP: Got some data of length 52.
NetworkManager: DHCP: Reply message was not UDP (ip_hdr->protocol = 6, IPPROTO_UDP = 17), won't use it.
NetworkManager: DHCP: Got some data of length 72.
NetworkManager: DHCP: Reply message was not UDP (ip_hdr->protocol = 6, IPPROTO_UDP = 17), won't use it.
NetworkManager: DHCP: Got some data of length 52.
NetworkManager: DHCP: Reply message was not UDP (ip_hdr->protocol = 6, IPPROTO_UDP = 17), won't use it.
NetworkManager: DHCP: Got some data of length 72.
NetworkManager: DHCP: Reply message was not UDP (ip_hdr->protocol = 6, IPPROTO_UDP = 17), won't use it.
NetworkManager: DHCP: Got some data of length 52.
NetworkManager: DHCP: Reply message was not UDP (ip_hdr->protocol = 6, IPPROTO_UDP = 17), won't use it.
NetworkManager: DHCP: Got some data of length 89.
NetworkManager: DHCP: Reply message was not UDP (ip_hdr->protocol = 1, IPPROTO_UDP = 17), won't use it.
Detaching after fork from child process 6310.
---Type <return> to continue, or q <return> to quit---
Detaching after fork from child process 6311.
NetworkManager: DHCP: Got some data of length 212.
NetworkManager: DHCP: Reply message's source port (138) was not the DHCP server port number (67), won't use it.

Program received signal SIGSEGV, Segmentation fault.
[Switching to Thread -1208362064 (LWP 5785)]
dhcp_init (iface=0x81c74c0) at client.c:767
767 memcpy (&(iface->siaddr), iface->dhcp_options.val[dhcpServerIdentifier], 4);
(gdb) print dhcp_resp
$1 = {server_ip_addr = 0, server_hw_addr = "\000\000\000\000\000", dhcp_msg = {
    op = 0 '\0', htype = 0 '\0', hlen = 0 '\0', hops = 0 '\0', xid = 0,
    secs = 0, flags = 0, ciaddr = 0, yiaddr = 0, siaddr = 0, giaddr = 0,
    chaddr = '\0' <repeats 15 times>, sname = '\0' <repeats 63 times>,
    file = '\0' <repeats 127 times>, options = '\0' <repeats 311 times>}}
(gdb) print dhcp_resp.dhcp_msg
$2 = {op = 0 '\0', htype = 0 '\0', hlen = 0 '\0', hops = 0 '\0', xid = 0,
  secs = 0, flags = 0, ciaddr = 0, yiaddr = 0, siaddr = 0, giaddr = 0,
  chaddr = '\0' <repeats 15 times>, sname = '\0' <repeats 63 times>,
  file = '\0' <repeats 127 times>, options = '\0' <repeats 311 times>}
(gdb) print dhcp_resp.dhcp_msg.options
$3 = '\0' <repeats 311 times>
(gdb) quit

--
Derek Atkins, SB '93 MIT EE, SM '95 MIT Media Laboratory
Member, MIT Student Information Processing Board (SIPB)
URL: http://web.mit.edu/warlord/ PP-ASEL-IA N1NWH
[EMAIL PROTECTED] PGP key available
___
NetworkManager-list mailing list
NetworkManager-list@gnome.org
http://mail.gnome.org/mailman/listinfo/networkmanager-list
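Editorial note: the gdb output above shows a completely zeroed dhcp_resp at the moment line 767 copies the server identifier. That would be consistent with (though it does not prove) the option pointer being unset when memcpy runs. Below is a minimal sketch of a defensive copy, assuming a dhcpcd-style option table. The struct layout, the names dhcp_options_sketch and copy_server_identifier, and the return convention are all hypothetical and are not taken from NetworkManager's client.c; only the option code 54 (Server Identifier, RFC 2132) is a known value.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* DHCP option code for "Server Identifier" (RFC 2132). */
#define DHCP_OPT_SERVER_IDENTIFIER 54

/* Hypothetical option table: one value pointer and one length per code. */
struct dhcp_options_sketch {
    const void *val[256];   /* NULL when the option was not present */
    uint8_t     len[256];   /* length in bytes of val[i] */
};

/*
 * Copy the 4-byte server identifier into *siaddr, keeping the byte order
 * it had on the wire. Returns 0 on success, or -1 if the option is missing
 * or has the wrong size; *siaddr is left untouched in that case.
 */
int copy_server_identifier(const struct dhcp_options_sketch *opts,
                           uint32_t *siaddr)
{
    if (opts == NULL || siaddr == NULL)
        return -1;

    const void *val = opts->val[DHCP_OPT_SERVER_IDENTIFIER];
    if (val == NULL || opts->len[DHCP_OPT_SERVER_IDENTIFIER] != 4)
        return -1;

    memcpy(siaddr, val, 4);
    return 0;
}
```

A caller would treat -1 as "no usable offer yet" and go back to waiting for a reply instead of dereferencing the missing option.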
Re: CVS (STABLE_0_3) works for Madwifi/Atheros, but starts too late
On Fri, May 13, 2005 12:33 pm, Derek Atkins said:

> NetworkManager: DHCP: Got some data of length 66.
> NetworkManager: DHCP: Reply message was not UDP (ip_hdr->protocol = 6, IPPROTO_UDP = 17), won't use it.
> NetworkManager: DHCP: Got some data of length 52.
> NetworkManager: DHCP: Reply message was not UDP (ip_hdr->protocol = 6, IPPROTO_UDP = 17), won't use it.
> ...

Used to get this kind of garbage when I was having trouble with WEP. Just a thought. Is it possible that the cause is not actually garbage from the DHCP but just off the air noise?

--
Paul Dugas, Computer Engineer
Dugas Enterprises, LLC
[EMAIL PROTECTED]
phone: 404-932-1355
522 Black Canyon Park
http://dugas.cc
fax: 866-751-6494
Canton, GA 30114 USA
Re: CVS (STABLE_0_3) works for Madwifi/Atheros, but starts too late
Quoting Paul Dugas [EMAIL PROTECTED]:

> On Fri, May 13, 2005 12:33 pm, Derek Atkins said:
>
> > NetworkManager: DHCP: Got some data of length 66.
> > NetworkManager: DHCP: Reply message was not UDP (ip_hdr->protocol = 6, IPPROTO_UDP = 17), won't use it.
> > NetworkManager: DHCP: Got some data of length 52.
> > NetworkManager: DHCP: Reply message was not UDP (ip_hdr->protocol = 6, IPPROTO_UDP = 17), won't use it.
> > ...
>
> Used to get this kind of garbage when I was having trouble with WEP.
> Just a thought. Is it possible that the cause is not actually garbage
> from the DHCP but just off the air noise?

Unlikely... I'm getting this when the ethernet is plugged in, so it should only be listening to the 802.3, not 802.11. However, there is definitely a correlation between these error messages and the crash. The crashes always follow a long string of these (however, just seeing these errors does not necessarily mean a crash is imminent).

-derek
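Editorial note: the two rejection messages in the quoted log correspond to checks on the IP protocol field and on the UDP source port. Protocol 6 is TCP and protocol 1 is ICMP, so these look like ordinary IP packets reaching the DHCP client's socket rather than radio noise; source port 138 is the NetBIOS datagram service. Below is a minimal sketch of such a filter over a raw IPv4 packet, assuming the buffer starts at the IP header with no link-layer header in front. The function and enum names are hypothetical and this is not NetworkManager's actual code.

```c
#include <stddef.h>
#include <stdint.h>

#define PROTO_UDP        17  /* IANA protocol number for UDP */
#define DHCP_SERVER_PORT 67  /* BOOTP/DHCP server port */

enum reply_verdict {
    REPLY_OK = 0,
    REPLY_TOO_SHORT,
    REPLY_NOT_IPV4,
    REPLY_NOT_UDP,
    REPLY_WRONG_PORT
};

/* Classify a raw IPv4 packet of 'len' bytes that starts at the IP header. */
enum reply_verdict check_dhcp_reply(const uint8_t *pkt, size_t len)
{
    if (pkt == NULL || len < 20)
        return REPLY_TOO_SHORT;          /* no room for a minimal IP header */

    if ((pkt[0] >> 4) != 4)
        return REPLY_NOT_IPV4;

    size_t ihl = (size_t)(pkt[0] & 0x0f) * 4;   /* IP header length, bytes */
    if (ihl < 20 || len < ihl)
        return REPLY_TOO_SHORT;

    if (pkt[9] != PROTO_UDP)
        return REPLY_NOT_UDP;            /* e.g. 6 = TCP, 1 = ICMP */

    if (len < ihl + 8)
        return REPLY_TOO_SHORT;          /* no room for the UDP header */

    /* UDP source port is the first 16-bit field, big-endian on the wire. */
    uint16_t src_port = (uint16_t)((pkt[ihl] << 8) | pkt[ihl + 1]);
    if (src_port != DHCP_SERVER_PORT)
        return REPLY_WRONG_PORT;         /* e.g. 138 = NetBIOS datagram */

    return REPLY_OK;
}
```

Passing this filter only says the packet could be a DHCP reply; the payload still has to be parsed and validated before any option is used.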
Re: CVS (STABLE_0_3) works for Madwifi/Atheros, but starts too late
Bill Moss [EMAIL PROTECTED] writes:

> Should not --type=method-call be --type=method_call

Indeed, this is true. Thanks.

-derek
Re: CVS (STABLE_0_3) works for Madwifi/Atheros, but starts too late
Quoting Dan Williams [EMAIL PROTECTED]:

> NM is likely to be very unhappy if you just suspend it, and wireless
> drivers in general have issues with suspend. There really is no method
> for NM to know when you've put the computer to sleep since nothing sends
> global events out (PowerManager+HAL will eventually do this the Right Way
> using dbus).

I already fixed the device issue in my acpi scripts.. I ifdown the interface manually and then rmmod all the device modules (I have no idea why modprobe -r doesn't DWIW). This seems to work fine as far as the module itself is concerned.

> NM does have a sleep dbus method that will make NM stop and down all
> devices, and a wake method to wake it up. Those should be able to be done
> in the ACPI scripts fairly simply:

Ooh, this is useful to know. Is this documented somewhere? I'll try this and see if it works. Thanks!

> /usr/bin/dbus-send --system --dest=org.freedesktop.NetworkManager --type=method-call /org/freedesktop/NetworkManager org.freedesktop.NetworkManager.sleep
> sleep 5;
>
> and:
>
> /usr/bin/dbus-send --system --dest=org.freedesktop.NetworkManager --type=method-call /org/freedesktop/NetworkManager org.freedesktop.NetworkManager.wake
>
> Dan

-derek
Re: CVS (STABLE_0_3) works for Madwifi/Atheros, but starts too late
On Thu, 12 May 2005, Derek Atkins wrote:

> Quoting Bill Moss [EMAIL PROTECTED]:
>
> > Starting dbus, hal, and NM early will work in the wired case because a
> > connection is established before nm-applet is run so ntpd will find a
> > connection when it runs later, but this won't work for wireless because
> > a connection will not be made until nm-applet runs after all the init
> > scripts have run. Am I missing something?
>
> I guess that was my question. Does NM need the applet in order to select
> the wireless network (and WEP key)? Or will NM have access to the list of
> preferred networks (and WEP keys) before I login? I would certainly prefer
> if the list were global instead of per-user.

No, the wireless networks are per-user and until you log in to start GConf, they are not available to NetworkManager.

> > If ntpd doesn't find a connection when it is run, it becomes brain dead
> > and will not work until restarted. Depending on the order of init
> > scripts is problematic. Network dependent services need some way to be
> > told that a link is available or some way to be started or restarted
> > when a link becomes available.
>
> Yep, this is indeed my problem. Currently (well, before I got NM working
> at all) I was logging in as root, setting up my network, re-starting my
> services (like ntpd) that failed during bootup, then logging out and
> logging in as me. I would prefer if NM could do all this during the boot
> sequence, even with wireless.

You might want to investigate using NetworkManagerDispatcher. You can drop a script into /etc/NetworkManager.d that will be executed by NetworkManagerDispatcher when connections go up or down. Then you can kick ntpd manually.

> > I am running Head 1.361 under FC4 Test 3 on a T42. All seems very stable
> > including vpnc support.
>
> I haven't tried vpnc support.
>
> > However I do have a problem with vpnc and nscd and selinux:
> >
> > audit(1115923917.382:0): avc: denied { read write } for pid=3378 exe=/usr/sbin/nscd path=socket:[687155] dev=sockfs ino=687155 scontext=root:system_r:nscd_t tcontext=root:system_r:unconfined_t tclass=udp_socket
> > audit(1115923917.382:0): avc: denied { read write } for pid=3378 exe=/usr/sbin/nscd path=/dev/net/tun dev=tmpfs ino=2009 scontext=root:system_r:nscd_t tcontext=system_u:object_r:tun_tap_device_t tclass=chr_file

I've noticed a few of these too. I've spoken with Dan Walsh (an SELinux guy at Red Hat) and he's aware that we need more finely tuned policy for NetworkManager.

Dan
Re: CVS (STABLE_0_3) works for Madwifi/Atheros, but starts too late
On Thu, 12 May 2005, Derek Atkins wrote:

> [snip]
> May 12 20:14:21 cliodev NetworkManager: DHCP: Got some data of length 1300.
> May 12 20:14:21 cliodev NetworkManager: DHCP: Reply message was not UDP (ip_hdr->protocol = 6, IPPROTO_UDP = 17), won't use it.
> May 12 20:15:30 cliodev NetworkManager: starting...
> May 12 20:15:30 cliodev NetworkManager: NetworkManager startup succeeded
> May 12 20:15:30 cliodev NetworkManager: eth0: Driver support level for 'e1000' is fully-supported

Can you do this? As root:

  gdb /usr/bin/NetworkManager
  r --no-daemon
  do stuff until it segfaults
  t a a bt
  mail me the result

Thanks!

Dan
Re: CVS (STABLE_0_3) works for Madwifi/Atheros, but starts too late
Quoting Dan Williams [EMAIL PROTECTED]:

> Can you do this?

Of course...

> As root:
>
>   gdb /usr/bin/NetworkManager
>   r --no-daemon
>   do stuff until it segfaults
>   t a a bt
>   mail me the result
>
> Thanks!
>
> Dan

You're welcome. Here's the backtrace. (It took me about 2 minutes to do this the first time, but then I realized I hadn't installed the debuginfo package -- then it took me another 7 to reproduce it again). Let me know if you need anything else?

-derek

Detaching after fork from child process 10357.
NetworkManager: DHCP: Got some data of length 71.
NetworkManager: DHCP: Reply message was not UDP (ip_hdr->protocol = 6, IPPROTO_UDP = 17), won't use it.

Program received signal SIGSEGV, Segmentation fault.
[Switching to Thread -1208894544 (LWP 10127)]
dhcp_init (iface=0x824e0a0) at client.c:767
767 memcpy (&(iface->siaddr), iface->dhcp_options.val[dhcpServerIdentifier], 4);
(gdb) t a a bt

Thread 3 (Thread -1219384400 (LWP 10130)):
#0 0x0042f7a2 in _dl_sysinfo_int80 () from /lib/ld-linux.so.2
#1 0x004d31a6 in __nanosleep_nocancel () from /lib/tls/libc.so.6
#2 0x0050931a in usleep () from /lib/tls/libc.so.6
#3 0x00105b60 in iw_scan () from /lib/libiw.so.27
#4 0x0805c07a in nm_completion_scan_has_results (tries=0, args=0xfdfc) at NetworkManagerDevice.c:3674
#5 0x0805fc7b in nm_v_wait_for_completion_or_timeout (max_tries=400, max_time=0x0, interval_usecs=20, test_func=0x805c012 <nm_completion_scan_has_results>, action_func=0, args=0xb751a230) at NetworkManagerUtils.c:671
#6 0x0805b3bd in nm_device_wireless_scan (user_data=0x82486c8) at NetworkManagerDevice.c:3792
#7 0x006c5368 in g_main_context_wakeup () from /usr/lib/libglib-2.0.so.0
#8 0x006c27bb in g_main_context_dispatch () from /usr/lib/libglib-2.0.so.0
#9 0x006c4242 in g_main_context_acquire () from /usr/lib/libglib-2.0.so.0
#10 0x006c44ef in g_main_loop_run () from /usr/lib/libglib-2.0.so.0
#11 0x080574f9 in nm_device_worker (user_data=0x8249c98) at NetworkManagerDevice.c:507
#12 0x006dc8d2 in g_static_private_free () from /usr/lib/libglib-2.0.so.0
#13 0x00690341 in start_thread () from /lib/tls/libpthread.so.0
#14 0x0050ffee in clone () from /lib/tls/libc.so.6
---Type <return> to continue, or q <return> to quit---

Thread 2 (Thread -1208894544 (LWP 10127)):
#0 dhcp_init (iface=0x824e0a0) at client.c:767
#1 0x080549df in nm_device_dhcp_request (dev=0x8247d30) at NetworkManagerDHCP.c:233
#2 0x080585af in nm_device_activation_configure_ip (dev=0x8247d30, do_only_autoip=0) at NetworkManagerDevice.c:2611
#3 0x08059e8b in nm_device_activate (user_data=0x8247d30) at NetworkManagerDevice.c:2689
#4 0x006c5b5a in g_child_watch_add () from /usr/lib/libglib-2.0.so.0
#5 0x006c27bb in g_main_context_dispatch () from /usr/lib/libglib-2.0.so.0
#6 0x006c4242 in g_main_context_acquire () from /usr/lib/libglib-2.0.so.0
#7 0x006c44ef in g_main_loop_run () from /usr/lib/libglib-2.0.so.0
#8 0x080574f9 in nm_device_worker (user_data=0x8247d30) at NetworkManagerDevice.c:507
#9 0x006dc8d2 in g_static_private_free () from /usr/lib/libglib-2.0.so.0
#10 0x00690341 in start_thread () from /lib/tls/libpthread.so.0
#11 0x0050ffee in clone () from /lib/tls/libc.so.6

Thread 1 (Thread -1208891712 (LWP 10086)):
#0 0x0042f7a2 in _dl_sysinfo_int80 () from /lib/ld-linux.so.2
#1 0x006953fb in __read_nocancel () from /lib/tls/libpthread.so.0
#2 0x006e765e in g_spawn_error_quark () from /usr/lib/libglib-2.0.so.0
---Type <return> to continue, or q <return> to quit---
#3 0x006e7fc2 in g_spawn_error_quark () from /usr/lib/libglib-2.0.so.0
#4 0x006e87a4 in g_spawn_sync () from /usr/lib/libglib-2.0.so.0
#5 0x0805f4e3 in nm_spawn_process (args=0x825a518 "/sbin/ip route flush dev ath0") at NetworkManagerUtils.c:391
#6 0x08061406 in nm_system_device_flush_routes (dev=0x825a518) at backends/NetworkManagerRedHat.c:66
#7 0x0805d844 in nm_link_state_monitor (user_data=0x8243670) at NetworkManager.c:490
#8 0x006c5368 in g_main_context_wakeup () from /usr/lib/libglib-2.0.so.0
#9 0x006c27bb in g_main_context_dispatch () from /usr/lib/libglib-2.0.so.0
#10 0x006c4242 in g_main_context_acquire () from /usr/lib/libglib-2.0.so.0
#11 0x006c44ef in g_main_loop_run () from /usr/lib/libglib-2.0.so.0
#12 0x0805e101 in main (argc=2, argv=0xbfe76b04) at NetworkManager.c:803
#13 0x0045de23 in __libc_start_main () from /lib/tls/libc.so.6
#14 0x0804b891 in _start ()
767 memcpy (&(iface->siaddr), iface->dhcp_options.val[dhcpServerIdentifier], 4);
(gdb) quit
The program is running. Exit anyway? (y or n) y