Re: CVS (STABLE_0_3) works for Madwifi/Atheros, but starts too late

2005-05-13 Thread Derek Atkins
Quoting Dan Williams [EMAIL PROTECTED]:

> In GDB, when you get here, can you do:
>
> print dhcp_resp
> print dhcp_resp.dhcp_msg
> print dhcp_resp.dhcp_msg.options
>
> And send along the results?  That should give me the actual packet the DHCP
> server passed back.

It looks like the DHCP client _thinks_ that it got data, but really it
didn't.  It also feels to me like a race condition, because it is much
harder to reproduce since I installed the debuginfo package (whereas it
happened really quickly without it).

-derek

NetworkManager: DHCP: Got some data of length 66.
NetworkManager: DHCP: Reply message was not UDP (ip_hdr->protocol = 6,
IPPROTO_UDP = 17), won't use it.
NetworkManager: DHCP: Got some data of length 52.
NetworkManager: DHCP: Reply message was not UDP (ip_hdr->protocol = 6,
IPPROTO_UDP = 17), won't use it.
NetworkManager: DHCP: Got some data of length 58.
NetworkManager: DHCP: Reply message was not UDP (ip_hdr->protocol = 6,
IPPROTO_UDP = 17), won't use it.
NetworkManager: DHCP: Got some data of length 52.
NetworkManager: DHCP: Reply message was not UDP (ip_hdr->protocol = 6,
IPPROTO_UDP = 17), won't use it.
NetworkManager: DHCP: Got some data of length 72.
NetworkManager: DHCP: Reply message was not UDP (ip_hdr->protocol = 6,
IPPROTO_UDP = 17), won't use it.
NetworkManager: DHCP: Got some data of length 52.
NetworkManager: DHCP: Reply message was not UDP (ip_hdr->protocol = 6,
IPPROTO_UDP = 17), won't use it.
NetworkManager: DHCP: Got some data of length 72.
NetworkManager: DHCP: Reply message was not UDP (ip_hdr->protocol = 6,
IPPROTO_UDP = 17), won't use it.
NetworkManager: DHCP: Got some data of length 52.
NetworkManager: DHCP: Reply message was not UDP (ip_hdr->protocol = 6,
IPPROTO_UDP = 17), won't use it.
NetworkManager: DHCP: Got some data of length 72.
NetworkManager: DHCP: Reply message was not UDP (ip_hdr->protocol = 6,
IPPROTO_UDP = 17), won't use it.
NetworkManager: DHCP: Got some data of length 52.
NetworkManager: DHCP: Reply message was not UDP (ip_hdr->protocol = 6,
IPPROTO_UDP = 17), won't use it.
NetworkManager: DHCP: Got some data of length 89.
NetworkManager: DHCP: Reply message was not UDP (ip_hdr->protocol = 1,
IPPROTO_UDP = 17), won't use it.
Detaching after fork from child process 6310.
---Type <return> to continue, or q <return> to quit---
Detaching after fork from child process 6311.
NetworkManager: DHCP: Got some data of length 212.
NetworkManager: DHCP: Reply message's source port (138) was not the DHCP server
port number (67), won't use it.

Program received signal SIGSEGV, Segmentation fault.
[Switching to Thread -1208362064 (LWP 5785)]
dhcp_init (iface=0x81c74c0) at client.c:767
767 memcpy (&(iface->siaddr),
iface->dhcp_options.val[dhcpServerIdentifier], 4);
(gdb) print dhcp_resp
$1 = {server_ip_addr = 0, server_hw_addr = "\000\000\000\000\000", dhcp_msg = {
op = 0 '\0', htype = 0 '\0', hlen = 0 '\0', hops = 0 '\0', xid = 0,
secs = 0, flags = 0, ciaddr = 0, yiaddr = 0, siaddr = 0, giaddr = 0,
chaddr = '\0' repeats 15 times, sname = '\0' repeats 63 times,
file = '\0' repeats 127 times, options = '\0' repeats 311 times}}
(gdb) print dhcp_resp.dhcp_msg
$2 = {op = 0 '\0', htype = 0 '\0', hlen = 0 '\0', hops = 0 '\0', xid = 0,
  secs = 0, flags = 0, ciaddr = 0, yiaddr = 0, siaddr = 0, giaddr = 0,
  chaddr = '\0' repeats 15 times, sname = '\0' repeats 63 times,
  file = '\0' repeats 127 times, options = '\0' repeats 311 times}
(gdb) print dhcp_resp.dhcp_msg.options
$3 = '\0' repeats 311 times
(gdb) quit

-- 
   Derek Atkins, SB '93 MIT EE, SM '95 MIT Media Laboratory
   Member, MIT Student Information Processing Board  (SIPB)
   URL: http://web.mit.edu/warlord/PP-ASEL-IA N1NWH
   [EMAIL PROTECTED]PGP key available

___
NetworkManager-list mailing list
NetworkManager-list@gnome.org
http://mail.gnome.org/mailman/listinfo/networkmanager-list


Re: CVS (STABLE_0_3) works for Madwifi/Atheros, but starts too late

2005-05-13 Thread Paul Dugas
On Fri, May 13, 2005 12:33 pm, Derek Atkins said:
> NetworkManager: DHCP: Got some data of length 66.
> NetworkManager: DHCP: Reply message was not UDP (ip_hdr->protocol = 6,
> IPPROTO_UDP = 17), won't use it.
> NetworkManager: DHCP: Got some data of length 52.
> NetworkManager: DHCP: Reply message was not UDP (ip_hdr->protocol = 6,
> IPPROTO_UDP = 17), won't use it.
...

Used to get this kind of garbage when I was having trouble with WEP.  Just
a thought.  Is it possible that the cause is not actually garbage from the
DHCP but just off the air noise?

-- 
Paul Dugas, Computer Engineer   Dugas Enterprises, LLC
[EMAIL PROTECTED] phone: 404-932-1355   522 Black Canyon Park
http://dugas.cc fax: 866-751-6494   Canton, GA 30114 USA


Re: CVS (STABLE_0_3) works for Madwifi/Atheros, but starts too late

2005-05-13 Thread Derek Atkins
Quoting Paul Dugas [EMAIL PROTECTED]:

> On Fri, May 13, 2005 12:33 pm, Derek Atkins said:
> > NetworkManager: DHCP: Got some data of length 66.
> > NetworkManager: DHCP: Reply message was not UDP (ip_hdr->protocol = 6,
> > IPPROTO_UDP = 17), won't use it.
> > NetworkManager: DHCP: Got some data of length 52.
> > NetworkManager: DHCP: Reply message was not UDP (ip_hdr->protocol = 6,
> > IPPROTO_UDP = 17), won't use it.
> ...
>
> Used to get this kind of garbage when I was having trouble with WEP.  Just
> a thought.  Is it possible that the cause is not actually garbage from the
> DHCP but just off the air noise?

Unlikely... I'm getting this when the ethernet is plugged in, so it should
only be listening to the 802.3, not 802.11.  However, there is definitely a
correlation between these error messages and the crash.  The crashes always
follow a long string of these (although just seeing these errors does not
necessarily mean a crash is imminent).

-derek



Re: CVS (STABLE_0_3) works for Madwifi/Atheros, but starts too late

2005-05-13 Thread Derek Atkins
Bill Moss [EMAIL PROTECTED] writes:

> Should not
>
> --type=method-call
>
> be
>
> --type=method_call

Indeed, this is true.  Thanks.

-derek


Re: CVS (STABLE_0_3) works for Madwifi/Atheros, but starts too late

2005-05-12 Thread Derek Atkins
Quoting Dan Williams [EMAIL PROTECTED]:

> NM is likely to be very unhappy if you just suspend it, and wireless
> drivers in general have issues with suspend.  There really is no method
> for NM to know when you've put the computer to sleep since nothing sends
> global events out (PowerManager+HAL will eventually do this the Right
> Way using dbus).

I already fixed the device issue in my acpi scripts..  I ifdown the interface
manually and then rmmod all the device modules (I have no idea why modprobe
-r doesn't DWIW).  This seems to work fine as far as the module itself is
concerned.

> NM does have a sleep dbus method that will make NM stop and down all
> devices, and a wake method to wake it up.  Those should be able to be
> done in the ACPI scripts fairly simply:

Ooh, this is useful to know.  Is this documented somewhere?  I'll try this and
see if it works.  Thanks!

> /usr/bin/dbus-send --system --dest=org.freedesktop.NetworkManager
> --type=method-call /org/freedesktop/NetworkManager
> org.freedesktop.NetworkManager.sleep
> sleep 5;
>
> and:
>
> /usr/bin/dbus-send --system --dest=org.freedesktop.NetworkManager
> --type=method-call /org/freedesktop/NetworkManager
> org.freedesktop.NetworkManager.wake
>
> Dan
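
A minimal sketch of how the two calls quoted above might be wrapped for ACPI
suspend and resume scripts.  It uses --type=method_call, the spelling Bill
Moss points out elsewhere in this thread, rather than method-call.  The
function names and the DBUS_SEND and NM_SETTLE_SECS variables are assumptions
made for illustration; they are not part of NetworkManager.

```shell
#!/bin/sh
# Sketch only: wrappers around the NetworkManager sleep/wake D-Bus methods.
# DBUS_SEND and NM_SETTLE_SECS are hypothetical overrides for this example.
DBUS_SEND="${DBUS_SEND:-/usr/bin/dbus-send}"

nm_call() {
    # $1 is the NetworkManager method to invoke: "sleep" or "wake".
    $DBUS_SEND --system --dest=org.freedesktop.NetworkManager \
        --type=method_call /org/freedesktop/NetworkManager \
        "org.freedesktop.NetworkManager.$1"
}

nm_suspend() {
    # Ask NM to stop and down all devices, then give it a few seconds
    # (5 by default, as in the quoted commands) before the machine sleeps.
    nm_call sleep && sleep "${NM_SETTLE_SECS:-5}"
}

nm_resume() {
    # Ask NM to wake up and bring devices back.
    nm_call wake
}
```

A suspend script would call nm_suspend before unloading the driver modules,
and the resume script would call nm_resume after reloading them.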

-derek



Re: CVS (STABLE_0_3) works for Madwifi/Atheros, but starts too late

2005-05-12 Thread Dan Williams
On Thu, 12 May 2005, Derek Atkins wrote:
> Quoting Bill Moss [EMAIL PROTECTED]:
> > Starting dbus, hal, and NM early will work in the wired case because a
> > connection is established before nm-applet is run so ntpd will find a
> > connection when it runs later, but this won't work for wireless because
> > a connection will not be made until nm-applet runs after all the init
> > scripts have run. Am I missing something?
>
> I guess that was my question.  Does NM need the applet in order to select the
> wireless network (and WEP key)?  Or will NM have access to the list of
> preferred networks (and WEP keys) before I login?  I would certainly prefer if
> the list were global instead of per-user.

No, the wireless networks are per-user and until you log in to start GConf,
they are not available to NetworkManager.

> > If ntpd doesn't find a connection when it is run, it becomes brain dead
> > and will not work until restarted. Depending on the order of init
> > scripts is problematic. Network dependent services need some way to be
> > told that a link is available or some way to be started or restarted
> > when a link becomes available.
>
> Yep, this is indeed my problem.  Currently (well, before I got NM working at
> all) I was logging in as root, setting up my network, re-starting my services
> (like ntpd) that failed during bootup, then logging out and logging in as me.
> I would prefer if NM could do all this during the boot sequence, even with
> wireless.

You might want to investigate using NetworkManagerDispatcher.  You can drop a
script into /etc/NetworkManager.d that will be executed by
NetworkManagerDispatcher when connections go up or down.  Then you can kick
ntpd manually.
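
A rough sketch of what such a script could look like.  The argument
convention (interface name as $1, "up" or "down" as $2) and the default
restart command are assumptions, so check what NetworkManagerDispatcher in
your version actually passes before relying on this.  NTPD_RESTART_CMD and
handle_event are hypothetical names.

```shell
#!/bin/sh
# Hypothetical dispatcher hook: restart ntpd whenever a connection comes up.
# Assumption: the dispatcher passes the interface as $1 and the event
# ("up" or "down") as $2.

# Command used to restart ntpd.  Overridable; the default is a guess for
# Red Hat style systems.
NTPD_RESTART_CMD="${NTPD_RESTART_CMD:-/sbin/service ntpd restart}"

handle_event() {
    action="$2"
    case "$action" in
        up)
            # A link is now available, so ntpd can reach its servers.
            $NTPD_RESTART_CMD
            ;;
        *)
            # "down" or anything unrecognised: leave ntpd alone.
            :
            ;;
    esac
}

handle_event "$@"
```

Keeping the restart command in a variable makes it easy to try the script by
hand with a harmless command before installing it.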

> > I am running Head 1.361 under FC4 Test 3 on a T42. All seems very stable
> > including vpnc support.
>
> I haven't tried vpnc support.  However I do have a problem with vpnc and nscd
> and selinux:
>
> audit(1115923917.382:0): avc:  denied  { read write } for  pid=3378
> exe=/usr/sbin/nscd path=socket:[687155] dev=sockfs ino=687155
> scontext=root:system_r:nscd_t tcontext=root:system_r:unconfined_t
> tclass=udp_socket
> audit(1115923917.382:0): avc:  denied  { read write } for  pid=3378
> exe=/usr/sbin/nscd path=/dev/net/tun dev=tmpfs ino=2009
> scontext=root:system_r:nscd_t tcontext=system_u:object_r:tun_tap_device_t
> tclass=chr_file

I've noticed a few of these too.  I've spoken with Dan Walsh (an SELinux guy at
Red Hat) and he's aware that we need more finely tuned policy for
NetworkManager.

Dan


Re: CVS (STABLE_0_3) works for Madwifi/Atheros, but starts too late

2005-05-12 Thread Dan Williams
On Thu, 12 May 2005, Derek Atkins wrote:
> [snip]
> May 12 20:14:21 cliodev NetworkManager: DHCP: Got some data of length 1300.
> May 12 20:14:21 cliodev NetworkManager: DHCP: Reply message was not UDP
> (ip_hdr->protocol = 6, IPPROTO_UDP = 17), won't use it.
> May 12 20:15:30 cliodev NetworkManager: starting...
> May 12 20:15:30 cliodev NetworkManager: NetworkManager startup succeeded
> May 12 20:15:30 cliodev NetworkManager: eth0: Driver support level for 'e1000'
> is fully-supported

Can you do this?

As root:
gdb /usr/bin/NetworkManager
r --no-daemon
do stuff until it segfaults
t a a bt
mail me the result
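
With a newer gdb the same steps can probably be scripted.  A sketch, assuming
a gdb that supports -batch, -ex and --args (the gdb shipped with older
distributions may not); the function name and the GDB and NM_BIN overrides
are made up for illustration.

```shell
#!/bin/sh
# Sketch: run NetworkManager under gdb and dump every thread's backtrace
# once the program stops.  GDB and NM_BIN are hypothetical overrides.
GDB="${GDB:-gdb}"
NM_BIN="${NM_BIN:-/usr/bin/NetworkManager}"

nm_backtrace() {
    # "run" starts the program with the arguments given after --args.
    # When it stops (for example on SIGSEGV), "thread apply all bt" runs;
    # that is the long form of "t a a bt".
    $GDB -batch -ex "run" -ex "thread apply all bt" \
        --args "$NM_BIN" --no-daemon
}
```

Run as root, something like "nm_backtrace > nm-backtrace.txt 2>&1" would
capture the output in a file that can be mailed.  Batch mode also turns off
gdb's pager, so the output is not interrupted by continue prompts.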

Thanks!
Dan


Re: CVS (STABLE_0_3) works for Madwifi/Atheros, but starts too late

2005-05-12 Thread Derek Atkins
Quoting Dan Williams [EMAIL PROTECTED]:

> Can you do this?

Of course...

> As root:
> gdb /usr/bin/NetworkManager
> r --no-daemon
> do stuff until it segfaults
> t a a bt
> mail me the result
>
> Thanks!
> Dan

You're welcome.  Here's the backtrace.  (It took me about 2 minutes to do this
the first time, but then I realized I hadn't installed the debuginfo package --
then it took me another 7 to reproduce it again).

Let me know if you need anything else.

-derek

Detaching after fork from child process 10357.
NetworkManager: DHCP: Got some data of length 71.
NetworkManager: DHCP: Reply message was not UDP (ip_hdr->protocol = 6,
IPPROTO_UDP = 17), won't use it.

Program received signal SIGSEGV, Segmentation fault.
[Switching to Thread -1208894544 (LWP 10127)]
dhcp_init (iface=0x824e0a0) at client.c:767
767 memcpy (&(iface->siaddr),
iface->dhcp_options.val[dhcpServerIdentifier], 4);
(gdb) t a a bt

Thread 3 (Thread -1219384400 (LWP 10130)):
#0  0x0042f7a2 in _dl_sysinfo_int80 () from /lib/ld-linux.so.2
#1  0x004d31a6 in __nanosleep_nocancel () from /lib/tls/libc.so.6
#2  0x0050931a in usleep () from /lib/tls/libc.so.6
#3  0x00105b60 in iw_scan () from /lib/libiw.so.27
#4  0x0805c07a in nm_completion_scan_has_results (tries=0, args=0xfdfc)
at NetworkManagerDevice.c:3674
#5  0x0805fc7b in nm_v_wait_for_completion_or_timeout (max_tries=400,
max_time=0x0, interval_usecs=20,
test_func=0x805c012 <nm_completion_scan_has_results>, action_func=0,
args=0xb751a230) at NetworkManagerUtils.c:671
#6  0x0805b3bd in nm_device_wireless_scan (user_data=0x82486c8)
at NetworkManagerDevice.c:3792
#7  0x006c5368 in g_main_context_wakeup () from /usr/lib/libglib-2.0.so.0
#8  0x006c27bb in g_main_context_dispatch () from /usr/lib/libglib-2.0.so.0
#9  0x006c4242 in g_main_context_acquire () from /usr/lib/libglib-2.0.so.0
#10 0x006c44ef in g_main_loop_run () from /usr/lib/libglib-2.0.so.0
#11 0x080574f9 in nm_device_worker (user_data=0x8249c98)
at NetworkManagerDevice.c:507
#12 0x006dc8d2 in g_static_private_free () from /usr/lib/libglib-2.0.so.0
#13 0x00690341 in start_thread () from /lib/tls/libpthread.so.0
#14 0x0050ffee in clone () from /lib/tls/libc.so.6
---Type <return> to continue, or q <return> to quit---

Thread 2 (Thread -1208894544 (LWP 10127)):
#0  dhcp_init (iface=0x824e0a0) at client.c:767
#1  0x080549df in nm_device_dhcp_request (dev=0x8247d30)
at NetworkManagerDHCP.c:233
#2  0x080585af in nm_device_activation_configure_ip (dev=0x8247d30,
do_only_autoip=0) at NetworkManagerDevice.c:2611
#3  0x08059e8b in nm_device_activate (user_data=0x8247d30)
at NetworkManagerDevice.c:2689
#4  0x006c5b5a in g_child_watch_add () from /usr/lib/libglib-2.0.so.0
#5  0x006c27bb in g_main_context_dispatch () from /usr/lib/libglib-2.0.so.0
#6  0x006c4242 in g_main_context_acquire () from /usr/lib/libglib-2.0.so.0
#7  0x006c44ef in g_main_loop_run () from /usr/lib/libglib-2.0.so.0
#8  0x080574f9 in nm_device_worker (user_data=0x8247d30)
at NetworkManagerDevice.c:507
#9  0x006dc8d2 in g_static_private_free () from /usr/lib/libglib-2.0.so.0
#10 0x00690341 in start_thread () from /lib/tls/libpthread.so.0
#11 0x0050ffee in clone () from /lib/tls/libc.so.6

Thread 1 (Thread -1208891712 (LWP 10086)):
#0  0x0042f7a2 in _dl_sysinfo_int80 () from /lib/ld-linux.so.2
#1  0x006953fb in __read_nocancel () from /lib/tls/libpthread.so.0
#2  0x006e765e in g_spawn_error_quark () from /usr/lib/libglib-2.0.so.0
---Type <return> to continue, or q <return> to quit---
#3  0x006e7fc2 in g_spawn_error_quark () from /usr/lib/libglib-2.0.so.0
#4  0x006e87a4 in g_spawn_sync () from /usr/lib/libglib-2.0.so.0
#5  0x0805f4e3 in nm_spawn_process (
args=0x825a518 "/sbin/ip route flush dev ath0")
at NetworkManagerUtils.c:391
#6  0x08061406 in nm_system_device_flush_routes (dev=0x825a518)
at backends/NetworkManagerRedHat.c:66
#7  0x0805d844 in nm_link_state_monitor (user_data=0x8243670)
at NetworkManager.c:490
#8  0x006c5368 in g_main_context_wakeup () from /usr/lib/libglib-2.0.so.0
#9  0x006c27bb in g_main_context_dispatch () from /usr/lib/libglib-2.0.so.0
#10 0x006c4242 in g_main_context_acquire () from /usr/lib/libglib-2.0.so.0
#11 0x006c44ef in g_main_loop_run () from /usr/lib/libglib-2.0.so.0
#12 0x0805e101 in main (argc=2, argv=0xbfe76b04) at NetworkManager.c:803
#13 0x0045de23 in __libc_start_main () from /lib/tls/libc.so.6
#14 0x0804b891 in _start ()
767 memcpy (&(iface->siaddr),
iface->dhcp_options.val[dhcpServerIdentifier], 4);
(gdb) quit
The program is running.  Exit anyway? (y or n) y

