date:20060923

This patchset is the resubmit of the Ethernet over IPv4 tunnel driver
for Linux.  I want to thank all reviewers for their annotations and
helpfull input.  This version contains some major changes to the driver.
It uses an own device type now (ARPHRD_ETHERIP). This fixes the problem
that EtherIP devices could not be safely differenced from Ethernet
devices. This change also required some other changes. First a second
patch to the bridge code is included to allow the use of EtherIP devices
in a bridge.  The third patch includes the necessary changes to iproute2
(support of the new ARPHRD and general tunnel configuration support for
 EtherIP).

Signed-off-by: Joerg Roedel [EMAIL PROTECTED]
-
To unsubscribe from this list: send the line unsubscribe netdev in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Re: [PATCH 00/03][RESUBMIT] net: EtherIP tunnel driver

2006-09-23 Thread Jan-Benedict Glaw

On Sat, 2006-09-23 14:07:04 +0200, Joerg Roedel [EMAIL PROTECTED] wrote:
 This patchset is the resubmit of the Ethernet over IPv4 tunnel driver
 for Linux.  I want to thank all reviewers for their annotations and
 helpfull input.  This version contains some major changes to the driver.
 It uses an own device type now (ARPHRD_ETHERIP). This fixes the problem
 that EtherIP devices could not be safely differenced from Ethernet
 devices. This change also required some other changes. First a second
 patch to the bridge code is included to allow the use of EtherIP devices
 in a bridge.  The third patch includes the necessary changes to iproute2
 (support of the new ARPHRD and general tunnel configuration support for
  EtherIP).

I haven't seen the first submission, but is this driver really needed?
Can't this be done with creating two tap interfaces on both endpoints
and bridge them with a local ethernet device using userland software?

MfG, JBG

-- 
  Jan-Benedict Glaw  [EMAIL PROTECTED]  +49-172-7608481
Signature of: Alles wird gut! ...und heute wirds schon ein bißchen 
besser.
the second  :


signature.asc
Description: Digital signature

[PATCH 01/03] net: EtherIP driver, header and MAINTAINERS changes

This patch contains the reworked EtherIP driver, the necessary header
updates and adds an entry for EtherIP to the MAINTAINERS file.

Signed-off-by: Joerg Roedel [EMAIL PROTECTED]
diff -uprN -X linux-2.6.18-vanilla/Documentation/dontdiff 
linux-2.6.18-vanilla/include/linux/if_arp.h linux-2.6.18/include/linux/if_arp.h
--- linux-2.6.18-vanilla/include/linux/if_arp.h 2006-09-20 05:42:06.0 
+0200
+++ linux-2.6.18/include/linux/if_arp.h 2006-09-23 12:50:05.0 +0200
@@ -85,6 +85,7 @@
 #define ARPHRD_IEEE80211 801   /* IEEE 802.11  */
 #define ARPHRD_IEEE80211_PRISM 802 /* IEEE 802.11 + Prism2 header  */
 #define ARPHRD_IEEE80211_RADIOTAP 803  /* IEEE 802.11 + radiotap header */
+#define ARPHRD_ETHERIP  804/* Ethernet over IPv4  tunnel   */
 
 #define ARPHRD_VOID  0x/* Void type, nothing is known */
 #define ARPHRD_NONE  0xFFFE/* zero header length */
diff -uprN -X linux-2.6.18-vanilla/Documentation/dontdiff 
linux-2.6.18-vanilla/include/linux/in.h linux-2.6.18/include/linux/in.h
--- linux-2.6.18-vanilla/include/linux/in.h 2006-09-20 05:42:06.0 
+0200
+++ linux-2.6.18/include/linux/in.h 2006-09-20 22:52:30.0 +0200
@@ -40,6 +40,7 @@ enum {
 
   IPPROTO_ESP = 50,/* Encapsulation Security Payload protocol */
   IPPROTO_AH = 51, /* Authentication Header protocol   */
+  IPPROTO_ETHERIP = 97,/* Ethernet over IPv4 protocol */
   IPPROTO_PIM= 103,/* Protocol Independent Multicast   
*/
 
   IPPROTO_COMP   = 108,/* Compression Header protocol */
diff -uprN -X linux-2.6.18-vanilla/Documentation/dontdiff 
linux-2.6.18-vanilla/net/ipv4/etherip.c linux-2.6.18/net/ipv4/etherip.c
--- linux-2.6.18-vanilla/net/ipv4/etherip.c 1970-01-01 01:00:00.0 
+0100
+++ linux-2.6.18/net/ipv4/etherip.c 2006-09-23 12:52:38.0 +0200
@@ -0,0 +1,542 @@
+/*
+ * etherip.c: Ethernet over IPv4 tunnel driver (according to RFC3378)
+ *
+ * This driver could be used to tunnel Ethernet packets through IPv4
+ * networks. This is especially usefull together with the bridging
+ * code in Linux.
+ *
+ * This code was written with an eye on the IPIP driver in linux from
+ * Sam Lantinga. Thanks for the great work.
+ *
+ *  This program is free software; you can redistribute it and/or
+ *  modify it under the terms of the GNU General Public License
+ *  version 2 (no later version) as published by the
+ *  Free Software Foundation.
+ *
+ */
+
+#include linux/capability.h   
+#include linux/init.h
+#include linux/module.h
+#include linux/kernel.h
+#include linux/types.h
+#include linux/mutex.h
+#include linux/netdevice.h
+#include linux/etherdevice.h
+#include linux/skbuff.h
+#include linux/ip.h
+#include linux/if_tunnel.h
+#include linux/if_arp.h
+#include linux/list.h
+#include linux/string.h
+#include linux/netfilter_ipv4.h
+#include net/ip.h
+#include net/protocol.h
+#include net/route.h
+#include net/ipip.h
+#include net/xfrm.h
+#include net/inet_ecn.h
+
+MODULE_LICENSE(GPL);
+MODULE_AUTHOR(Joerg Roedel [EMAIL PROTECTED]);
+MODULE_DESCRIPTION(Ethernet over IPv4 tunnel driver);
+
+/*
+ * These 2 defines are taken from ipip.c - if it's good enough for them
+ * it's good enough for me.
+ */
+#define HASH_SIZE16
+#define HASH(addr)   ((addr^(addr4))0xF)
+
+#define ETHERIP_HEADER   ((u16)0x0300)
+#define ETHERIP_HLEN 2
+
+#define BANNER1 etherip: Ethernet over IPv4 tunneling driver\n
+
+struct etherip_tunnel {
+   struct list_head list;
+   struct net_device *dev;
+   struct net_device_stats stats;
+   struct ip_tunnel_parm parms;
+   unsigned int recursion;
+};
+
+static struct net_device *etherip_tunnel_dev;
+static struct list_head tunnels[HASH_SIZE];
+
+static DEFINE_RWLOCK(etherip_lock);
+
+static void etherip_tunnel_setup(struct net_device *dev);
+
+/* add a tunnel to the hash */
+static void etherip_tunnel_add(struct etherip_tunnel *tun)
+{
+   unsigned h = HASH(tun-parms.iph.daddr);
+   list_add_tail(tun-list, tunnels[h]);
+}
+
+/* delete a tunnel from the hash*/
+static void etherip_tunnel_del(struct etherip_tunnel *tun)
+{
+   list_del(tun-list);
+}
+
+/* find a tunnel in the hash by parameters from userspace */
+static struct etherip_tunnel* etherip_tunnel_find(struct ip_tunnel_parm *p)
+{
+   struct etherip_tunnel *ret;
+   unsigned h = HASH(p-iph.daddr);
+
+   list_for_each_entry(ret, tunnels[h], list)
+   if (ret-parms.iph.daddr == p-iph.daddr)
+   return ret;
+
+   return NULL;
+}
+
+/* find a tunnel by its destination address */
+static struct etherip_tunnel* etherip_tunnel_locate(u32 remote)
+{
+   struct etherip_tunnel *ret;
+   unsigned h = HASH(remote);
+
+   list_for_each_entry(ret, tunnels[h], list)
+   if (ret-parms.iph.daddr == remote)
+   return

[PATCH 02/03] net/bridge: add support for EtherIP devices

This patch changes the device check in the bridge code to allow EtherIP
devices to be added.

Signed-off-by: Joerg Roedel [EMAIL PROTECTED]
diff -uprN -X linux-2.6.18-vanilla/Documentation/dontdiff 
linux-2.6.18-vanilla/net/bridge/br_if.c linux-2.6.18/net/bridge/br_if.c
--- linux-2.6.18-vanilla/net/bridge/br_if.c 2006-09-20 05:42:06.0 
+0200
+++ linux-2.6.18/net/bridge/br_if.c 2006-09-20 23:03:26.0 +0200
@@ -407,7 +407,8 @@ int br_add_if(struct net_bridge *br, str
struct net_bridge_port *p;
int err = 0;
 
-   if (dev-flags  IFF_LOOPBACK || dev-type != ARPHRD_ETHER)
+   if (dev-flags  IFF_LOOPBACK ||
+   dev-type != ARPHRD_ETHER  dev-type != ARPHRD_ETHERIP)
return -EINVAL;
 
if (dev-hard_start_xmit == br_dev_xmit)

Re: [PATCH 03/03][IPROUTE2] EtherIP tunnel and device support for iproute2

This patch adds support for EtherIP tunnels and devices to the iproute2
userspace software package.

Signed-off-by: Joerg Roedel [EMAIL PROTECTED]
diff -urp iproute2-2.6.16-060323.orig/ip/iptunnel.c 
iproute2-2.6.16-060323/ip/iptunnel.c
--- iproute2-2.6.16-060323.orig/ip/iptunnel.c   2005-02-10 19:31:18.0 
+0100
+++ iproute2-2.6.16-060323/ip/iptunnel.c2006-09-20 22:35:30.0 
+0200
@@ -44,7 +44,7 @@ static void usage(void) __attribute__((n
 static void usage(void)
 {
fprintf(stderr, Usage: ip tunnel { add | change | del | show } [ NAME 
]\n);
-   fprintf(stderr,   [ mode { ipip | gre | sit } ] [ remote ADDR 
] [ local ADDR ]\n);
+   fprintf(stderr,   [ mode { ipip | gre | sit | etherip } ] [ 
remote ADDR ] [ local ADDR ]\n);
fprintf(stderr,   [ [i|o]seq ] [ [i|o]key KEY ] [ [i|o]csum 
]\n);
fprintf(stderr,   [ ttl TTL ] [ tos TOS ] [ [no]pmtudisc ] [ 
dev PHYS_DEV ]\n);
fprintf(stderr, \n);
@@ -202,6 +202,12 @@ static int parse_args(int argc, char **a
exit(-1);
}
p-iph.protocol = IPPROTO_IPV6;
+   } else if (strcmp(*argv, etherip) == 0) {
+   if (p-iph.protocol  p-iph.protocol != 
IPPROTO_ETHERIP) {
+   fprintf(stderr,You managed to ask for 
more than one tunnel mode.\n);
+   exit(-1);
+   }
+   p-iph.protocol = IPPROTO_ETHERIP;
} else {
fprintf(stderr,Cannot guess tunnel mode.\n);
exit(-1);
@@ -324,11 +330,15 @@ static int parse_args(int argc, char **a
p-iph.protocol = IPPROTO_IPIP;
else if (memcmp(p-name, sit, 3) == 0)
p-iph.protocol = IPPROTO_IPV6;
+   else if (memcmp(p-name, ethip, 5) == 0)
+   p-iph.protocol = IPPROTO_ETHERIP;
}
 
-   if (p-iph.protocol == IPPROTO_IPIP || p-iph.protocol == IPPROTO_IPV6) 
{
+   if (p-iph.protocol == IPPROTO_IPIP || 
+   p-iph.protocol == IPPROTO_IPV6 ||
+   p-iph.protocol == IPPROTO_ETHERIP) {
if ((p-i_flags  GRE_KEY) || (p-o_flags  GRE_KEY)) {
-   fprintf(stderr, Keys are not allowed with ipip and 
sit.\n);
+   fprintf(stderr, Keys are not allowed with ipip, sit or 
etherip.\n);
return -1;
}
}
@@ -351,6 +361,21 @@ static int parse_args(int argc, char **a
fprintf(stderr, Broadcast tunnel requires a source 
address.\n);
return -1;
}
+
+   if (p-iph.protocol == IPPROTO_ETHERIP) {
+   if ((cmd == SIOCADDTUNNEL || cmd == SIOCCHGTUNNEL)  
!p-iph.daddr) {
+   fprintf(stderr, EtherIP tunnel requires a 
+   destination address.\n);
+   return -1;
+   }
+
+   /*
+   if (cmd != SIOCDELTUNNEL  p-iph.frag_off  htons(IP_DF)) {
+   fprintf(stderr, Warning: [no]pmtudisc is ignored on
+EtherIP tunnels\n);
+   }
+   */
+   }
return 0;
 }
 
@@ -374,6 +399,8 @@ static int do_add(int cmd, int argc, cha
return do_add_ioctl(cmd, gre0, p);
case IPPROTO_IPV6:
return do_add_ioctl(cmd, sit0, p);
+   case IPPROTO_ETHERIP:
+   return do_add_ioctl(cmd, ethip0, p);
default:
fprintf(stderr, cannot determine tunnel mode (ipip, gre or 
sit)\n);
return -1;
@@ -395,6 +422,8 @@ int do_del(int argc, char **argv)
return do_del_ioctl(gre0, p);
case IPPROTO_IPV6:
return do_del_ioctl(sit0, p);
+   case IPPROTO_ETHERIP:
+   return do_del_ioctl(ethip0, p);
default:
return do_del_ioctl(p.name, p);
}
@@ -418,7 +447,8 @@ void print_tunnel(struct ip_tunnel_parm 
   p-name,
   p-iph.protocol == IPPROTO_IPIP ? ip :
   (p-iph.protocol == IPPROTO_GRE ? gre :
-   (p-iph.protocol == IPPROTO_IPV6 ? ipv6 : unknown)),
+  (p-iph.protocol == IPPROTO_ETHERIP ? etherip :
+   (p-iph.protocol == IPPROTO_IPV6 ? ipv6 : unknown))),
   p-iph.daddr ? format_host(AF_INET, 4, p-iph.daddr, s1, 
sizeof(s1))  : any,
   p-iph.saddr ? rt_addr_n2a(AF_INET, 4, p-iph.saddr, s2, 
sizeof(s2)) : any);
 
@@ -431,19 +461,19 @@ void print_tunnel(struct ip_tunnel_parm 
if (p-iph.ttl)
printf( ttl %d , p-iph.ttl);
else
-   printf( ttl inherit );
+   printf( ttl %s,

Re: [PATCH 00/03][RESUBMIT] net: EtherIP tunnel driver

On Sat, Sep 23, 2006 at 02:13:27PM +0200, Jan-Benedict Glaw wrote:

 I haven't seen the first submission, but is this driver really needed?
 Can't this be done with creating two tap interfaces on both endpoints
 and bridge them with a local ethernet device using userland software?

In general it is possible to use a tap interface to tunnel Ethernet
packets. But this driver uses the EtherIP protocol defined in RFC 3378
which itself defines an own IP protocol for it (number 97). This
protocol is also supported by different other operating systems (some of
the major BSD versions). This driver makes Linux interoperable with
these implementations.

Regards,
Joerg Roedel
-
To unsubscribe from this list: send the line unsubscribe netdev in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

forcedeth broken powermanagement/irq handling ?

2006-09-23 Thread Tobias Diedrich

Hi,

since there hasn't been much progress with the bugzilla entry I'm
bringing this issue to your attention here. :)

http://bugzilla.kernel.org/show_bug.cgi?id=6398

vanilla forcedeth doesn't seem to support suspend and an
ifdown/up-cycle is needed to get it working again after suspend.
Francois Romieu's Awfully experimental patch is working just fine
for me (with message signalled interrupts disabled) and has survived
quite a few suspend/resume cycles.

So I'd very much like to see (at least partial, with msi disabled)
suspend support for forcedeth in mainline. 

Romieu's patch:

--- linux-2.6.18-rc6/drivers/net/forcedeth.c2006-09-09 09:45:43.0 
+0200
+++ linux-2.6.17.11-xen/drivers/net/forcedeth.c 2006-09-09 09:41:25.0 
+0200
@@ -4433,6 +4433,50 @@
pci_set_drvdata(pci_dev, NULL);
 }
 
+
+#ifdef CONFIG_PM
+
+static int nv_suspend(struct pci_dev *pdev, pm_message_t state)
+{
+   struct net_device *dev = pci_get_drvdata(pdev);
+   struct fe_priv *np = netdev_priv(dev);
+
+   if (!netif_running(dev))
+   goto out;
+
+   netif_device_detach(dev);
+
+   // Gross.
+   nv_close(dev);
+
+   pci_save_state(pdev);
+   pci_enable_wake(pdev, pci_choose_state(pdev, state), np-wolenabled);
+   pci_set_power_state(pdev, pci_choose_state(pdev, state));
+out:
+   return 0;
+}
+
+static int nv_resume(struct pci_dev *pdev)
+{
+   struct net_device *dev = pci_get_drvdata(pdev);
+   int rc = 0;
+
+   if (!netif_running(dev))
+   goto out;
+
+   netif_device_attach(dev);
+
+   pci_set_power_state(pdev, PCI_D0);
+   pci_restore_state(pdev);
+   pci_enable_wake(pdev, PCI_D0, 0);
+
+   rc = nv_open(dev);
+out:
+   return rc;
+}
+
+#endif /* CONFIG_PM */
+
 static struct pci_device_id pci_tbl[] = {
{   /* nForce Ethernet Controller */
PCI_DEVICE(PCI_VENDOR_ID_NVIDIA, PCI_DEVICE_ID_NVIDIA_NVENET_1),
@@ -4534,6 +4578,10 @@
.id_table = pci_tbl,
.probe = nv_probe,
.remove = __devexit_p(nv_remove),
+#ifdef CONFIG_PM
+   .suspend= nv_suspend,
+   .resume = nv_resume,
+#endif
 };
 
 
-- 
Tobias  PGP: http://9ac7e0bc.uguu.de
このメールは十割再利用されたビットで作られています。
-
To unsubscribe from this list: send the line unsubscribe netdev in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Re: [PATCH 00/03][RESUBMIT] net: EtherIP tunnel driver

2006-09-23 Thread jamal

On Sat, 2006-23-09 at 14:13 +0200, Jan-Benedict Glaw wrote:
 On Sat, 2006-09-23 14:07:04 +0200, Joerg Roedel [EMAIL PROTECTED] wrote:
  This patchset is the resubmit of the Ethernet over IPv4 tunnel driver
  for Linux.  I want to thank all reviewers for their annotations and
  helpfull input.  This version contains some major changes to the driver.
  It uses an own device type now (ARPHRD_ETHERIP). This fixes the problem
  that EtherIP devices could not be safely differenced from Ethernet
  devices. This change also required some other changes. First a second
  patch to the bridge code is included to allow the use of EtherIP devices
  in a bridge.  The third patch includes the necessary changes to iproute2
  (support of the new ARPHRD and general tunnel configuration support for
   EtherIP).
 
 I haven't seen the first submission, but is this driver really needed?
 Can't this be done with creating two tap interfaces on both endpoints
 and bridge them with a local ethernet device using userland software?

You just need to use GRE tunnel instead of what you describe above.

While i feel bad that Joerg (and Lennert and others before) have put the
effort to do the work, i too question the need for this driver. I dont
think even the authors of the original RFC feel this provides anything
that GRE cant (according to some posting on netdev that one of the
authors made). My understanding is also that the only other OS that
implemented this got it wrong - hence you will have to interop with them
and provide quirks checks.

I am actually curious if anyone uses it instead of GRE in openbsd?
You could argue that including this driver would allow Linux to have
another bulb in the christmas tree; the other (more pragmatic way) to
look at this is it allows spreading a bad idea and needs to be censored.
I prefer the later - and hope this doesnt discourage Joerg from
contributing in the future.

cheers,
jamal

-
To unsubscribe from this list: send the line unsubscribe netdev in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Re: [PATCH 5/9] network namespaces: async socket operations

2006-09-23 Thread Andrey Savochkin

On Fri, Sep 22, 2006 at 05:33:56PM +0200, Daniel Lezcano wrote:
 Andrey Savochkin wrote:
  Non-trivial part of socket namespaces: asynchronous events
  should be run in proper context.
  
  Signed-off-by: Andrey Savochkin [EMAIL PROTECTED]
  ---
   af_inet.c|   10 ++
   inet_timewait_sock.c |8 
   tcp_timer.c  |9 +
   3 files changed, 27 insertions(+)
  
  --- ./net/ipv4/af_inet.c.venssock-asyn  Mon Aug 14 17:04:07 2006
  +++ ./net/ipv4/af_inet.cTue Aug 15 13:45:44 2006
  @@ -366,10 +366,17 @@ out_rcu_unlock:
   int inet_release(struct socket *sock)
   {
  struct sock *sk = sock-sk;
  +   struct net_namespace *ns, *orig_net_ns;
   
  if (sk) {
  long timeout;
   
  +   /* Need to change context here since protocol -close
  +* operation may send packets.
  +*/
  +   ns = get_net_ns(sk-sk_net_ns);
  +   push_net_ns(ns, orig_net_ns);
  +
 
 Is it not a race condition here ? What happens if you have a packet 
 incoming during the namespace context switching ?

All asynchronous operations (RX softirq, timers) should set their context
explicitly, and can't rely on the current context being the right one
(or a valid pointer at all).

Andrey
-
To unsubscribe from this list: send the line unsubscribe netdev in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Re: [PATCH 00/03][RESUBMIT] net: EtherIP tunnel driver

On Sat, Sep 23, 2006 at 08:38:37AM -0400, jamal wrote:

Hello Jamal,

 You just need to use GRE tunnel instead of what you describe above.

The main intention for this driver was not only to provide Ethernet over
IPv4 tunneling. This is also possible in userspace using a tap interface
(as Jan-Benedict Glaw mentioned). Another main intention for this driver
was to provide tunneling of Ethernet packets using the EtherIP protocol.

 While i feel bad that Joerg (and Lennert and others before) have put the
 effort to do the work, i too question the need for this driver. I dont
 think even the authors of the original RFC feel this provides anything
 that GRE cant (according to some posting on netdev that one of the
 authors made).

You are right. I completly agree with this. But this is also true for
the IPIP and the SIT driver. You can do both with GRE. And there are
reasons to keep both in the Kernel.

 My understanding is also that the only other OS that implemented this
 got it wrong - hence you will have to interop with them and provide
 quirks checks.

At the moment I know at least that at least OpenBSD, NetBSD and FreeBSD
support the EtherIP protocol. The first of them was OpenBSD, thats
right. I don't think OpenBSD made a wrong implementation at this point
(I assume you are speaking of the position of the 3 in the header). The
RFC is not clear at this point. It defines that the first 4 bits in the
16 bit Ethernet header MUST be 0011. But it don't defines the
byteorder of that 16 bit word nor if the least or most significant bit
comes first. This was the reason (to keep interoperability with the
existing implementations) I implemented it the same way as OpenBSD and
my driver does not check the incoming EtherIP header.
 
 I am actually curious if anyone uses it instead of GRE in openbsd?

When I searched Google for EtherIP I found some entries in BSD forums
discussing questions concering EtherIP usage. This, and the fact I know
a BSD user that uses EtherIP too, makes be believe there are numerous
users of EtherIP in the BSD world. And at least the BSD user I know
wants interoperability of his NetBSD implemenation with Linux. This
request was the starting point for this driver.

 You could argue that including this driver would allow Linux to have
 another bulb in the christmas tree; the other (more pragmatic way) to
 look at this is it allows spreading a bad idea and needs to be censored.

I am not a friend of censorship. I think the users should have the
freedom to decide what they want to use. There are reasons to have more
than one way to tunnel Ethernet packets in the Kernel (the reason for
EtherIP is the interoperability with the BSD implementations). I don't
know if the GRE driver in mainline already support Ethernet tunneling.
But if not, my driver is already the second way to do it (after the tap
devices).

 I prefer the later - and hope this doesnt discourage Joerg from
 contributing in the future.

Surely not. I intend to further contribute even if this driver would be
finally rejected :)

Regards,
Joerg Roedel
-
To unsubscribe from this list: send the line unsubscribe netdev in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Re: Bcm43xx softMac Driver in 2.6.18

2006-09-23 Thread Ray Lee

Rafael J. Wysocki wrote:
 2.6.18 vanilla and 2.6.18 with your patch both lock my system hard
 with bcm43xx. I've got an HP/Compaq nx6125 laptop. Symptoms are that
 it will associate fine on its own and send traffic to/fro upon ifup,
 but when I do an iwconfig, ifdown, ifup to change the access point,
 the system locks (somewhat randomly) during one of those operations.
 Well, the iwconfig or the ifup, actually.
 
 I have observed similar symptoms on HPC nx6325, although I haven't managed
 to get the adapter associate with an AP.

Yeah, I'm having the same troubles. Carefully watching the iwconfig
results showed me that only half of the time did my `iwconfig eth1 essid
AccessPointName` actually take. (It listed the essid of the ap I told it
to associate with, but then showed Access Point: Invalid or words to
that effect, until I issued the exact same iwconfig again.)

So, try it twice, double check the iwconfig output, then try bringing up
the interface. Though that seems awfully difficult to do as well (DHCP
is just sending out stuff with nothing coming back).

When I switch consoles while DHCP is plaintively asking for an IP, and
issue *another* iwconfig with the same essid, then it seems to kick
something in the driver and DHCP immediately associates. Happened twice
for me so far, though that could merely be a coincidence.

Ray
-
To unsubscribe from this list: send the line unsubscribe netdev in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

[PATCH] Advertise PPPoE MTU / avoid memory leak.

2006-09-23 Thread mostrows

PPPoE must advertise the underlying device's MTU via the ppp channel
descriptor structure, as multilink functionality depends on it.

__pppoe_xmit must free any skb it allocates if there is an error
submitting the skb downstream.

Signed-off-by: Michal Ostrowski [EMAIL PROTECTED]
---
 drivers/net/pppoe.c |5 -
 1 files changed, 4 insertions(+), 1 deletions(-)

diff --git a/drivers/net/pppoe.c b/drivers/net/pppoe.c
index 475dc93..b4dc516 100644
--- a/drivers/net/pppoe.c
+++ b/drivers/net/pppoe.c
@@ -600,6 +600,7 @@ static int pppoe_connect(struct socket *
po-chan.hdrlen = (sizeof(struct pppoe_hdr) +
   dev-hard_header_len);
 
+   po-chan.mtu = dev-mtu - sizeof(struct pppoe_hdr);
po-chan.private = sk;
po-chan.ops = pppoe_chan_ops;
 
@@ -831,7 +832,7 @@ static int __pppoe_xmit(struct sock *sk,
struct pppoe_hdr *ph;
int headroom = skb_headroom(skb);
int data_len = skb-len;
-   struct sk_buff *skb2;
+   struct sk_buff *skb2 = NULL;
 
if (sock_flag(sk, SOCK_DEAD) || !(sk-sk_state  PPPOX_CONNECTED))
goto abort;
@@ -887,6 +888,8 @@ static int __pppoe_xmit(struct sock *sk,
return 1;
 
 abort:
+   if (skb2)
+   kfree_skb(skb2);
return 0;
 }
 
-- 
1.4.1.1

-
To unsubscribe from this list: send the line unsubscribe netdev in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

softmac mtu

2006-09-23 Thread Matthieu CASTET

Hi,

why softmac (and maybe device using linux 80211 stack) can't increase
their mtu above 1500 ?

IRRC 802.11 allow to send bigger frame. Moreover some driver like airo
allow to use mtu biger than 2000.

thanks,


Matthieu

-
To unsubscribe from this list: send the line unsubscribe netdev in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

softmac mtu

2006-09-23 Thread Matthieu CASTET

Hi,

why softmac (and maybe device using linux 80211 stack) can't increase
their mtu above 1500 ?

IRRC 802.11 allow to send bigger frame. Moreover some driver like airo
allow to use mtu biger than 2000.

thanks,


Matthieu

-
To unsubscribe from this list: send the line unsubscribe netdev in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Re: 2.6.1[78] page allocation failure. order:3, mode:0x20

2006-09-23 Thread Auke Kok


Andrew Morton wrote:

On Fri, 22 Sep 2006 22:25:07 -0700 (PDT)
David Miller [EMAIL PROTECTED] wrote:


From: Andrew Morton [EMAIL PROTECTED]
Date: Fri, 22 Sep 2006 21:50:00 -0700


On Fri, 22 Sep 2006 10:10:36 -0700
Auke Kok [EMAIL PROTECTED] wrote:


e1000: account for NET_IP_ALIGN when calculating bufsiz

Account for NET_IP_ALIGN when requesting buffer sizes from netdev_alloc_skb to 
reduce slab allocation by half.

Could we please do whatever is needed to get this blessed and merged?  This
is such a common problem on such a common driver that I would suggest that
we want this in 2.6.18.x as well.  At least, I'd expect distributors to
ship this fix (they're nuts if they don't) and so it makes sense to deliver
it from kernel.org.

The NET_IP_ALIGN existed not just for fun :)  There are ramifications
for removing it.


It's still there, isn't it?

For the 9k MTU case, for example, we end up allocating 16384 byte skbs
instead of 32786 kbytes ones.


yes, the only thing I'm doing is accounting for the 2 bytes one steap earlier. 
It works fine for the general case and I tested it too, but I am not too sure 
about the corner cases as the hardware has no notion of mtu at all and could 
possibly overwrite by two bytes. I think my patch actually give the hardware 
two bytes too much now, so we're on the other side (safe) of that problem, but 
I have to verify this first of course.


I'll be wrestling this on monday with Jesse and try to nail it down.

Auke




diff -puN 
drivers/net/e1000/e1000_main.c~e1000-account-for-net_ip_align-when-calculating-bufsiz
 drivers/net/e1000/e1000_main.c
--- 
a/drivers/net/e1000/e1000_main.c~e1000-account-for-net_ip_align-when-calculating-bufsiz
+++ a/drivers/net/e1000/e1000_main.c
@@ -1101,7 +1101,7 @@ e1000_sw_init(struct e1000_adapter *adap
 
 	pci_read_config_word(pdev, PCI_COMMAND, hw-pci_cmd_word);
 
-	adapter-rx_buffer_len = MAXIMUM_ETHERNET_VLAN_SIZE;

+   adapter-rx_buffer_len = MAXIMUM_ETHERNET_VLAN_SIZE + NET_IP_ALIGN;
adapter-rx_ps_bsize0 = E1000_RXBUFFER_128;
hw-max_frame_size = netdev-mtu +
 ENET_HEADER_SIZE + ETHERNET_FCS_SIZE;
@@ -3163,26 +3163,27 @@ e1000_change_mtu(struct net_device *netd
 * larger slab size
 * i.e. RXBUFFER_2048 -- size-4096 slab */
 
-	if (max_frame = E1000_RXBUFFER_256)

+   if (max_frame + NET_IP_ALIGN = E1000_RXBUFFER_256)
adapter-rx_buffer_len = E1000_RXBUFFER_256;
-   else if (max_frame = E1000_RXBUFFER_512)
+   else if (max_frame + NET_IP_ALIGN = E1000_RXBUFFER_512)
adapter-rx_buffer_len = E1000_RXBUFFER_512;
-   else if (max_frame = E1000_RXBUFFER_1024)
+   else if (max_frame + NET_IP_ALIGN = E1000_RXBUFFER_1024)
adapter-rx_buffer_len = E1000_RXBUFFER_1024;
-   else if (max_frame = E1000_RXBUFFER_2048)
+   else if (max_frame + NET_IP_ALIGN = E1000_RXBUFFER_2048)
adapter-rx_buffer_len = E1000_RXBUFFER_2048;
-   else if (max_frame = E1000_RXBUFFER_4096)
+   else if (max_frame + NET_IP_ALIGN = E1000_RXBUFFER_4096)
adapter-rx_buffer_len = E1000_RXBUFFER_4096;
-   else if (max_frame = E1000_RXBUFFER_8192)
+   else if (max_frame + NET_IP_ALIGN = E1000_RXBUFFER_8192)
adapter-rx_buffer_len = E1000_RXBUFFER_8192;
-   else if (max_frame = E1000_RXBUFFER_16384)
+   else
adapter-rx_buffer_len = E1000_RXBUFFER_16384;
 
 	/* adjust allocation if LPE protects us, and we aren't using SBP */

if (!adapter-hw.tbi_compatibility_on 
((max_frame == MAXIMUM_ETHERNET_FRAME_SIZE) ||
 (max_frame == MAXIMUM_ETHERNET_VLAN_SIZE)))
-   adapter-rx_buffer_len = MAXIMUM_ETHERNET_VLAN_SIZE;
+   adapter-rx_buffer_len = MAXIMUM_ETHERNET_VLAN_SIZE +
+   NET_IP_ALIGN;
 
 	netdev-mtu = new_mtu;
 
@@ -4002,7 +4003,8 @@ e1000_alloc_rx_buffers(struct e1000_adap

struct e1000_buffer *buffer_info;
struct sk_buff *skb;
unsigned int i;
-   unsigned int bufsz = adapter-rx_buffer_len + NET_IP_ALIGN;
+   /* we have already accounted for NET_IP_ALIGN */
+   unsigned int bufsz = adapter-rx_buffer_len;
 
 	i = rx_ring-next_to_use;

buffer_info = rx_ring-buffer_info[i];
_

-
To unsubscribe from this list: send the line unsubscribe netdev in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Re: Bcm43xx softMac Driver in 2.6.18


Ray Lee wrote:

Rafael J. Wysocki wrote:

2.6.18 vanilla and 2.6.18 with your patch both lock my system hard
with bcm43xx. I've got an HP/Compaq nx6125 laptop. Symptoms are that
it will associate fine on its own and send traffic to/fro upon ifup,
but when I do an iwconfig, ifdown, ifup to change the access point,
the system locks (somewhat randomly) during one of those operations.
Well, the iwconfig or the ifup, actually.

I have observed similar symptoms on HPC nx6325, although I haven't managed
to get the adapter associate with an AP.


Yeah, I'm having the same troubles. Carefully watching the iwconfig
results showed me that only half of the time did my `iwconfig eth1 essid
AccessPointName` actually take. (It listed the essid of the ap I told it
to associate with, but then showed Access Point: Invalid or words to
that effect, until I issued the exact same iwconfig again.)

So, try it twice, double check the iwconfig output, then try bringing up
the interface. Though that seems awfully difficult to do as well (DHCP
is just sending out stuff with nothing coming back).

When I switch consoles while DHCP is plaintively asking for an IP, and
issue *another* iwconfig with the same essid, then it seems to kick
something in the driver and DHCP immediately associates. Happened twice
for me so far, though that could merely be a coincidence.



I don't know about the problems associating, and/or with changing APs - I have 
only one and it
associates and authenticates with WPA-PSK without any trouble.

As to the lockups that you are seeing, I have generated a diff between vanilla 
2.6.18 and
wireless-2.6 with some essential patches added. At the moment, I'm compiling 
and testing it. There
are more problems with locking than I realized. If the patch works here, I'll 
post it to you and to
the bcm43xx list. The hard part may be getting stable to accept it for 2.6.18.1.

Thanks for the bug reports.

Larry

-
To unsubscribe from this list: send the line unsubscribe netdev in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Re: [PATCH] bcm43xx: further fix for periodic work errors

Michael Buesch wrote:

On Saturday 23 September 2006 06:08, Larry Finger wrote:

Recent changes in the setup for preemptible periodic work fixed most
of the problems with NETDEV watchdog timeouts; however, some variants
of the bcm43xx device still had the problem. These were fixed by setting
the parameter MAXIMUM_BADNESS to 0. By doing so, all the functionality
associated with calculating the 'badness' of the upcoming periodic work
is no longer needed; therefore it is removed.

Uhm, no. Wait. _Why_ does the watchdog trigger.
All periodic work in the fastpath (which you remove with this patch)
is supposed to execute in a few microseconds.
I don't think we want to fix this my removing the fastpath and always
taking the _expensive_ slowpath periodic work.

So why does the watchdog trigger for the fast periodic work?
We need to find out.
Removing the fastpath is just bad for overall latency.

The two fastpath periodic works are 15 and 30, if executed
standalone. If the 15 and/or 30 is execiuted alongside with
a 60sec work, it's all slowpath, of course.

I was thinking that the 15 second periodic work called mac suspend, which is the most expensive part
of the slowpath, but I see that is an unlikely condition. I'm now testing to see if moving the
netif_tx_disable/netif_wake_queue pair into all paths fixes the errors. Those calls should be
relatively inexpensive.

Larry

-
To unsubscribe from this list: send the line unsubscribe netdev in
the body of a message to [EMAIL PROTECTED]
More majordomo info at http://vger.kernel.org/majordomo-info.html

Re: [PATCH] bcm43xx: further fix for periodic work errors

2006-09-23 Thread Michael Buesch

On Saturday 23 September 2006 21:06, Larry Finger wrote:
 Michael Buesch wrote:
  On Saturday 23 September 2006 06:08, Larry Finger wrote:
  Recent changes in the setup for preemptible periodic work fixed most
  of the problems with NETDEV watchdog timeouts; however, some variants
  of the bcm43xx device still had the problem. These were fixed by setting
  the parameter MAXIMUM_BADNESS to 0. By doing so, all the functionality
  associated with calculating the 'badness' of the upcoming periodic work
  is no longer needed; therefore it is removed.
  
  Uhm, no. Wait. _Why_ does the watchdog trigger.
  All periodic work in the fastpath (which you remove with this patch)
  is supposed to execute in a few microseconds.
  I don't think we want to fix this my removing the fastpath and always
  taking the _expensive_ slowpath periodic work.
  
  So why does the watchdog trigger for the fast periodic work?
  We need to find out.
  Removing the fastpath is just bad for overall latency.
  
  The two fastpath periodic works are 15 and 30, if executed
  standalone. If the 15 and/or 30 is execiuted alongside with
  a 60sec work, it's all slowpath, of course.
 
 I was thinking that the 15 second periodic work called mac suspend, which is 
 the most expensive part 
 of the slowpath, but I see that is an unlikely condition. I'm now testing to 
 see if moving the 
 netif_tx_disable/netif_wake_queue pair into all paths fixes the errors. Those 
 calls should be 
 relatively inexpensive.

Well, even _if_ mac_suspend takes a few milliseconds (which it
does not), it would not trigger the watchdog.
I measured the time it takes to execute the various works
and based the badness selection on the results.

If the 15 or 30 second work is really able to trigger a watchdog
timeout, it's a _bug_ that needs to be fixed and not to be
papered over.
It won't trigger the watchdog, because it is running too long
uninterruptible (it won't run 5sec...). If it triggers, it's
triggered by something else (like the synchronize_net thingie
in the past).

-- 
Greetings Michael.
-
To unsubscribe from this list: send the line unsubscribe netdev in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Re: tested: Re: [PATCH] tcp: make cubic the default

From: bert hubert [EMAIL PROTECTED]
Date: Sat, 23 Sep 2006 13:14:34 +0200

 All in all, this final iteration of the congestion selection patches appears
 to do the job!

 Davem, I'd recommend both patches for merging.

Great, I'll make sure I review them too and integrate them.

Thanks for checking this stuff out so thoroughly.
-
To unsubscribe from this list: send the line unsubscribe netdev in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Re: 2.6.1[78] page allocation failure. order:3, mode:0x20

From: Auke Kok [EMAIL PROTECTED]
Date: Sat, 23 Sep 2006 11:50:34 -0700

 Andrew Morton wrote:
  It's still there, isn't it?

  For the 9k MTU case, for example, we end up allocating 16384 byte skbs
  instead of 32786 kbytes ones.

 yes, the only thing I'm doing is accounting for the 2 bytes one steap 
 earlier. 

Ok, I'm fine with this patch unless it causes some regression that hasn't
been discovered yet :-)
-
To unsubscribe from this list: send the line unsubscribe netdev in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Re: [PATCH] bcm43xx: further fix for periodic work errors


Michael Buesch wrote:


Well, even _if_ mac_suspend takes a few milliseconds (which it
does not), it would not trigger the watchdog.
I measured the time it takes to execute the various works
and based the badness selection on the results.

If the 15 or 30 second work is really able to trigger a watchdog
timeout, it's a _bug_ that needs to be fixed and not to be
papered over.
It won't trigger the watchdog, because it is running too long
uninterruptible (it won't run 5sec...). If it triggers, it's
triggered by something else (like the synchronize_net thingie
in the past).


Even the synchronize_net problem wasn't taking 5 seconds to complete, it was messing up the transmit 
process.


I went back to check my logs again, and the actual error was BCM43xx_IRQ_XMIT_ERROR, which is 
always preceded by a MAC suspend failed. These never happened all the time I was running with 
MAXIMUM_BADNESS of 0.


I think the _bug_ is letting the transmit process run while doing the periodic work, which is why 
I'm testing with the tx_disable before all periodic work. I'll let you know in 2 or 3 days if it 
fixes the problem. It takes that long to trigger.


Larry


Larry


-
To unsubscribe from this list: send the line unsubscribe netdev in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Re: [PATCH] bcm43xx: further fix for periodic work errors

2006-09-23 Thread Michael Buesch

On Saturday 23 September 2006 22:05, Larry Finger wrote:
 Michael Buesch wrote:
  
  Well, even _if_ mac_suspend takes a few milliseconds (which it
  does not), it would not trigger the watchdog.
  I measured the time it takes to execute the various works
  and based the badness selection on the results.
  
  If the 15 or 30 second work is really able to trigger a watchdog
  timeout, it's a _bug_ that needs to be fixed and not to be
  papered over.
  It won't trigger the watchdog, because it is running too long
  uninterruptible (it won't run 5sec...). If it triggers, it's
  triggered by something else (like the synchronize_net thingie
  in the past).
 
 Even the synchronize_net problem wasn't taking 5 seconds to complete, it was 
 messing up the transmit 
 process.

That's what I am saying. There must be another similiar bug.

 I went back to check my logs again, and the actual error was 
 BCM43xx_IRQ_XMIT_ERROR, which is 
 always preceded by a MAC suspend failed. These never happened all the time I 
 was running with 
 MAXIMUM_BADNESS of 0.

We can debug with the recently spec'ed reason and error registers why
this is triggered. See v4 specs.

 I think the _bug_ is letting the transmit process run while doing the 
 periodic work,

No. We don't let TX run while doing any periodic work (slow or fast).
Same for the IRQ handler.
We take the IRQ lock, which protects against IRQ and TX path (and everything 
else).
The _only_ difference between slowpath and fastpath periodic work is
that slowpath (long) periodic work is preemptible. This is gained
by not taking the IRQ lock, but protecting it otherwise (disabling IRQs and TX).

So what you are doing by your patch is: _never_ taking the lock.

 which is why  
 I'm testing with the tx_disable before all periodic work. I'll let you know 
 in 2 or 3 days if it 

It is not needed. tx_disable is only needed for long periodic work.

-- 
Greetings Michael.
-
To unsubscribe from this list: send the line unsubscribe netdev in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Re: softmac mtu


Matthieu CASTET wrote:

Hi,

why softmac (and maybe device using linux 80211 stack) can't increase
their mtu above 1500 ?

IRRC 802.11 allow to send bigger frame. Moreover some driver like airo
allow to use mtu biger than 2000.


The maximum value for MTU is set in include/linux/if_ether.h for all ethernet-type communications, 
not in softmac or ieee80211. I doubt that one could easily change the number. It may be that the 
802.11 standard allows bigger frames, but it looks to me as if Linux does not.


Larry
-
To unsubscribe from this list: send the line unsubscribe netdev in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Re: softmac mtu

From: Larry Finger [EMAIL PROTECTED]
Date: Sat, 23 Sep 2006 16:40:15 -0500

 The maximum value for MTU is set in include/linux/if_ether.h for all
 ethernet-type communications, not in softmac or ieee80211. I doubt
 that one could easily change the number. It may be that the 802.11
 standard allows bigger frames, but it looks to me as if Linux does
 not.

Not correct.  Linux is perfectly fine with setting 9000 byte MTU on
ethernet devices that support it, and in fact just about every
gigabit ethernet driver supports it.

That macro you see in if_ether.h is just the value of the base MTU
limit, so larger MTU settings are easily allowable on a per-device
basis.
-
To unsubscribe from this list: send the line unsubscribe netdev in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Re: softmac mtu

David Miller wrote:

From: Larry Finger [EMAIL PROTECTED]
Date: Sat, 23 Sep 2006 16:40:15 -0500

The maximum value for MTU is set in include/linux/if_ether.h for all
ethernet-type communications, not in softmac or ieee80211. I doubt
that one could easily change the number. It may be that the 802.11
standard allows bigger frames, but it looks to me as if Linux does
not.

Not correct.  Linux is perfectly fine with setting 9000 byte MTU on
ethernet devices that support it, and in fact just about every
gigabit ethernet driver supports it.

That macro you see in if_ether.h is just the value of the base MTU
limit, so larger MTU settings are easily allowable on a per-device
basis.

Where/how does the device allow it? When I tried 'ifconfig eth0 mtu 2000' on my VIA Technologies, 
Inc. VT6102 [Rhine-II] wired controller, I got a 'SIOCSIFMTU: Invalid argument' message, which is 
the same message I get on my BCM4306 wireless card.

Larry

-
To unsubscribe from this list: send the line unsubscribe netdev in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Re: softmac mtu

2006-09-23 Thread Arnaldo Carvalho de Melo

On 9/23/06, Arnaldo Carvalho de Melo [EMAIL PROTECTED] wrote:

On 9/23/06, Larry Finger [EMAIL PROTECTED] wrote:
 David Miller wrote:
  From: Larry Finger [EMAIL PROTECTED]
  Date: Sat, 23 Sep 2006 16:40:15 -0500

  The maximum value for MTU is set in include/linux/if_ether.h for all
  ethernet-type communications, not in softmac or ieee80211. I doubt
  that one could easily change the number. It may be that the 802.11
  standard allows bigger frames, but it looks to me as if Linux does
  not.

  Not correct.  Linux is perfectly fine with setting 9000 byte MTU on
  ethernet devices that support it, and in fact just about every
  gigabit ethernet driver supports it.

  That macro you see in if_ether.h is just the value of the base MTU
  limit, so larger MTU settings are easily allowable on a per-device
  basis.

 Where/how does the device allow it? When I tried 'ifconfig eth0 mtu 2000' on 
my VIA Technologies,
 Inc. VT6102 [Rhine-II] wired controller, I got a 'SIOCSIFMTU: Invalid 
argument' message, which is
 the same message I get on my BCM4306 wireless card.

David didn't said 1500 all the way to 9000, he said that some drivers
support 9000, some don't, lemme check for ya which one does...

drivers/net/8139cp.c: max is 4096
drivers/net/acenic.c: 9000

just do a:

vi $(find drivers/net | xargs grep -l change_mtu)

and check the rest :-)

- Arnaldo
-
To unsubscribe from this list: send the line unsubscribe netdev in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Re: [PATCH] Advertise PPPoE MTU / avoid memory leak.

From: [EMAIL PROTECTED]
Date: Sat, 23 Sep 2006 12:30:23 -0500

 __pppoe_xmit must free any skb it allocates if there is an error
 submitting the skb downstream.

This isn't right, dev_queue_xmit() can return -ENETDOWN and still
free the SKB, so your change will cause the SKB to be freed up
twice in that case, from dev_queue_xmit():

rc = -ENETDOWN;
rcu_read_unlock_bh();

out_kfree_skb:
kfree_skb(skb);
return rc;

dev_queue_xmit() is basically expected to consume the packet,
error or not.

What case of calling dev_queue_xmit() did you discover that did not
kfree the SKB on error?  We should fix that.  On a quick scan on the
entire dev_queue_xmit() implmentation, I cannot find such a case.
-
To unsubscribe from this list: send the line unsubscribe netdev in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Re: softmac mtu

From: Larry Finger [EMAIL PROTECTED]
Date: Sat, 23 Sep 2006 16:59:48 -0500

 Where/how does the device allow it? When I tried 'ifconfig eth0 mtu
 2000' on my VIA Technologies, Inc. VT6102 [Rhine-II] wired
 controller, I got a 'SIOCSIFMTU: Invalid argument' message, which is
 the same message I get on my BCM4306 wireless card.

It allows it in the device specific -change_mtu() method.
Tigon3, for example, overrides this with it's own function
called tg3_change_mtu() which checks if the particular model
of the chip supports jumbo MTU and if so allows such a setting.

The VIA driver simply doesn't override that function, and uses
the default ethernet one because either that ethernet chip doesn't
support the larger MTU or the author simply hasn't gotten around
to implementing the override.
-
To unsubscribe from this list: send the line unsubscribe netdev in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Re: [PATCH 00/03][RESUBMIT] net: EtherIP tunnel driver