Re: [Int-area] IPv6 ND applicability in draft-nachum-sarp-03

2013-01-16 Thread Linda Dunbar

Thanks to many people's response and comments to this draft. 
We want to clarify a couple of issues of this draft:

People are under the impression that the draft is to scale the flooding of ND 
messages on all links. MLD was brought up to say that ND messages are actually 
suppressed from flooding to links which don't have the target hosts. But that 
is the not the main intent of this draft. 


The real impact of ARP/ND in DC with massive number of hosts (or VMs) is on the 
L2/L3 boundary router (or Default GW). When hosts in subnet A needs to send 
data frames to Subnet B, the router has to 1) respond the ARP/ND requests from 
hosts in Subnet-A and 2) resolve target MAC for hosts in subnet-B. 

The second step is not only CPU intensive but also buffer intensive. There are 
some practices to alleviate the pain of Step 1) for IPv4, but not for IPv6 
(https://datatracker.ietf.org/doc/draft-dunbar-armd-arp-nd-scaling-practices/).

In order to protect routers CPU being overburdened by target resolution 
requests,  some routers has to rate limit the Target MAC resolution requests to 
CPU (Glean Throttling rate in this manual). 
http://www.cisco.com/en/US/docs/switches/datacenter/sw/5_x/nx-os/unicast/configuration/guide/l3_ip.pdf
 , searching for Glean Throttling. 

When the Glean Throttling rate is exceeded, the incoming data frames are 
dropped. 

In traditional Data Center, it is less of an issue because the number of hosts 
attached to one L2/L3 boundary router is limited by physical ports of 
switches/routers. When Servers are virtualized to support 30 plus VMs, the 
number of hosts under one router can grow 30 plus times. 

The solution proposed in this draft can eliminate (or reduce the likelihood of) 
inter-subnet data frames being dropped.  

In addition, the traditional DC has each subnet nicely placed in limited number 
of server racks, i.e. switches under router only need to deal with MAC 
addresses of those limited subnets. With subnets being spread across many 
server racks, the switches are exposed to VLAN/MAC of many subnets, greatly 
increasing the FDB. 

This draft also addresses the FDB entries explosion issue. 

Is it clear enough?


We can update the draft with this explanation. 

Linda


 -Original Message-
 From: int-area-boun...@ietf.org [mailto:int-area-boun...@ietf.org] On
 Behalf Of Linda Dunbar
 Sent: Tuesday, November 27, 2012 12:41 PM
 To: Greg Daley; Suresh Krishnan
 Cc: julien.i...@gmail.com; int-area@ietf.org
 Subject: Re: [Int-area] IPv6 ND applicability in draft-nachum-sarp-03
 
 Greg,
 
 Thank you very much for the detailed explanation.
 
 As for Router sending MLD query to all-nodes multicast ff02::1, if
 Explicit Tracing is not turned on, routers usually don't keep all the
 host IP addresses attached to each port/link, does it?
 If the ToR switches has MLD proxy, it could suppress reports from some
 IP hosts. Then, the Router may not receive MLD reports from all hosts
 attached to one port.
 
 Under this scenario,  when a host a sends Neighbor Solicitation
 multicast to b, router may not know which link b is attached. As
 the result router could flood all the ports which has the subnet
 enabled.
 
 The MLD with Explicit Tracking being turned on can make router be
 refreshed with all hosts in the domain. That can definitely scope the
 ND multicast.
 However, MLD with Explicit Tracking could be very resource intensive.
 
 draft-nachum-sarp gives another approach to scope the ND multicast
 and alleviates the burden on router when subnets are spread across
 multiple links/ToRs.
 
 Linda
 
  -Original Message-
  From: Greg Daley [mailto:gda...@au.logicalis.com]
  Sent: Wednesday, November 21, 2012 6:13 PM
  To: Linda Dunbar; Suresh Krishnan
  Cc: julien.i...@gmail.com; int-area@ietf.org
  Subject: RE: IPv6 ND applicability in draft-nachum-sarp-03
 
  Hi Linda,
  (also noting some overlap with Julien's response)
 
  The MLDv1 (RFC2710) or v2 (RFC3810) Querier sends out a general query
 R
  times (where R is the reliability factor for the link), which
 requests
  that all devices respond.  This goes to all-nodes multicast ff02::1,
  and the desitnation of the query doesn't have to match recipients
 (all
  of which listen to that multicast group).
 
  This scales with responses on order N x R where N is the number of
  nodes.
 
  If a router needs to manage a particular stream, it can send a
 specific
  query to one of the multicast addresses, particularly to verify that
  participants.
 
  The router doesn't actually need to track the solicited nodes
 multicast
  addresses, as these are link-local scoped and not multicast routed
 off-
  link.
 
  Actually in conformance with RFC 6398/BCP 168 (IP Router Alert
  Considerations and Usage)/RFC 6192 (Protecting the Router Control
 Plane)
  it is feasible to construct filters which ignore MLD report messages
  which only contain Link-local scoped multicast groups on the router.
  With the correct hardware support

Re: [Int-area] IPv6 ND applicability in draft-nachum-sarp-03

2012-11-27 Thread Linda Dunbar
Greg, 

Thank you very much for the detailed explanation. 

As for Router sending MLD query to all-nodes multicast ff02::1, if Explicit 
Tracing is not turned on, routers usually don't keep all the host IP addresses 
attached to each port/link, does it? 
If the ToR switches has MLD proxy, it could suppress reports from some IP 
hosts. Then, the Router may not receive MLD reports from all hosts attached to 
one port. 

Under this scenario,  when a host a sends Neighbor Solicitation multicast to 
b, router may not know which link b is attached. As the result router could 
flood all the ports which has the subnet enabled. 

The MLD with Explicit Tracking being turned on can make router be refreshed 
with all hosts in the domain. That can definitely scope the ND multicast. 
However, MLD with Explicit Tracking could be very resource intensive. 

draft-nachum-sarp gives another approach to scope the ND multicast and 
alleviates the burden on router when subnets are spread across multiple 
links/ToRs. 

Linda

 -Original Message-
 From: Greg Daley [mailto:gda...@au.logicalis.com]
 Sent: Wednesday, November 21, 2012 6:13 PM
 To: Linda Dunbar; Suresh Krishnan
 Cc: julien.i...@gmail.com; int-area@ietf.org
 Subject: RE: IPv6 ND applicability in draft-nachum-sarp-03
 
 Hi Linda,
 (also noting some overlap with Julien's response)
 
 The MLDv1 (RFC2710) or v2 (RFC3810) Querier sends out a general query R
 times (where R is the reliability factor for the link), which requests
 that all devices respond.  This goes to all-nodes multicast ff02::1,
 and the desitnation of the query doesn't have to match recipients (all
 of which listen to that multicast group).
 
 This scales with responses on order N x R where N is the number of
 nodes.
 
 If a router needs to manage a particular stream, it can send a specific
 query to one of the multicast addresses, particularly to verify that
 participants.
 
 The router doesn't actually need to track the solicited nodes multicast
 addresses, as these are link-local scoped and not multicast routed off-
 link.
 
 Actually in conformance with RFC 6398/BCP 168 (IP Router Alert
 Considerations and Usage)/RFC 6192 (Protecting the Router Control Plane)
 it is feasible to construct filters which ignore MLD report messages
 which only contain Link-local scoped multicast groups on the router.
 With the correct hardware support, this can be done at the line
 interface, without CPU processing.  The default is to pass all MLD
 reports up to the Router though.
 
 Arguably, the only reason to send MLD Join messages for Solicited
 Nodes' multicast addresses _is_ to support MLD snooping switches.
 
 Please note that MLD snooping is an LRU caching scheme, where the cache
 is refreshed for all active users through the background polling of the
 MLD querier.   The switches do not need to know anything ahead of time.
 Devices which do not respond to the query will be dropped from the
 cache, and only the number of active hosts will be retained.
 
 If you changed the IP addresses of every host on the link within a
 single query interval, the maximum number of Solicited nodes addresses
 cached by an MLD snooping network would be:
 
 ^A x N x 2
 
 Where:
 
 The original  the number of addresses is ^A x N
 and ^A is the mean number of addresses per host, and N is the number of
 hosts on the link.
 
 Sincerely
 
 Greg Daley
 
 
 
 
  From: int-area-boun...@ietf.org [mailto:int-area-boun...@ietf.org] On
 Behalf Of Linda Dunbar
  Sent: Thursday, 22 November 2012 2:17 AM
  To: Suresh Krishnan
  Cc: julien.i...@gmail.com; int-area@ietf.org
  Subject: Re: [Int-area] IPv6 ND applicability in draft-nachum-sarp-03
 
  Suresh,
 
  solicited node multicast address is formed by taking the low-order
 24 bits of target address (unicast or anycast) and appending those bits
 to the prefix  FF02:0:0:0:0:1:FF00::/104
 
  Routers don't know ahead of time what hosts are present in their
 Layer 2 domain. In DC with virtual machines added/deleted consistently,
 there could  be millions of possibilities.
 
  What multicast addresses should router send out MLD Query?
 
  What you are proposing is like using MLD snooping to achieve ND
 function, which doesn't scale.
 
  Linda
 
  -Original Message-
  From: Suresh Krishnan [mailto:suresh.krish...@ericsson.com]
  Sent: Tuesday, November 20, 2012 5:58 PM
  To: Linda Dunbar
  Cc: int-area@ietf.org; julien.i...@gmail.com; Tal Mizrahi
  Subject: Re: IPv6 ND applicability in draft-nachum-sarp-03
 
  Hi Linda,
 
  On 11/20/2012 01:05 PM, Linda Dunbar wrote:
   Suresh,
  
   Are you saying that router has to send out MLD Query for all
  potential ND multicast addresses and keep up state for listeners for
  all those multicast addresses?
 
  Nope. Only for those solicited node multicast groups that have at
 least
  one host on the subnet join.
 
  
   There could be millions of Solicited-Node multicast addresses.
 That
  is a lot of processing on routers.
 
  Not sure that I

Re: [Int-area] IPv6 ND applicability in draft-nachum-sarp-03

2012-11-21 Thread Linda Dunbar
Suresh,

solicited node multicast address is formed by taking the low-order 24 bits of 
target address (unicast or anycast) and appending those bits to the prefix 
FF02:0:0:0:0:1:FF00::/104

Routers don't know ahead of time what hosts are present in their Layer 2 
domain. In DC with virtual machines added/deleted consistently, there could be 
millions of possibilities.

What multicast addresses should router send out MLD Query?

What you are proposing is like using MLD snooping to achieve ND function, which 
doesn't scale.

Linda

 -Original Message-
 From: Suresh Krishnan [mailto:suresh.krish...@ericsson.com]
 Sent: Tuesday, November 20, 2012 5:58 PM
 To: Linda Dunbar
 Cc: int-area@ietf.org; julien.i...@gmail.com; Tal Mizrahi
 Subject: Re: IPv6 ND applicability in draft-nachum-sarp-03

 Hi Linda,

 On 11/20/2012 01:05 PM, Linda Dunbar wrote:
  Suresh,
 
  Are you saying that router has to send out MLD Query for all
 potential ND multicast addresses and keep up state for listeners for
 all those multicast addresses?

 Nope. Only for those solicited node multicast groups that have at least
 one host on the subnet join.

 
  There could be millions of Solicited-Node multicast addresses. That
 is a lot of processing on routers.

 Not sure that I follow. Do you have millions of hosts on the subnet? If
 not, I do not see why you have to keep track of millions of solicited
 node multicast addresses.

 Thanks
 Suresh



___
Int-area mailing list
Int-area@ietf.org
https://www.ietf.org/mailman/listinfo/int-area


Re: [Int-area] IPv6 ND applicability in draft-nachum-sarp-03

2012-11-21 Thread Julien Laganier
Linda,

The router periodically sends out a general query to which nodes
replies to with a report of their multicast listening state. Also,
when a node interface is configured with an IPv6 address, the node
spontaneously sends a report for the solicited-node multicast address
corresponding to the IPv6 address configured.

So routers don't have to know anything ahead of time. And same goes
for MLD-snooping switches.

--julien

On Wed, Nov 21, 2012 at 7:17 AM, Linda Dunbar linda.dun...@huawei.com wrote:
 Suresh,

 solicited node multicast address is formed by taking the low-order 24 bits
 of target address (unicast or anycast) and appending those bits to the
 prefix FF02:0:0:0:0:1:FF00::/104

 Routers don't know ahead of time what hosts are present in their Layer 2
 domain. In DC with virtual machines added/deleted consistently, there could
 be millions of possibilities.

 What multicast addresses should router send out MLD Query?

 What you are proposing is like using MLD snooping to achieve ND function,
 which doesn't scale.

 Linda

 -Original Message-
 From: Suresh Krishnan [mailto:suresh.krish...@ericsson.com]
 Sent: Tuesday, November 20, 2012 5:58 PM
 To: Linda Dunbar
 Cc: int-area@ietf.org; julien.i...@gmail.com; Tal Mizrahi
 Subject: Re: IPv6 ND applicability in draft-nachum-sarp-03

 Hi Linda,

 On 11/20/2012 01:05 PM, Linda Dunbar wrote:
  Suresh,
 
  Are you saying that router has to send out MLD Query for all
 potential ND multicast addresses and keep up state for listeners for
 all those multicast addresses?

 Nope. Only for those solicited node multicast groups that have at least
 one host on the subnet join.

 
  There could be millions of Solicited-Node multicast addresses. That
 is a lot of processing on routers.

 Not sure that I follow. Do you have millions of hosts on the subnet? If
 not, I do not see why you have to keep track of millions of solicited
 node multicast addresses.

 Thanks
 Suresh



___
Int-area mailing list
Int-area@ietf.org
https://www.ietf.org/mailman/listinfo/int-area


Re: [Int-area] IPv6 ND applicability in draft-nachum-sarp-03

2012-11-21 Thread Greg Daley
Hi Linda, 
(also noting some overlap with Julien's response)

The MLDv1 (RFC2710) or v2 (RFC3810) Querier sends out a general query R times 
(where R is the reliability factor for the link), which requests that all 
devices respond.  This goes to all-nodes multicast ff02::1, and the desitnation 
of the query doesn't have to match recipients (all of which listen to that 
multicast group).

This scales with responses on order N x R where N is the number of nodes.

If a router needs to manage a particular stream, it can send a specific query 
to one of the multicast addresses, particularly to verify that participants.

The router doesn't actually need to track the solicited nodes multicast 
addresses, as these are link-local scoped and not multicast routed off-link.

Actually in conformance with RFC 6398/BCP 168 (IP Router Alert Considerations 
and Usage)/RFC 6192 (Protecting the Router Control Plane) it is feasible to 
construct filters which ignore MLD report messages which only contain 
Link-local scoped multicast groups on the router.  With the correct hardware 
support, this can be done at the line interface, without CPU processing.  The 
default is to pass all MLD reports up to the Router though.

Arguably, the only reason to send MLD Join messages for Solicited Nodes' 
multicast addresses _is_ to support MLD snooping switches.

Please note that MLD snooping is an LRU caching scheme, where the cache is 
refreshed for all active users through the background polling of the MLD 
querier.   The switches do not need to know anything ahead of time.   Devices 
which do not respond to the query will be dropped from the cache, and only the 
number of active hosts will be retained.

If you changed the IP addresses of every host on the link within a single query 
interval, the maximum number of Solicited nodes addresses cached by an MLD 
snooping network would be:

^A x N x 2 

Where:

The original  the number of addresses is ^A x N 
and ^A is the mean number of addresses per host, and N is the number of hosts 
on the link. 

Sincerely

Greg Daley


 

 From: int-area-boun...@ietf.org [mailto:int-area-boun...@ietf.org] On Behalf 
 Of Linda Dunbar
 Sent: Thursday, 22 November 2012 2:17 AM
 To: Suresh Krishnan
 Cc: julien.i...@gmail.com; int-area@ietf.org
 Subject: Re: [Int-area] IPv6 ND applicability in draft-nachum-sarp-03

 Suresh, 
 
 solicited node multicast address is formed by taking the low-order 24 bits 
 of target address (unicast or anycast) and appending those bits to the prefix 
  FF02:0:0:0:0:1:FF00::/104 
 
 Routers don't know ahead of time what hosts are present in their Layer 2 
 domain. In DC with virtual machines added/deleted consistently, there could  
 be millions of possibilities.
 
 What multicast addresses should router send out MLD Query? 
 
 What you are proposing is like using MLD snooping to achieve ND function, 
 which doesn't scale. 
 
 Linda
 
 -Original Message-
 From: Suresh Krishnan [mailto:suresh.krish...@ericsson.com]
 Sent: Tuesday, November 20, 2012 5:58 PM
 To: Linda Dunbar
 Cc: int-area@ietf.org; julien.i...@gmail.com; Tal Mizrahi
 Subject: Re: IPv6 ND applicability in draft-nachum-sarp-03
 
 Hi Linda,
 
 On 11/20/2012 01:05 PM, Linda Dunbar wrote:
  Suresh,
 
  Are you saying that router has to send out MLD Query for all
 potential ND multicast addresses and keep up state for listeners for
 all those multicast addresses?
 
 Nope. Only for those solicited node multicast groups that have at least
 one host on the subnet join.
 
 
  There could be millions of Solicited-Node multicast addresses. That
 is a lot of processing on routers.
 
 Not sure that I follow. Do you have millions of hosts on the subnet? If
 not, I do not see why you have to keep track of millions of solicited
 node multicast addresses.
 
 Thanks
 Suresh
 
 
 
___
Int-area mailing list
Int-area@ietf.org
https://www.ietf.org/mailman/listinfo/int-area


Re: [Int-area] IPv6 ND applicability in draft-nachum-sarp-03

2012-11-20 Thread Suresh Krishnan
Hi Linda,

On 11/19/2012 07:11 PM, Linda Dunbar wrote:
 Suresh, 
 
 
 
 -Original Message-

 Because it is :-). As an example, if you take an fat tree architecture,
 the number of access switch ports on a subnet is not directly related
 to
 the number of ports on the L3 gateway that are on the same subnet.

 
 For the following network, if one subnet (regardless IPv4 or IPv6) have hosts 
 in all server racks, the L2/L3 has to enable the subnet on links (or ports) 
 to all the access switches (ToRs). 
 
 Then the IPv6's ND for the subnet will be flooded to all links. It is the 
 same problem as IPv4 ARP. How IPv6 ND scale better than IPv4's ARP?

At the risk of repeating myself. NO, the ND message will not be flooded
to all the links. It is multicast to a solicited node multicast address
(ff02::1:ffxx:) and the ND message is only sent to the links where
there is at least one node that has joined this multicast group.

 
 
 
 
  L2/L3 boundary
   +---+
 +/--+ |   
 | L2/L3   |  \
 +---+---+/\   \
  / \\ \ 
 /   \  \  \  
  +---++---+  +---+ +---+
  |T11|... |T1x|  |T21| ...   |T2y|  ToR switches
  +---++---+  +---+ +---+
||  | | 
  +-|-++-|-+  +-|-+ +-|-+
  |   |... |   |  |   | ...   |   |
  +---++---+  +---+ +---+  Server racks
  |   |... |   |  |   | ...   |   |
  +---++---+  +---+ +---+
  |   |... |   |  |   | ...   |   |
  +---++---+  +---+ +---+
 
 
 p.s. I still don't see how it is related to implementation. May be you mean 
 network design?  

As I said above, if your switch implementation(s) do not support MLD
snooping, then you will have an issue. In that case, the right fix is to
add MLD snooping functionality into the switch.

Thanks
Suresh

___
Int-area mailing list
Int-area@ietf.org
https://www.ietf.org/mailman/listinfo/int-area


Re: [Int-area] IPv6 ND applicability in draft-nachum-sarp-03

2012-11-20 Thread Linda Dunbar
Suresh, 

Are you saying that router has to send out MLD Query for all potential ND 
multicast addresses and keep up state for listeners for all those multicast 
addresses? 

There could be millions of Solicited-Node multicast addresses. That is a lot 
of processing on routers. 

Linda 

 -Original Message-
 From: Suresh Krishnan [mailto:suresh.krish...@ericsson.com]
 Sent: Tuesday, November 20, 2012 10:48 AM
 To: Linda Dunbar
 Cc: int-area@ietf.org; julien.i...@gmail.com; Tal Mizrahi
 Subject: Re: IPv6 ND applicability in draft-nachum-sarp-03
 
 Hi Linda,
 
 On 11/19/2012 07:11 PM, Linda Dunbar wrote:
  Suresh,
 
 
 
  -Original Message-
 
  Because it is :-). As an example, if you take an fat tree
 architecture,
  the number of access switch ports on a subnet is not directly
 related
  to
  the number of ports on the L3 gateway that are on the same subnet.
 
 
  For the following network, if one subnet (regardless IPv4 or IPv6)
 have hosts in all server racks, the L2/L3 has to enable the subnet on
 links (or ports) to all the access switches (ToRs).
 
  Then the IPv6's ND for the subnet will be flooded to all links. It is
 the same problem as IPv4 ARP. How IPv6 ND scale better than IPv4's ARP?
 
 At the risk of repeating myself. NO, the ND message will not be flooded
 to all the links. It is multicast to a solicited node multicast address
 (ff02::1:ffxx:) and the ND message is only sent to the links where
 there is at least one node that has joined this multicast group.
 
 
 
 
 
 L2/L3 boundary
+---+
  +/--+ |
  | L2/L3   |  \
  +---+---+/\   \
   / \\ \
  /   \  \  \
   +---++---+  +---+ +---+
   |T11|... |T1x|  |T21| ...   |T2y|  ToR switches
   +---++---+  +---+ +---+
 ||  | |
   +-|-++-|-+  +-|-+ +-|-+
   |   |... |   |  |   | ...   |   |
   +---++---+  +---+ +---+  Server racks
   |   |... |   |  |   | ...   |   |
   +---++---+  +---+ +---+
   |   |... |   |  |   | ...   |   |
   +---++---+  +---+ +---+
 
 
  p.s. I still don't see how it is related to implementation. May be
 you mean network design?
 
 As I said above, if your switch implementation(s) do not support MLD
 snooping, then you will have an issue. In that case, the right fix is
 to
 add MLD snooping functionality into the switch.
 
 Thanks
 Suresh

___
Int-area mailing list
Int-area@ietf.org
https://www.ietf.org/mailman/listinfo/int-area


Re: [Int-area] IPv6 ND applicability in draft-nachum-sarp-03

2012-11-20 Thread Suresh Krishnan
Hi Linda,

On 11/20/2012 01:05 PM, Linda Dunbar wrote:
 Suresh, 
 
 Are you saying that router has to send out MLD Query for all potential ND 
 multicast addresses and keep up state for listeners for all those multicast 
 addresses? 

Nope. Only for those solicited node multicast groups that have at least
one host on the subnet join.

 
 There could be millions of Solicited-Node multicast addresses. That is a 
 lot of processing on routers. 

Not sure that I follow. Do you have millions of hosts on the subnet? If
not, I do not see why you have to keep track of millions of solicited
node multicast addresses.

Thanks
Suresh


___
Int-area mailing list
Int-area@ietf.org
https://www.ietf.org/mailman/listinfo/int-area


Re: [Int-area] IPv6 ND applicability in draft-nachum-sarp-03

2012-11-19 Thread Suresh Krishnan
Hi Linda,

On 11/13/2012 08:43 PM, Linda Dunbar wrote:
 Suresh, 
 
 Comments are inserted below:
 
 -Original Message-

 Even though that IPv6 ND uses multicast instead broadcast, IPv6 ND
 does
 have scalability issues in Data Center when hosts within same subnet
 are
 spread across different access switches, and when there are lot of
 those
 subnets. In particular:

 *when hosts within the same subnet are spread across multiple
 switches (or GWs in the draft), the ND solicitation still go to all
 switches which might have the same subnet hosts attached. The traffic
 across GWs are same as IPv4's ARP.  The only difference is that hosts
 impact is reduced.

 This is not true. If the switches are MLD snooping then the packets
 will
 not be sent to irrelevant GWs/switches.

 
 [Linda] For those GWs which has Hosts belonging to a particular subnet, the 
 multicast for the subnet will reach all those GWs. 

The NSs are NOT multicasted to the subnet. They are sent to a solicited
node multicast address that is formed from the last 24 bits of the IPv6
address. So only network segments that have nodes with matching 24 last
bits in the address would get these messages when MLD snooping switches
are present.

 
 



 *When host a needs to communicate with host b in
 different
 subnet, a needs to send Neighbor Solicitation to L2/L3 boundary
 router, (just the same way as IPv4 ARP).

 Correct. But only the router gets it. Not all the other hosts in the
 subnet (which is the case for ARP).


 When there are many subnets spreading across many access switches,
 the
 L2/L3 boundary router has to enable all those subnets on many of its
 ports/links.

 This is purely an implementation issue. I know of several router
 implementations where this is not the case.

 
 [Linda] This is the standard setting. When you have one subnet spanning over 
 20 server racks, with each rack reaching the L2/L3 boundary router via 
 different links, the subnet has to be enabled on all those 20 ports. 
 
 
 Why do you say it is implementation issue?

Because it is :-). As an example, if you take an fat tree architecture,
the number of access switch ports on a subnet is not directly related to
the number of ports on the L3 gateway that are on the same subnet.

Thanks
Suresh


___
Int-area mailing list
Int-area@ietf.org
https://www.ietf.org/mailman/listinfo/int-area


Re: [Int-area] IPv6 ND applicability in draft-nachum-sarp-03

2012-11-19 Thread Linda Dunbar
Suresh, 



 -Original Message-
 
 Because it is :-). As an example, if you take an fat tree architecture,
 the number of access switch ports on a subnet is not directly related
 to
 the number of ports on the L3 gateway that are on the same subnet.
 

For the following network, if one subnet (regardless IPv4 or IPv6) have hosts 
in all server racks, the L2/L3 has to enable the subnet on links (or ports) to 
all the access switches (ToRs). 

Then the IPv6's ND for the subnet will be flooded to all links. It is the same 
problem as IPv4 ARP. How IPv6 ND scale better than IPv4's ARP?




   L2/L3 boundary
  +---+
+/--+ |   
| L2/L3   |  \
+---+---+/\   \
 / \\ \ 
/   \  \  \  
 +---++---+  +---+ +---+
 |T11|... |T1x|  |T21| ...   |T2y|  ToR switches
 +---++---+  +---+ +---+
   ||  | | 
 +-|-++-|-+  +-|-+ +-|-+
 |   |... |   |  |   | ...   |   |
 +---++---+  +---+ +---+  Server racks
 |   |... |   |  |   | ...   |   |
 +---++---+  +---+ +---+
 |   |... |   |  |   | ...   |   |
 +---++---+  +---+ +---+


p.s. I still don't see how it is related to implementation. May be you mean 
network design?  


Linda
___
Int-area mailing list
Int-area@ietf.org
https://www.ietf.org/mailman/listinfo/int-area


Re: [Int-area] IPv6 ND applicability in draft-nachum-sarp-03

2012-11-13 Thread Suresh Krishnan
Hi Linda,

On 11/12/2012 05:36 PM, Linda Dunbar wrote:
 Suresh and Julien,
 
  
 
 When the “draft-nachum-sarp-02” was presented at InterArea WG in 84^th
 IETF, the feedback was that the draft needs to be updated to include the
 processing for IPv6.
 
  
 
 So we updated the draft to include the processing of ND for IPv6.
 
  
 
 When the revised draft (draft-nachum-sarp-03) was presented at the
 InterArea WG in 85^th IETF, there are several people saying there is no
 problems for IPv6 ND, therefore it shouldn’t be optimized.
 
  
 
 Even though that IPv6 ND uses multicast instead broadcast, IPv6 ND does
 have scalability issues in Data Center when hosts within same subnet are
 spread across different access switches, and when there are lot of those
 subnets. In particular:
 
 ·when hosts within the same subnet are spread across multiple
 switches (or GWs in the draft), the ND solicitation still go to all
 switches which might have the same subnet hosts attached. The traffic
 across GWs are same as IPv4’s ARP.  The only difference is that hosts
 impact is reduced.

This is not true. If the switches are MLD snooping then the packets will
not be sent to irrelevant GWs/switches.

 
  
 
 ·When host “a” needs to communicate with host “b” in different
 subnet, “a” needs to send “Neighbor Solicitation” to L2/L3 boundary
 router, (just the same way as IPv4 ARP).

Correct. But only the router gets it. Not all the other hosts in the
subnet (which is the case for ARP).

 
 When there are many subnets spreading across many access switches, the
 L2/L3 boundary router has to enable all those subnets on many of its
 ports/links.

This is purely an implementation issue. I know of several router
implementations where this is not the case.

 
  
 
 When the first “a”- “b” data frame arrives at the boundary router, the
 boundary router has to “hold the data frame” and send a “Neighbor
 Solicitation” to all links which has the subnet for “b” enabled. The
 boundary router may need to send “NS” multiple times until it can make
 sure the target “b” exists before sending down the data frame from “a”.
 This process is not only CPU intensive, but also takes buffer.
 
  
 
  
 
 We need your advice on how to move this draft forward.

(Chair hat off) I personally find the IPv6 ND motivations to be lacking.
I think the draft needs a better description of the ND related issues
that are not stated in the armd problem statement.

Thanks
Suresh

___
Int-area mailing list
Int-area@ietf.org
https://www.ietf.org/mailman/listinfo/int-area