Re: [bess] shepherd review of draft-ietf-bess-evpn-etree

Thomas Morin Mon, 12 Dec 2016 06:49:47 -0800

Hi Ali,

2016-12-10, Ali Sajassi (sajassi):

Your suggestion regarding multiple MAC-VRFs per EVI for E-TREE,impacts lot more sections than just section 2.2 for which yousuggested some texts. It drastically impacts section 3.1 (knownunicast traffic), and it also impacts section 3.2 (BUM traffic) andsection 5.1.


Can you detail why ?

The understanding that leads me to this suggestion is that the2-RT+split-horizon scenario in 2.1, then applied to Root/Leaf PE in a2.2.1 would not require new procotol procedures nor changes in the textthat as I understand provides procedures for 2.2(.2) and 2.3.

Furthermore, it creates a new paradigm for EVPN that was neverintended for because of creating two MAC-VRFs (and two bridge tables)for the same VLAN.

The "<new thing> created a new paradigm that <RFX xyz> was neverintended for" is a not generally valid, or sufficiently detailed,argument: if it was, then you might go as far as challenging the wholeE-Tree spec on the same kind grounds (and many other new things).

So here is where it seems we have a gap to bridge: I still don'tunderstand what in RFC7432 describes an intention of "not supporting twoMAC-VRFs for the same VLAN".

The WG LC was completed on 3/29/16 and I am sure it is not yourintention to have major changes to the doc at this stage wheremultiple vendors have already implemented the draft.

As you know, there are different stages at which people do reviews on adoc after WGLC, an which may lead doc editors to introduce significant--editorial or technical-- changes in a document. Sometimes that leadsto documents going back to the working group.

However my root intention as doc shepherd, of course, is not to proposea major change, but merely to able to answer the standard question ofthe shepherd review -- on the reviews done, on document readiness, andon the document quality -- in a way as positive and sincere as possible.In particular questions (3) (4) and (6).

This draft talks about two kinds of traffic filtering: a) ingressfiltering for known unicast and b) egress filtering for BUM traffic.What you are suggesting is an alternate mechanism for ingress filtering.

(well I'm not suggesting the mechanism itself --which section 2.1already does-- but simply to document that it can still apply withoutthe constraint of avoiding the presence of a Root MAC-VRF and a LeafMAC-VRF on a same PE)

Although having multiple VRFs (and forwarding tables) are fine forIP-VPNs because the unknown traffic is always dropped, multiple VRFsfor the same VLAN is not OK for L2 traffic because of flooding ofunknown traffic. That’s why in section 6 of RFC 7432, for all serviceinterface types, the draft talks about a single MAC-VRF per EVI per PEand in case of VLAN-aware mode, multiple VLANs per MAC-VRF but only asingle bridge table per VLAN. In other words, the bottom line is thatthere can only be a single bridge table per VLAN in order to avoidunnecessary flooding.

When you have two MAC-VRFs per VLAN (one for root ACs and another forLeaf ACs), then you either need to duplicate lots of MAC addressesbetween these two VRFs, or do lookup on both of these VRFs. Eitherways this is not a good option relative to keeping a single VRF tablefor both root and leaf sites and just have a single-bit indication onwhether a MAC is associated with root or leaf (as currently describedapproach in the draft). I

In the above, it seems you agree that it can work, and you are able tooffer reasons why it is not the preferred option, then why not justdocument that it can work and provides these reasons as the motivationsthat lead to proposing a new specs ?


(it seems you have an unfinished last sentence: "I [...]" )

(assuming the previous point is resolved:)
With this mechanism above, isn't it possible to have on a given PE,for a single E-TREE EVI, both Leaves and Roots, as long as distinctMAC-VRFs are used (one for Leaves and one for Roots) ? (it seemsto me that the assymetric import/export RT would do what is neededto build an E-TREE, we would just have a particular case where aLeaf MAC-VRF and a Root MAC-VRF for a given E-TREE end up on asingle PE)
That’s not possible because per definition of an EVI, there is onlya single MAC-VRF per EVI for a PE.
Where can I read such a definition ? (the Terminology section inRFC7432 does not say that, unless I'm missing something).
And that seems a completely arbitrary restriction.
(just thinking that a given PE device can be split in two logicaldevices show that it can work)
Section 6 of RFC7432 where it gives definitions for different serviceinterface types, it specifies the relationship between MAC-VRF andVLAN (bridge table) and how many MAC-VRF (and bridge tables) can beper EVI.
This section of RFC7434 discusses many different things for thedifferent variants.Can you provide a specific pointer about "how many MAC-VRFs can be perEVI" ?
Ali> Section 6 of RFC7432 spells out the relationship between EVI,MAC-VRF, and bridge tables for all service interfaces very clearly.In all service interfaces, the RFC says there is one MAC-VRF per EVIon a given PE.Now, if the service interface is “vlan-aware”, then there are severalbridge tables for that single MAC-VRF – ie, one bridge table per VLAN.In all service interfaces, you can ONLY have one bridge table per VLAN.


This answer is everything but a specific pointer.

If Section 6 of RFC7432 says all this very clearly, I guess it should bepossible to extract quotes about "there is one MAC-VRF per EVI on agiven PE", right ?

In bridging world, there can only be a single bridge table per VLANin a device.
I still don't find here anything that would preclude having, on agiven PE, for a given E-TREE EVI, one Leaves MAC-VRF and one RootsMAC-VRF: can't these two MAC-VRFs use different internal VLANs (withtranslation if the external VLANs are constrained).
Ali> Lets assume we are using vlan-based service and thus there isonly a single bridge table per MAC-VRF, then what you are suggestingis two use two MAC-VRFs (two bridge tables) for the same EVI (sameVLAN). This results in some duplications of MAC addresses and wouldonly work if flooding is disabled (more on this later).

"results in some duplications of MAC" is perhaps a drawback, but nothinglike "just does not work" ?

"would only work if flooding is disabled": why ? (you wrote "(more onthis later)" but I couldn't identify anything recent from you in therest of the email below)

From an helicopter view, I can't see what fundamentally would becomeproblematic between "two MAC-VRFs on two distinct PEs" and the same "twoMAC-VRFs on a same PEs", at worse it is as efficient or as inefficientas having them on separate PEs (think logical router without anykind ofdataplane optimisation), and we can't exclude that the PE could havelocal implementation details to do better than that.

Besides, I don’t understand what good does it do to have twoMAC-VRFs on the same PE (one for Leafs and another for Roots)
Well, the "what is good for" is pretty simple: it means you can have,just by tailoring the import/export policies like in 2.1, somethingas useful as the scenario in 2.2.
There can only be a single bridge table per VLAN. Now even if you addsome kind of logic to form two logical PEs in single physical PE, youend up replicating all the MAC addresses associated with the rootsites in two bridge tables.
Your point above certainly does not sound to me as "it can't be done":some may think that the above is an acceptable cost, some others mayfind ways to make this "replication" with a low overhead, on someplatforms the cost may be negligible, etc.
because Leafs and Roots need to talk to each other and thus we wantthem to be in the same MAC-VRF.
The fact that Leafs and Roots need to talk to each other does notmean that they *have* to be in the same MAC-VRF, you can rely on thelocal MPLS dataplane inside the PE to carry the traffic between Rootsand Leaves can be passed between a Leaf MAC-VRF and a Root MAC-VRF(and you can possibly implement a shortcut not involving MPLSencap/decap).
Anything is possible but at what cost.
You know, for cost it is not always obvious to reach conclusions thatare true for all implementations and all targets.
The current proposal is very efficient in terms of forwarding path aswell as control plane.
Sure, but what I question is not the new solution but the lack ofdiscussion on why using the existing specs was not considered good enough.
I think that my concern of clearly explaining the scenarios andmotivations for this new spec could be addressed by splitting section2.2 into a 2.2.1 describing the approach from 2.1 and its possibledrawbacks, and a 2.2.2 having essentially the content of currentsection 2.2.
Here is a proposal:

2.2 Scenario 2: Leaf of Root site(s) per AC

   In these scenarii, a PE receives traffic from either Root OR Leaf
   sites (but not both) on a given Attachment Circuit (AC) of an EVI. In
   other words, an AC (ES or ES/VLAN) is either associated with Root(s)
   or Leaf(s) (but not both).
2.2.1 Scenario 2a: Leaf OR Root site(s) per AC, separate Leaf/RootMAC-VRFs
+---------+            +---------+
|   PE1   |            |   PE2   |
    +---+ |  +---+  |  +------+  |  +---+  |            +---+
|CE1+-----ES1----+--+   |  |  |      |  | |MAC+--+---ES2/AC1--+CE2|
    +---+    (Leaf) |  |MAC|  |  | MPLS |  |  |VRF|  |   (Leaf)   +---+
|  |VRF|  |  |  /IP |  |  '---'  |
|  |   |  |  |      |  |  .---.  |
|  |   |  |  |      |  |  |MAC|  |            +---+
|  |   |  |  |      |  |  |VRF+--+---ES2/AC2--+CE3|
|  +---+  |  +------+  |  +---+  |   (Root)   +---+
+---------+            +---------+

   Figure 2: Scenario 2a
In this scenario, the RT constraint procedures described in section2.1 couldalso be used. The feasibility and efficiency of this approachdepends on
   platforms specifics.
This approach will lead toduplication of a large proportion of MACaddressesonPEs having both Leaf and Root sites, and is hence considered lesssuitable fordeployment contexts where the vast majority of PEs are likely toultimately
   have both Leaf and Root sites attached to them.

2.2.2 Scenario 2b: Leaf OR Root site(s) per AC, single MAC-VRF

+---------+            +---------+
|   PE1   |            |   PE2   |
    +---+ |  +---+  |  +------+  |  +---+  |            +---+
|CE1+-----ES1----+--+   |  |  |      |  |  | +--+---ES2/AC1--+CE2|
    +---+    (Leaf) |  |MAC|  |  | MPLS |  |  |MAC|  |   (Leaf)   +---+
|  |VRF|  |  |  /IP |  |  |VRF|  |
|  |   |  |  |      |  |  |   |  |            +---+
|  |   |  |  |      |  |  |   +--+---ES2/AC2--+CE3|
|  +---+  |  +------+  |  +---+  |   (Root)   +---+
+---------+            +---------+

   Figure 2: Scenario 2b
This scenario will alleviate keys drawbacks from Scenario 2a, inparticularby avoiding duplication of MAC addresses on Leaf/Root PEs andavoiding the
   operational overhead of managing more than one RT.
This approach comes at the expense of having routes for unneededMAC addresses on Leaf-only PEs, and is hence considered less suitablefor deployment contexts where the vast majority of PEs would remainLeaf-only. Unlike Scenario 1 and Scenario 2a, this scenario requires additional procedures
    provided in this document.


(And this last sentence should be added to section 2.3 as well)
For this scenario, if for a given
   EVI, the majority of PEs will eventually have both Leaf and Root
   sites attached, even though they may start as Root-only or Leaf-only
   PEs, then it is recommended to use a single RT per EVI and avoid
   additional configuration and operational overhead.
Why this recommendation ?
Even with a majority of PEs having both Leaves and Roots, there canremain (up to 49% of) PEs having only Leaves, which will uselesslyhave all routes to other Leaves.
So "it is recommended" above, deserves to be explained more, I think.

OK, I changed “majority” to “vast majority” :-)
My point was not to nit pick on "majority", but was that you shouldexplain why you recommend that.As the text currently reads, the cost of the recommendation can beidentified: having useless routes on the fraction of PEs having onlyLeaves.But the gain brought by the recommendation is not even mentioned, notto say explained.
Hence: why ?
(Why is it a useful tradeoff to have useless routes on some, even ifonly one, PE ?)
Changed the last sentence from:
"then it is recommended to use a single RT per EVI and avoidadditional configuration and operational overhead.”
To
"then it is recommended to use a single RT per EVI and avoidadditional configuration and operational overhead
at the expense of having unwanted MAC addresses on the Leaf PEs."
Ok. I adapted and incorporated this addition into my proposed textsplitting 2.2 into a 2.2.1 and a 2.2.2.
Best,

-Thomas

_______________________________________________
BESS mailing list
[email protected]
https://www.ietf.org/mailman/listinfo/bess

Re: [bess] shepherd review of draft-ietf-bess-evpn-etree

Reply via email to