On Fri, Feb 5, 2016 at 1:30 PM, Russell Bryant <russ...@ovn.org> wrote:
>
> On 02/05/2016 02:22 AM, Justin Pettit wrote:
> > Joe and I spent some time today discussing our options.  This is
> > pretty tricky to get right and most of the options that come
> > immediately to mind have subtle corner cases.  We're planning to
> > whiteboard more options tomorrow, but I wanted to get down what's my
> > personal favorite and see what people think of its shortcoming.
> > We're planning to document the other options that we've considered
> > and the problems that they have, which we'll share with the group.
> >
> > The idea is to essentially implement a mark and sweep algorithm.
> > Assuming that we have a lowest priority "drop" flow, we'll add an
> > action that sets a "drop_flow" bit (e.g., 0x1) in the conntrack
> > label.  In the next table, we'll have a flow that matches on this
> > label bit and drops traffic.  Here's a psuedo set of flows to
> > implement allowing stateful traffic to port 22 and 80:
> >
> > 1) table=0, ip, actions=ct(table=1)
> > 2) table=1, priority=10, ct_state=-rpl, tcp, tp_dst=22,
actions=ct(commit,table=2)
> > 3) table=1, priority=10, ct_state=-rpl, tcp, tp_dst=80,
actions=ct(commit,table=2)
> > 4) table=1, priority=0, ct_state=-rpl, actions=ct(set_ct_label=0x1),drop
> > 5) table=1, priority=10, ct_state=+rpl, ct_label=0x1, actions=drop
> > 6) table=1, priority=0, ct_state=+rpl+est, actions=goto_table:2
> > 7) table=2, priority=0, actions= /* Continue logical forwarding
pipeline. */
> >
> > Here's an explanation of the flows:
> >
> > 1) Send all IP traffic to the connection tracker and then go to
> >    table 1.
> > 2) If the destination TCP port is 22 in the request direction, commit
> >    it to the connection tracker and continue to table 2.
> > 3) Same as flow 2, but with TCP port 80 traffic.
> > 4) Traffic in the request direction that doesn't match flows 2 or 3
> >    get the conntrack label set to 0x1 (the "drop_flow" bit) and the
> >    traffic gets dropped.  It's important to note that there's no
> >    "commit" here, so that this will mark an existing conntrack entry
> >    with that label, but won't create a new entry for it.
> > 5) Drop traffic in the reply direction with the "drop_flow" bit set.
> > 6) Send any reply traffic that has an existing conntrack entry (and
> >    the "drop_flow" bit not set) to table 2.
> > 7) Continue the logical forwarding pipeline (ie, the ACL allowed the
traffic)
> >
> > If traffic is initiated to port 23, it will be dropped by flow 4, but
> > there won't be an entry in the conntrack table since no one committed
> > it.  If traffic is initiated to port 22, the connection will be
> > allowed and committed to the conntrack table by flow 2.  Similarly
> > for traffic initiated to port 80, it will be allowed and committed by
> > flow 3.  The reply direction traffic to 22 and 80 will be allowed by
> > flow 6.
> >
> > Now let's say that flow 2 is removed because we don't want to allow
> > port 22 traffic anymore.  There will still be a conntrack entry from
> > that previous connection.  Now when the initiator sends traffic to
> > port 22, it will get dropped by flow 4, but we'll also set the
> > existing conntrack entry's flow label to 0x1.  When the reply traffic
> > comes back, it will now match flow 5, since the ct_label value will
> > be 0x1 and the flow will be dropped.  Traffic to port 80 will be
> > unaffected.
> >
> > The nice thing about this approach is that it's not very heavy duty:
> > it doesn't cause a lot of flow churn, it doesn't make worse
> > megaflows, it doesn't cause race conditions between updating the OVS
> > flow table and conntrack entries, we don't have to write (and debug)
> > another flow classifier in ovn-controller, it's straight-forward to
> > implement, and it's instantaneous in application--mostly.
> >
> > That "mostly" is it's drawback, though.  It instantly corrects
> > traffic in both directions once a packet is sent in the initiating
> > direction.  However, until that happens, reply traffic will continue
> > to flow.  I doubt this will be a big problem in practice, since you'd
> > need to have traffic that is largely unidirectional without any sort
> > of acknowledgement.  ACKs would take care of this for TCP, so it
> > wouldn't be a real problem (there could be a few packets that are let
> > through, but policy updates aren't going to be instantaneous coming
> > down from the CMS, anyway).  There could be UDP-based protocols that
> > don't use any sort of positive acknowledgement, but I don't know of
> > any off the top of my head.
> >
> > As I mentioned, Joe and I will try to come up with a document that
> > describes the different approaches that occur to us along with their
> > strengths and weaknesses.  I think that will be helpful to have a
> > more fruitful discussion about alternatives.
> >
> > In the meantime, I'd be curious to hear what people think about the
> > above proposal.  In the meantime, I think this would be a reasonable
> > approach, since it covers most of the use-cases nicely and it
> > wouldn't be hard to implement.
>
> Thank you for the write-up!  This approach sounds great to me.  Some
> small questions...
>
> 1) If we're only using 1 bit for now, is there any reason to use
> ct_label over ct_mark?  The docs in ovs-ofctl(8) seem to suggest they're
> identical other than being 32-bit vs 128-bit.  Would using the 32-bit
> ct_mark be beneficial in any way instead?
>
> 2) One thing not explicitly addressed in this write-up is traffic marked
> as related.  I think the proposal means just adding a match on
> ct_label=0x1 where we match ct_state=+rel today and we just rely on a
> packet in the request direction of the main connection to set ct_label.
>  That seems fine, but I wanted to clarify that point.
>
> I'm happy to work on the OVN implementation of this approach assuming no
> alternative supplants it.  It sounds fun.  :-)
>
> --
> Russell Bryant
> _______________________________________________
> dev mailing list
> dev@openvswitch.org
> http://openvswitch.org/mailman/listinfo/dev

This looks nice! I have one more question on top of Russell's.
In this proposal, every packet in request direction will trigger a "commit"
in conntrack. Just want to confirm is there performance impact?
Would it be better to split the flow to 2 flows, e.g.:

> > 2) table=1, priority=10, ct_state=-rpl, tcp, tp_dst=22,
actions=ct(commit,table=2)

change to:

2.1) table=1, priority=10, ct_state=+new, tcp, tp_dst=22,
actions=ct(commit,table=2)
2.2) table=1, priority=10, ct_state=-rpl-new, tcp, tp_dst=22,
actions=goto_table:2

--
Best regards,
Han
_______________________________________________
dev mailing list
dev@openvswitch.org
http://openvswitch.org/mailman/listinfo/dev

Reply via email to