On Sun, Jul 29, 2018 at 8:38 AM, Joe Touch <[email protected]> wrote: > > > On Jul 28, 2018, at 11:24 AM, Ole Troan <[email protected]> wrote: > > Here’s the thing about fragmentation: > > 1. all links have a maximum packet size > 2. all tunneling/encapsulation/layering increases payload size > > 1+2 implies there is always the need for fragmentation at some layer: > > > 1 implies that. > There is enough head room designed in 1 to accommodate 2. > > > As was noted, there’s never enough headroom because you can’t control (or > even know) how many layers of tunnels your traffic experiences. > > > > 3. fragmentation always splits info across packets > > And there’s something important about layering: > > 4. layering intends to isolate the behavior of one layer from another, such > that > it will always be impossible for an upper layer to know exactly what is > going on below, > i.e., to determine that limiting size across an entire path of possibly > virtual tunnels > > The next two are where we get into trouble: > > 5. network devices increasingly WANT to inspect contents beyond the layer at > which they are intended to operate > > > not that network devices have an intent in themselves, but yes, it seems > like network operators want to inspect content or are forced into it because > of the necessity of IPv4 address sharing. > > > They were “forced” into differentiating commercial from home customer > pricing. NATs didn’t need to exist when they were first deployed. > > That said, there’s no real problem with a NAT *IF* it acts as a host on the > Internet > (see ouch, J: Middlebox Models Compatible with the Internet. USC/ISI > (ISI-TR-711), 2016.)
Joe, It's still a problem though. A NAT (or any stateful device in the network) forces the requirement in network architecture that all packets of a flow are routed through the same device. This has killed our ability to use multi-homing and multi-path. The best way for an intermediate devices to deal with transport layer state is to be an L4 proxy. The intermediate is a host endpoint for the proxy connections, but then that has its own problems since it breaks E2E functionality (like TCP auth). So the only real solution is to eliminate transport state from the network. I'm still holding out hope that IPv6 will start to obsolete use of NAT! FAST (draft-herbert-fast-02) is intended to provide a viable alternative to stateful firewalls. Tom > > 6. inspecting contents ultimately means reassembly, at some level > > > _some_ content inspection would require that, but I don't think you can make > that the general rule. > e.g. a NAT or an L4 ACL only needs access to the L4 header. > > > It’s a general rule; you merely refer to an instance that is relevant ONLY > when the L3 header at which a router operates is followed (or expected to be > followed) by an L4 header. > > That may have been the Internet of 10 years ago, but it is less and less the > Internet moving forward. > > > Which brings us to the punchline: > > 7. but network device vendors want to save money, so they don’t want to > reassemble at any layer > > > We'd all wish it to be that simple. Can you substantiate that claim? > You can easily make the speculation that customers don't want to pay what it > costs to be able to do reassembly at terabit speeds... > Or accept that it's technically hard. > > > I’m not claiming it isn’t hard; I’m claiming that it’s always cheaper to > *not* do something. > > My concern isn’t that reassembly isn’t being done; it’s that vendors sell > devices that don’t make this clear - AND that they don’t pass on packets > they don’t/can’t reassemble. > > The implementations of e.g. NATs, IPv4 address sharing implementations I'm > aware of do flavours of network layer reassembly. > > > If they did so, we would not be here talking about considering IP > fragmentation fragile. > > However much money you throw at it, you can't reassemble fragments > travelling on different paths, nor can you trivially make network layer > reassembly not be an attack vector on those boxes. > > > Agreed, but here’s the other point: > > Any device that inspects L4 content can do so ONLY as a proxy for the > destination endpoint. > > I.e., I know vendors WANT to sell devices they say can be deployed anywhere > in the network, and operators believe that, but it’s wrong. > > Basically, if you’re not at a place in the network where you represent that > endpoint, you have no business acting as that endpoint - “full stop”. > > > > So I agree, IP fragmentation has its flaws - but those flaws are created not > only because it leaves out the transport port numbers, but also because DPI > and NAT devices don’t reassemble. And they don’t because it’s cheaper to > sell devices that say they run at 1 Gbps (e.g.) that don’t bother to > reassemble. > > > I don't agree with your conclusion. > NATs extend the network layer to include the L4 ports. NAT implementations > of course do reassemble. > > > See Touch, J: Middlebox Models Compatible with the Internet. USC/ISI > (ISI-TR-711), 2016. > > NATs do not extend the network layer to include L4; they act as endpoints in > the public net and routers in the private net - for the reason above > (multipath), among others. > > > I.e., it will never matter what layering we add to fix this - GRE, GUE, > Aero, etc. - ultimately, we’re doomed to need fragmentation support down to > IP exactly because: > > a. #1-4 mean we need frag/reassembly at any tunnel ingress > b. vendors want to sell #5 at a price that is too low for them to support #6 > (i.e., point #7) > > > > So pushing this to another layer will never solve it. What will solve it > will only be a compliance requirement for #6 - which could be done right > now, and has to be done for ANY solution to work. > > > For IPv4 address sharing specifically removing network layer fragmentation > would be a solution. > > > For ONE problem there is ONE patch until a new device looks further (to do > port filtering on UDP tunneled traffic, e.g.). Then we’re back here talking > about fragmentation over GUE considered fragile. > > > NOTE: even rewriting EVERY application won’t fix this, nor will deploying a > new layer at any level. > > > For some type of content inspection that would require reassembling the > whole application context. > But that's quite different from IPv4 address sharing, which we have > unfortunately made an integral part of the Internet architecture. > > > The only difference is that the impact of the current L4 problem is being > currently discussed. If you consider why it’s happening, it might be more > clear that this problem will cycle around again and cannot be solved by > patching this way. > > Joe > > _______________________________________________ > Int-area mailing list > [email protected] > https://www.ietf.org/mailman/listinfo/int-area > _______________________________________________ Int-area mailing list [email protected] https://www.ietf.org/mailman/listinfo/int-area
