Re: [c-nsp] ASR1002 -- interface stops passing IPv4 traffic?
I've hit this CSCva35619 so that matches up. Upgraded at the time to 03.16.04a extended support release. Worked perfectly! Forwarding correctly ever since Sent from my portable email sender Please excuse shorter messages > On 19/05/2017, at 21:09, Paul Sherrattwrote: > > Hi John, > > This sounds like it may be an input queue wedge on the interface, which is > only fixed with a reload. > > I've seen CVE-2016-1478 / CSCva35619 hit a few people. If you're running > an affected version you'll need to upgrade or workaround. To verify, check > queue size in "show interface" output - if it's a queue wedge you can issue > "show buffers old [dump]" to confirm it is indeed the NTP bug causing your > issues. > > https://bst.cloudapps.cisco.com/bugsearch/bug/CSCva35619 > > > Cheers, > > Paul > >> On 19 May 2017 at 05:43, John Osmon wrote: >> >> I've never found an IOS device I couldn't tame with the help of Usenet >> and then google. However, I'm new to the ASR1000 and IOS-XE, and I'm >> running into something I've never seen before. >> >> I've got GigE ports that will pass traffic, and then suddenly stop. >> The interface still shows up/up, but you can't even ping the local >> interface from the router itself. >> >> We've can restore traffic by moving the config to another port, but the >> "dead" port stays dead. We've tried shut/no shut, new SFPs, and new >> configs -- but the port still won't work. >> >> Interestingly, the port *DOES* work with IPv6 -- but not IPv4. This >> router doesn't use IPv6, but when I put an address on the interface, it >> is pingable. >> >> If you apply an IPv4 /24 to the dead interface, the routing table shows >> the /24 as a "connected" network, and shows a "local" /32 for the >> address in use -- but is not pingable. >> >> The only thing we've found in common between the ports is that they >> were connected to eBGP peers. We've had three events, on ports >> connected to two different providers. >> >> My next step is to get to the colo and move one of the "dead" ports to >> a spanned port switch and start sniffing the line. >> >> Any suggestions would be appreciated. Hardware in use includes: >> ASR1000-ESP10 >> ASR1002-RP1 >> SPA-8X1GE-V2 >> >> Problem has occurred in both built-in and SPA-8X1GE-V2 ports, with >> multi-mode, and GE-T transceivers. >> >> ___ >> cisco-nsp mailing list cisco-nsp@puck.nether.net >> https://puck.nether.net/mailman/listinfo/cisco-nsp >> archive at http://puck.nether.net/pipermail/cisco-nsp/ >> > ___ > cisco-nsp mailing list cisco-nsp@puck.nether.net > https://puck.nether.net/mailman/listinfo/cisco-nsp > archive at http://puck.nether.net/pipermail/cisco-nsp/ ___ cisco-nsp mailing list cisco-nsp@puck.nether.net https://puck.nether.net/mailman/listinfo/cisco-nsp archive at http://puck.nether.net/pipermail/cisco-nsp/
Re: [c-nsp] ASR1002 -- interface stops passing IPv4 traffic?
Thanks Paul -- this appears to be the exact issue. I just didn't have enough experience with the platform to know to look here. John On Fri, May 19, 2017 at 12:09:18PM +0100, Paul Sherratt wrote: > Hi John, > > This sounds like it may be an input queue wedge on the interface, which is > only fixed with a reload. > > I've seen CVE-2016-1478 / CSCva35619 hit a few people. If you're running > an affected version you'll need to upgrade or workaround. To verify, check > queue size in "show interface" output - if it's a queue wedge you can issue > "show buffers old [dump]" to confirm it is indeed the NTP bug causing your > issues. > > https://bst.cloudapps.cisco.com/bugsearch/bug/CSCva35619 > > > Cheers, > > Paul > > On 19 May 2017 at 05:43, John Osmonwrote: > > > I've never found an IOS device I couldn't tame with the help of Usenet > > and then google. However, I'm new to the ASR1000 and IOS-XE, and I'm > > running into something I've never seen before. > > > > I've got GigE ports that will pass traffic, and then suddenly stop. > > The interface still shows up/up, but you can't even ping the local > > interface from the router itself. > > > > We've can restore traffic by moving the config to another port, but the > > "dead" port stays dead. We've tried shut/no shut, new SFPs, and new > > configs -- but the port still won't work. > > > > Interestingly, the port *DOES* work with IPv6 -- but not IPv4. This > > router doesn't use IPv6, but when I put an address on the interface, it > > is pingable. > > > > If you apply an IPv4 /24 to the dead interface, the routing table shows > > the /24 as a "connected" network, and shows a "local" /32 for the > > address in use -- but is not pingable. > > > > The only thing we've found in common between the ports is that they > > were connected to eBGP peers. We've had three events, on ports > > connected to two different providers. > > > > My next step is to get to the colo and move one of the "dead" ports to > > a spanned port switch and start sniffing the line. > > > > Any suggestions would be appreciated. Hardware in use includes: > >ASR1000-ESP10 > >ASR1002-RP1 > >SPA-8X1GE-V2 > > > > Problem has occurred in both built-in and SPA-8X1GE-V2 ports, with > > multi-mode, and GE-T transceivers. > > > > ___ > > cisco-nsp mailing list cisco-nsp@puck.nether.net > > https://puck.nether.net/mailman/listinfo/cisco-nsp > > archive at http://puck.nether.net/pipermail/cisco-nsp/ > > > ___ > cisco-nsp mailing list cisco-nsp@puck.nether.net > https://puck.nether.net/mailman/listinfo/cisco-nsp > archive at http://puck.nether.net/pipermail/cisco-nsp/ ___ cisco-nsp mailing list cisco-nsp@puck.nether.net https://puck.nether.net/mailman/listinfo/cisco-nsp archive at http://puck.nether.net/pipermail/cisco-nsp/
Re: [c-nsp] ASR1002 -- interface stops passing IPv4 traffic?
Hi John, This sounds like it may be an input queue wedge on the interface, which is only fixed with a reload. I've seen CVE-2016-1478 / CSCva35619 hit a few people. If you're running an affected version you'll need to upgrade or workaround. To verify, check queue size in "show interface" output - if it's a queue wedge you can issue "show buffers old [dump]" to confirm it is indeed the NTP bug causing your issues. https://bst.cloudapps.cisco.com/bugsearch/bug/CSCva35619 Cheers, Paul On 19 May 2017 at 05:43, John Osmonwrote: > I've never found an IOS device I couldn't tame with the help of Usenet > and then google. However, I'm new to the ASR1000 and IOS-XE, and I'm > running into something I've never seen before. > > I've got GigE ports that will pass traffic, and then suddenly stop. > The interface still shows up/up, but you can't even ping the local > interface from the router itself. > > We've can restore traffic by moving the config to another port, but the > "dead" port stays dead. We've tried shut/no shut, new SFPs, and new > configs -- but the port still won't work. > > Interestingly, the port *DOES* work with IPv6 -- but not IPv4. This > router doesn't use IPv6, but when I put an address on the interface, it > is pingable. > > If you apply an IPv4 /24 to the dead interface, the routing table shows > the /24 as a "connected" network, and shows a "local" /32 for the > address in use -- but is not pingable. > > The only thing we've found in common between the ports is that they > were connected to eBGP peers. We've had three events, on ports > connected to two different providers. > > My next step is to get to the colo and move one of the "dead" ports to > a spanned port switch and start sniffing the line. > > Any suggestions would be appreciated. Hardware in use includes: >ASR1000-ESP10 >ASR1002-RP1 >SPA-8X1GE-V2 > > Problem has occurred in both built-in and SPA-8X1GE-V2 ports, with > multi-mode, and GE-T transceivers. > > ___ > cisco-nsp mailing list cisco-nsp@puck.nether.net > https://puck.nether.net/mailman/listinfo/cisco-nsp > archive at http://puck.nether.net/pipermail/cisco-nsp/ > ___ cisco-nsp mailing list cisco-nsp@puck.nether.net https://puck.nether.net/mailman/listinfo/cisco-nsp archive at http://puck.nether.net/pipermail/cisco-nsp/
Re: [c-nsp] ASR1002 -- interface stops passing IPv4 traffic?
Hi John, Unfortunately we have seen several examples of this on the ASR9k platform with partial NP microcode lockups. Almost every case relates to us using more than 2 or 3 'features' per physical interface (like Sampled Netflow + Mirror First 64 bytes + Sub Interfaces + CBQoS). These may not apply to you, but after countless TAC cases and following various leads I can confirm there are many different new and exciting ways to break it with each IOS release. We also had examples of breakage in only IPv4 or only IPv6, or only sub-interfaces... and so on. In all cases, the rest of the platform (forwarding, routing and in some cases even hardware BFD) continued to believe the port was up/up. In fact, in several cases even the port mirroring itself continued to work - but none of the traffic on the mirror was actually being forwarded. All of our examples were repeatable with the correct combination of features. For other readers, all of these issues were in the 5.3.3 train and at the very beginning of the 5.3.4 train (mostly fixed in 5.3.4 SMUs). So, purely on experience with the 9k platform (XR obviously), this sounds like you are hitting a bug to me. TAC may be your best option here, or start stripping out features until it stops doing it, then add them back in the same order you removed them (which is what we did, as TAC was taking too long). Not a massive amount of help, but thought I'd share! Cheers, Robert Williams Custodian Data Centre Email: rob...@custodiandc.com http://www.CustodianDC.com -Original Message- From: cisco-nsp [mailto:cisco-nsp-boun...@puck.nether.net] On Behalf Of John Osmon Sent: 19 May 2017 05:44 To: Cisco Network Service ProvidersSubject: [c-nsp] ASR1002 -- interface stops passing IPv4 traffic? I've never found an IOS device I couldn't tame with the help of Usenet and then google. However, I'm new to the ASR1000 and IOS-XE, and I'm running into something I've never seen before. I've got GigE ports that will pass traffic, and then suddenly stop. The interface still shows up/up, but you can't even ping the local interface from the router itself. We've can restore traffic by moving the config to another port, but the "dead" port stays dead. We've tried shut/no shut, new SFPs, and new configs -- but the port still won't work. Interestingly, the port *DOES* work with IPv6 -- but not IPv4. This router doesn't use IPv6, but when I put an address on the interface, it is pingable. If you apply an IPv4 /24 to the dead interface, the routing table shows the /24 as a "connected" network, and shows a "local" /32 for the address in use -- but is not pingable. The only thing we've found in common between the ports is that they were connected to eBGP peers. We've had three events, on ports connected to two different providers. My next step is to get to the colo and move one of the "dead" ports to a spanned port switch and start sniffing the line. Any suggestions would be appreciated. Hardware in use includes: ASR1000-ESP10 ASR1002-RP1 SPA-8X1GE-V2 Problem has occurred in both built-in and SPA-8X1GE-V2 ports, with multi-mode, and GE-T transceivers. ___ cisco-nsp mailing list cisco-nsp@puck.nether.net https://puck.nether.net/mailman/listinfo/cisco-nsp archive at http://puck.nether.net/pipermail/cisco-nsp/ ___ cisco-nsp mailing list cisco-nsp@puck.nether.net https://puck.nether.net/mailman/listinfo/cisco-nsp archive at http://puck.nether.net/pipermail/cisco-nsp/