On 2011-03-24 17:28, Vinzenz Bargsten wrote: > Am 08.03.2011 21:59, schrieb Jan Kiszka: >> On 2011-03-08 21:21, Vinzenz Bargsten wrote: >>>>>>> Smells like IRQ conflict in line 17. What devices are using it? >>>>>>> Check >>>>>>> /proc/interrupt or lspci. >>>>>>> >>>>>>> >>>>>> The real-time nic was using IRQ 16, see >>>>>> >>>>>> Feb 28 14:37:12 robot02 kernel: [ 2700.099343] rt_8139too >>>>>> 0000:03:00.0: >>>>>> PCI->APIC IRQ transform: INT A -> IRQ 16 >>>>>> You are right, IRQ 16 was shared with a usb controller. >>>>>> >>>>>> As the 2nd (non-rt) 8139 nic had it's own IRQ 17, >>>>>> I tried to use that instead and just swapped the cables. The >>>>>> problem occured, too. >>>>>> >>>>>> Then I swapped the cards in the PCI slots and the situation got >>>>>> worse. >>>>>> See list of interrupts attached (interrupts.txt) >>>>>> >>>>> Both 8139 cards are of the same type. What were you steps to ensure >>>>> that >>>>> the right driver handles the right card? >>>>> >>>> - I use the cards parameter (well, that doesn't really ensure) >>>> - the cards and cables are labeled (also with mac addresses) >>>> - If I use the wrong interface, there shouldn't be any communication >>>> since my >>>> program only uses the rt interface >>>> - one card's mac address is registered in the uni network, that's why I >>>> swapped the cards pci slot rather than just using the other one >>>> >>> The problem also occurs with only one 8139 card, which has its own >>> irq. I could >>> not reproduce the PCI Bus errors. >>> >>> I'm letting rtping running in the background, did not get that >>> problem then (so >>> far). >>> Maybe unrelated or a problem of the remote side, I see periodically >>> high ping >>> times, see attachment. >> If you have an IRQ conflict (which I still assume) but you still have >> valid IRQ reason at a sufficiently high rate, the system will >> continuously recover. Only if a large number of unhandled IRQs were >> received, Xenomai will disable the chatty line. >> >>>> I also have a tulip card available, but if I remember correctly, >>>> loading the >>>> rt_tulip module locked up the pc. I can check again, though. >>>> I can also try to remove one of the 8139 nics. >>> The driver is loaded but no communication is possible. In wireshark >>> 3-4 outgoing >>> rtping packets show up, then rtping causes a "ioctl: No buffer space >>> available". >>> Nothing is received. >> The fact that two different adapter show similar issues (IRQs are not >> properly delivered, which will exhaust buffer resources sooner or later) >> indicate, that the problem is not the adapter but likely the system with >> its IRQ routing. >> >>> Can someone recommend a PCI network card which works well with recent >>> RTNet/ >>> kernels / hardware? I thought about a Intel Pro 1000 or 100 PCI as >>> well as >>> via_rhine cards, which are also quite common. >>> >> On x86, a good way to avoid IRQ issues is to pick a recent supported >> card with MSI(-X) support. rt_igb is a known-to-work example. >> >> Jan > The setup changed in the following way: > - I do not use any 8139too card anymore. > - Instead, I use an igb card (Intel E1G42ET); to free the PCIe-x16 slot > I use a PCI graphics card. > > The problem changed in the following way: > - The communication does not stop, but I encountered a > - delayed output packet and 2 or 3 consecutive input packets are not > received (do not show up in wireshark). > I think they are on the line / sent by the remote station but it is > difficult to proove. > Actually it counts delayed (or non-answered) packets and the counter > increases to 3. > - The rtping response times are similar (maybe a bit less often), i.e. > up ~1000µs every ~100s. > Not sure if this is caused by the remote station. > > What do you recommend to resolve these issues?
- Does Xenomai's tests work flawlessly on your box? Check e.g. latency. - Does the igb use MSI-X? Check the kernel log and your kernel .config (CONFIG_PCI_MSI). - If you don't trust the peer station, use a know-to-work Linux box without RTnet for the tests. Jan
signature.asc
Description: OpenPGP digital signature
------------------------------------------------------------------------------ Enable your software for Intel(R) Active Management Technology to meet the growing manageability and security demands of your customers. Businesses are taking advantage of Intel(R) vPro (TM) technology - will your software be a part of the solution? Download the Intel(R) Manageability Checker today! http://p.sf.net/sfu/intel-dev2devmar
_______________________________________________ RTnet-users mailing list RTnet-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/rtnet-users