Launchpad has imported 14 comments from the remote bug at
https://bugzilla.redhat.com/show_bug.cgi?id=652210.

If you reply to an imported comment from within Launchpad, your comment
will be sent to the remote bug automatically. Read more about
Launchpad's inter-bugtracker facilities at
https://help.launchpad.net/InterBugTracking.

------------------------------------------------------------------------
On 2010-11-11T11:42:05+00:00 Hangbin wrote:

Description of problem:

while testing bz645284 on hp-dl2x170g6-01.rhts.eng.bos.redhat.com ,
change the igb max_vfs number ,but have a wrong reaction

Version-Release number of selected component (if applicable):

[root@hp-dl2x170g6-01 ~]# uname -r
2.6.18-227.el5

[root@hp-dl2x170g6-01 ~]# lspci | grep 82576
05:00.0 Ethernet controller: Intel Corporation 82576 Gigabit Network Connection
(rev 01)
05:00.1 Ethernet controller: Intel Corporation 82576 Gigabit Network Connection
(rev 01)

[root@hp-dl2x170g6-01 ~]# rpm -qa | grep kvm
etherboot-zroms-kvm-5.4.4-13.el5
kvm-qemu-img-83-164.el5
kvm-tools-83-164.el5
kvm-83-164.el5
etherboot-roms-kvm-5.4.4-13.el5
kmod-kvm-83-164.el5


How reproducible:

100%

Steps to Reproduce:
1.modprobe -r igb;modprobe igb max_vfs=10
2.
3.
  
Actual results:

[root@hp-dl2x170g6-01 ~]# modprobe -r igb;modprobe igb max_vfs=10
dca service started, version 1.8
PCI: Enabling device 0000:05:00.0 (0000 -> 0002)
PCI: Enabling device 0000:05:00.1 (0000 -> 0002)


Expected results:

host reboot in loop (kernel 2.6.18-227.el5)
or  generate igb vf message (other RHEL5.5 kernel)

Additional info:

with kernel 2.6.18-194.el5 on this system have the same reaction

Reply at:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/790201/comments/0

------------------------------------------------------------------------
On 2010-11-15T09:47:32+00:00 Stefan wrote:

The system hp-dl2x170g6-01.rhts.eng.bos.redhat.com is not SR-IOV capable. 
Booting the system with RHEL6 revealed:
igb 0000:05:00.1: not enough MMIO resources for SR-IOV

However the message did not appear on RHEL5. Will investigate why this
is not shown on RHEL5.

Reply at:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/790201/comments/1

------------------------------------------------------------------------
On 2010-11-25T12:12:47+00:00 Stefan wrote:

I took some time to dig into this and the reason why there's no message
displayed about MMIO resources is that the extended PCI config space is
not accessible. Thus the SRIOV capability register are not accessible.

pci_cfg_space_size() already fails to recognize the extended PCI config space.
The exact point where things go wrong is
if (pci_read_config_dword(dev, 256, &status) != PCIBIOS_SUCCESSFUL)
      goto fail;
[...]
 fail:  
        return PCI_CFG_SPACE_SIZE;

Prarit, any idea why this doesn't work?

Reply at:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/790201/comments/2

------------------------------------------------------------------------
On 2010-11-29T13:01:25+00:00 Prarit wrote:

(In reply to comment #3)
> I took some time to dig into this and the reason why there's no message
> displayed about MMIO resources is that the extended PCI config space is not
> accessible. Thus the SRIOV capability register are not accessible.
> 
> pci_cfg_space_size() already fails to recognize the extended PCI config space.
> The exact point where things go wrong is
> if (pci_read_config_dword(dev, 256, &status) != PCIBIOS_SUCCESSFUL)
>       goto fail;
> [...]
>  fail:  
>         return PCI_CFG_SPACE_SIZE;
> 
> Prarit, any idea why this doesn't work?

No idea :/ ... but ... ddd, wasn't there something in the errata about
this?

P.

Reply at:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/790201/comments/3

------------------------------------------------------------------------
On 2010-12-02T12:16:09+00:00 Stefan wrote:

ping ddutile

Reply at:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/790201/comments/4

------------------------------------------------------------------------
On 2010-12-09T22:00:38+00:00 Don wrote:

Stefan's assessment of the problem is correct:
(a) the BIOS is not reserving enough space in the PCI bridge mmap-resources to 
configure the VF devices
(b) Linux's PCI configuration sw isn't savvy enough to re-allocate 
mmap-resources to resolve this issue (and not cause regressions in other 
machines).
(although another bz claims some patches may fix it; these same patches have 
caused regressions in other machines.)

The BIOS must be 'SRIOV savvy'  meaning, it must scan the (architected)
PCI cap structures of devices looking for SRIOV caps, and ensure the PCI
bridges leading to those devices have sufficient memory resources for
the PF & VF devices below it.

btw: I hope VTd was enabled in the BIOS, and the kernel command line had
something like intel_iommu=on or amd_iommu=on (Q: is the machine an
Intel or AMD machine, and what is it's chipset? ... an lspci -vvv dump
would be helpful confirmation.)

Reply at:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/790201/comments/5

------------------------------------------------------------------------
On 2010-12-10T09:38:54+00:00 Stefan wrote:

The question I'm currently after is, why can't RHEL5 access the extended
PCI config space on this machine (RHEL6 can). See comment #3.

This should be independent of any *iommu=on or BIOS option.

Q: is the machine an Intel or AMD machine, and what is it's chipset?
A: Intel machine with 55x0 chipset

Reply at:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/790201/comments/6

------------------------------------------------------------------------
On 2010-12-10T09:40:23+00:00 Stefan wrote:

Created attachment 467936
lspci-vvv.txt

lspci -vvv on RHEL5

Reply at:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/790201/comments/7

------------------------------------------------------------------------
On 2010-12-10T15:05:29+00:00 Don wrote:

(In reply to comment #9)
> The question I'm currently after is, why can't RHEL5 access the extended PCI
> config space on this machine (RHEL6 can). See comment #3.
> 
> This should be independent of any *iommu=on or BIOS option.
> 
> Q: is the machine an Intel or AMD machine, and what is it's chipset?
> A: Intel machine with 55x0 chipset

I have _not_ seen a system where RHEL6 can see more of config space then
RHEL5 wrt SRIOV caps, so something else is amiss.

If this was tested with a *recent* rhel5.5 installation, and SELinux is enabled,
there were SELinux access (default policy) problems which maybe the culprit.
An update to the nightly RHEL5 repo may resolve it.... but....

Comparing the pci_cfg_space_size() code btwn rhel5 & rhel6, it appears
rhel6 filters this access a bit further dependent on the type of device
it is checking.  So, it's possible if yet-another-BIOS bug doesn't properly 
enable pci-mmconf for the particular device &/or PCI bridge, RHEL5 will be
limited and RHEL6 will work around it.

Given this is the first failure I've seen like this (btwn RHEL5 & RHEL6 
dev-assignment) and the first on this HP system,  I (still) tend to believe
a BIOS fix/upgrade would resolve this issue.

Reply at:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/790201/comments/8

------------------------------------------------------------------------
On 2010-12-16T14:15:23+00:00 Stefan wrote:

So I'll check for a new BIOS. Anyway I don't think it's SElinux related
because that's one of the first thing I disable.

Reply at:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/790201/comments/9

------------------------------------------------------------------------
On 2011-02-12T14:14:47+00:00 Tony wrote:

I am planning to be on-site in Westford on Tuesday.
I will update the BIOS for this system then.

Reply at:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/790201/comments/10

------------------------------------------------------------------------
On 2011-02-21T15:42:54+00:00 Don wrote:

Not sure why the sriov messages differ on boot-up, but here's the culprit,
besides not enough MMIO space:

This is the PCI bridge above the 82579's (note the sec/sub-bus num's of
05):

00:05.0 PCI bridge: Intel Corporation 5520/X58 I/O Hub PCI Express Root Port 5 
(rev 13) (prog-if 00 [Normal decode])
        Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- 
Stepping- SERR+ FastB2B- DisINTx+
        Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- 
<MAbort- >SERR- <PERR- INTx-
        Latency: 0, Cache Line Size: 64 bytes
        Bus: primary=00, secondary=05, subordinate=05, sec-latency=0
        I/O behind bridge: 0000e000-0000efff
        Memory behind bridge: fbe00000-fbefffff
        Prefetchable memory behind bridge: 00000000fff00000-00000000000fffff
        Secondary status: 66MHz- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- 
<MAbort- <SERR- <PERR-
        BridgeCtl: Parity- SERR+ NoISA- VGA- MAbort- >Reset- FastB2B-
                PriDiscTmr- SecDiscTmr- DiscTmrStat- DiscTmrSERREn-
        Capabilities: [40] Subsystem: Hewlett-Packard Company Device 330b
        Capabilities: [60] MSI: Enable+ Count=1/2 Maskable+ 64bit-
                Address: fee10000  Data: 40c9
                Masking: 00000002  Pending: 00000000
        Capabilities: [90] Express (v2) Root Port (Slot+), MSI 00
                DevCap: MaxPayload 256 bytes, PhantFunc 0, Latency L0s <64ns, 
L1 <1us
                        ExtTag+ RBE+ FLReset-
                DevCtl: Report errors: Correctable- Non-Fatal- Fatal- 
Unsupported-
                        RlxdOrd- ExtTag- PhantFunc- AuxPwr- NoSnoop-
                        MaxPayload 256 bytes, MaxReadReq 128 bytes
                DevSta: CorrErr- UncorrErr- FatalErr- UnsuppReq- AuxPwr- 
TransPend-
                LnkCap: Port #0, Speed 5GT/s, Width x4, ASPM L0s L1, Latency L0 
<512ns, L1 <64us
                        ClockPM- Surprise+ LLActRep+ BwNot+
                LnkCtl: ASPM Disabled; RCB 64 bytes Disabled- Retrain- CommClk-
                        ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt-
                LnkSta: Speed 2.5GT/s, Width x4, TrErr- Train- SlotClk+ 
DLActive+ BWMgmt- ABWMgmt-
                SltCap: AttnBtn- PwrCtrl- MRL- AttnInd- PwrInd- HotPlug- 
Surprise-
                        Slot #0, PowerLimit 0.000W; Interlock- NoCompl-
                SltCtl: Enable: AttnBtn- PwrFlt- MRL- PresDet- CmdCplt- HPIrq- 
LinkChg-
                        Control: AttnInd Off, PwrInd Off, Power- Interlock-
                SltSta: Status: AttnBtn- PowerFlt- MRL- CmdCplt- PresDet+ 
Interlock-
                        Changed: MRL- PresDet+ LinkState+
                RootCtl: ErrCorrectable- ErrNon-Fatal- ErrFatal- PMEIntEna- 
CRSVisible+
                RootCap: CRSVisible-
                RootSta: PME ReqID 0000, PMEStatus- PMEPending-
                DevCap2: Completion Timeout: Range BCD, TimeoutDis+ ARIFwd+
                DevCtl2: Completion Timeout: 260ms to 900ms, TimeoutDis- ARIFwd-
                                                                         ^^^^^

Without ARIFwd enabled on this bridge (by the BIOS), then the 82576 VFs cannot
be accessed (use BDF's beside PF's 05.00.x, which requires ARI).

We've seen this recently on a couple systems.  Sometimes it's alleviated by
trying a different slot in the machine.  If not, it once again points to a BIOS 
that is not SRIOV savvy.

Reply at:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/790201/comments/11

------------------------------------------------------------------------
On 2011-02-22T15:56:28+00:00 Tony wrote:

Didn't have time to update the BIOS last Tuesday.

Maybe later this week ...

Reply at:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/790201/comments/12

------------------------------------------------------------------------
On 2011-03-02T15:44:27+00:00 Stefan wrote:

With an updated BIOS I still couldn't get SR-IOV to work. However I'm getting
igb 0000:05:00.0: not enough MMIO resources for SR-IOV
on RHEL5 now with 2.6.18-246.el5.

That's probably all we can do, having a message saying "sorry no SR-IOV for 
you".
Case closed, imho. :)

Reply at:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/790201/comments/13


** Changed in: linux (Fedora)
       Status: Unknown => Invalid

** Changed in: linux (Fedora)
   Importance: Unknown => Medium

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/790201

Title:
  Cannot use SR-IOV feature of Intel 82576 network card

Status in linux package in Ubuntu:
  Incomplete
Status in linux package in Fedora:
  Invalid

Bug description:
  Hardware is Supermicro B8DTT (TwinBlade), running Ubuntu 10.04.2
  (amd64).

  Steps I've taken:

  - read 
http://docs.fedoraproject.org/en-US/Fedora/13/html/Virtualization_Guide/sect-Para-virtualized_Windows_Drivers_Guide-How_SR_IOV_Libvirt_Works.html
  - enabled VT-d in the BIOS
  - added "options igb max_vfs=7" to /etc/modprobe.d/local.conf, then ran 
"update-initramfs -k all -t -u"
  - rebooted

  dmesg quote after reboot:

  [    5.573742] igb 0000:05:00.0: not enough MMIO resources for SR-IOV
  [    5.666170] igb 0000:05:00.1: not enough MMIO resources for SR-IOV

  There is a RH bug report with patches at
  https://bugzilla.redhat.com/show_bug.cgi?id=523341

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/790201/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to     : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp

Reply via email to