Hi,
I am pleased to announce that the OFED-1.5.1 GA release is done.
Notes:
The tarball is available on:
http://www.openfabrics.org/builds/ofed-1.5.1/release/OFED-1.5.1.tgz
To get the BUILD_ID, run ofed_info.
Please report any issues with OFED 1.5.1 in Bugzilla:
https://bugs.openfabrics.org/
Vladimir & Tziporet
========================================================================
Release information:
--------------------
Linux Operating Systems:
- RedHat EL4 up7 2.6.9-78.ELsmp
- RedHat EL4 up8 2.6.9-89.ELsmp
- RedHat EL5 up3 2.6.18-128.el5
- RedHat EL5 up4 2.6.18-164.el5
- SLES10 SP2 2.6.16.60-0.21-smp
- SLES10 SP3 2.6.16.60-0.54-smp
- SLES11 2.6.27.19-5-default
- OEL 4 up7 2.6.9-78.ELsmp
- OEL 4 up8 2.6.9-89.ELsmp
- CentOS5.3 2.6.18-128.el5
- CentOS5.4 2.6.18-164.el5
- Fedora Core12 2.6.31.5-127.fc12 *
- OpenSuSE 11.2 2.6.31.5-0.1-default *
- kernel.org 2.6.29, 2.6.30,
2.6.31 and 2.6.32 *
* Minimal QA for these versions
Systems:
* x86_64
* x86
* ia64
* ppc64
Main Changes from OFED 1.5
============================
1. Added RoCEE support - see RoCEE_README.txt
2. Added enhanced atomic operations to ConnectX (kernel only).
See mlx4_release_notes.txt.
3. Updated Open MPI to rev 1.4.1-2ofed
4. Updated MVAPICH2 to rev 1.4.1
5. Updated DAPL to rev 2.0.27
6. Updated libnes to rev 1.0.1
7. Updated librdmacm to rev 1.0.11
8. Removed tvflash RPM
9. NFS-RDMA is not supported on SLES10 SP3
10. Fixed IPv6 support and IPv4 routing corner cases for RDMA CM
11. Bug fixes
See attached.
bug_id,"bug_severity","priority","op_sys","assigned_to","bug_status","resolution","short_short_desc"
138,"normal","P2","All","[email protected]","RESOLVED","FIXED","getpeername, after other side closes, fails, which is not the behavior of TCP"
592,"normal","P3","Other","[email protected]","RESOLVED","FIXED","libsdp memory leak"
668,"critical","P2","All","[email protected]","RESOLVED","FIXED","iPath SMA does not generate traps"
779,"normal","P3","Other","[email protected]","RESOLVED","FIXED","sdp server accept: BUG: scheduling while atomic: ib_cm..."
828,"normal","P3","RHEL 5","[email protected]","RESOLVED","FIXED","SDP accept() fails with CONFIG_PREEMPT kernel"
833,"normal","P3","Other","[email protected]","RESOLVED","FIXED","IPoib hangout while running sdp with multiple"
838,"normal","P3","RHEL 5","[email protected]","RESOLVED","FIXED","connection refuse"
894,"major","P2","SLES 10","[email protected]","RESOLVED","FIXED","IPoIB connectivity lost during heavy testing on memfree"
912,"normal","P3","RHEL 4","[email protected]","RESOLVED","FIXED","When remote side is not accsessible and socket based application is running over SDP reference count is high"
969,"normal","P3","RHEL 4","[email protected]","RESOLVED","FIXED","HTTP over SDP with 200 connection cause to kernel panic in client side"
977,"normal","P3","All","[email protected]","RESOLVED","FIXED","binding 2 sockets to the same address fails with the wrong errno"
998,"normal","P3","Other","[email protected]","RESOLVED","FIXED","intermittent SDP BUG on ppc64"
1087,"minor","P5","SLES 10","[email protected]","RESOLVED","FIXED","recovery from rdma_create_qp() is bad"
1242,"normal","P2","RHEL 4","[email protected]","RESOLVED","FIXED","kernel panic while running mpi2007 against ofed1.4 -- ib_ipath: ipath_sdma_verbs_send"
1310,"normal","P3","Other","[email protected]","RESOLVED","FIXED","stress_connect crash sometimes on SW220/SW221"
1334,"normal","P3","Other","[email protected]","RESOLVED","FIXED","possible lock ordering issue"
1393,"minor","P3","SLES 10","[email protected]","RESOLVED","FIXED","Dmesg errors after running rds-gen/sink tests"
1397,"normal","P3","SLES 10","[email protected]","RESOLVED","FIXED","Egle SDR agains Falcon QDR don't run with new mvapich-1.1.0"
1427,"normal","P3","All","[email protected]","RESOLVED","FIXED","running netperf on ppc64, results in preload error and causes the client machine to hang"
1440,"blocker","P3","All","[email protected]","RESOLVED","FIXED","mstvpd hangs on QDR HCAs"
1445,"normal","P3","All","[email protected]","RESOLVED","FIXED","removing a test module which is using the kernel socket api over sdp, causes the machine to hang"
1453,"minor","P3","Other","[email protected]","RESOLVED","FIXED","RDS use of QPD_SQD state causes problems for ConnectX HCAs"
1502,"normal","P3","Other","[email protected]","RESOLVED","FIXED","2.6.16.46-0.12-SLERT-10-15: scheduling while atomic"
1519,"normal","P3","Other","[email protected]","RESOLVED","FIXED","RDS may be doing to much at interrupt level"
1552,"normal","P3","RHEL 4","[email protected]","RESOLVED","FIXED","spurious read events on RDS socket"
1590,"normal","P3","Other","[email protected]","RESOLVED","FIXED","Unnecessary sock_hold in sdp_reset_sk()"
1612,"normal","P3","RHEL 5","[email protected]","RESOLVED","FIXED","killing stress_connect test results in Kernel BUG on ppc64 machine"
1682,"normal","P3","Other","[email protected]","RESOLVED","FIXED","Low performance on ofed-1.5"
1714,"normal","P3","RHEL 5","[email protected]","RESOLVED","FIXED","RHEL5.3 host shows traceback after the nfs-rdma server is restarted"
1716,"normal","P3","RHEL 5","[email protected]","RESOLVED","FIXED","Poor performance with RDS 1.4.1 or 1.4.2"
1719,"normal","P3","Other","[email protected]","RESOLVED","FIXED","Reconnect path in Linux client can loop indefinitely"
1742,"normal","P3","RHEL 5","[email protected]","RESOLVED","FIXED","Interrupt balancing across CPUs does not work with RDS 1.4.2"
1761,"normal","P3","RHEL 5","[email protected]","RESOLVED","FIXED","RDS traffic stalls with 1.4.2 when there is congestion"
1774,"critical","P3","SLES 11","[email protected]","VERIFIED","FIXED","Installing OFED1.5 RC1 on SLES11 on machine with OFED-distro does not remove ofed-kmp"
1800,"critical","P3","RHEL 5","[email protected]","VERIFIED","FIXED","iperf sdp on ppc cause to client machine to dead lock"
1808,"normal","P3","RHEL 5","[email protected]","VERIFIED","FIXED","Bonding: Error in network restart output after configure consistent configuration of bonding"
1821,"major","P3","RHEL 5","[email protected]","RESOLVED","FIXED","Crash in bonding"
1826,"critical","P3","Other","[email protected]","VERIFIED","FIXED","Iperf sdp with more than 50 connections cause to call trace on client side (RH5 up4)"
1828,"normal","P3","All","[email protected]","RESOLVED","FIXED","ibv_query_port man page should be updated"
1840,"critical","P1","All","[email protected]","RESOLVED","FIXED","Some NFS large transfers stall"
1851,"major","P3","CentOS 5","[email protected]","RESOLVED","FIXED","Crash when running fstress with a large number of threads"
1859,"normal","P2","All","[email protected]","CLOSED","FIXED","IPOIB Can miss a change in dgid"
1865,"normal","P1","Other","[email protected]","RESOLVED","FIXED","Mixed port configuration does not work."
1866,"normal","P3","Other","[email protected]","RESOLVED","FIXED","rping oops in mixed configuration"
1873,"major","P3","RHEL 5","[email protected]","RESOLVED","FIXED","IPoIB errors found in log"
1878,"normal","P3","Other","[email protected]","RESOLVED","FIXED","SDP 1.5 - small packets multi stream gives bad performance"
1879,"normal","P3","Other","[email protected]","RESOLVED","FIXED","MSG_PEEK is not spported on ZCopy"
1887,"normal","P3","All","[email protected]","RESOLVED","FIXED","qperf doesn't support operation between DDR and QDR servers"
1895,"blocker","P2","All","[email protected]","RESOLVED","FIXED","sdp throws up "" sdp_alloc_fmr "" error while running qperf tests"
1897,"normal","P3","All","[email protected]","RESOLVED","FIXED","Send data failed in both directions, at the same time, by Zcopy."
1899,"major","P3","RHEL 4","[email protected]","RESOLVED","FIXED","Getting timer related warnings when running sdp tests on RHEL 4.8"
1900,"normal","P3","All","[email protected]","RESOLVED","FIXED","The example test Ibv_srq_pingpong always fail when run over IB link"
1908,"major","P3","RHEL 5","[email protected]","RESOLVED","FIXED","Running ""rds-tools/rds-gen -l 4096"" over RoCEE results in kernel panic"
1910,"major","P3","All","[email protected]","RESOLVED","FIXED","Running ucmatose over RoCEE results in Kernel panic"
1912,"blocker","P1","SLES 11","[email protected]","RESOLVED","FIXED","SDP doesn't work on PPC64 SLES11"
1915,"normal","P3","RHEL 5","[email protected]","RESOLVED","FIXED","Bonding: Removing vlan of bond1 (eth) cause to kernel panic"
1917,"critical","P3","RHEL 5","[email protected]","RESOLVED","FIXED","Runing iperf and netperf over SDP for long time cause to call trace (on kernel 2.6.24)"
1918,"blocker","P1","All","[email protected]","RESOLVED","FIXED","RDMA-CM allows binds to 127.0.0.1 which kills openmpi over iwarp"
1920,"critical","P3","All","[email protected]","RESOLVED","FIXED","OFED_1_5_1 doesn't support Drexler's device IDs"
1924,"major","P1","RHEL 4","[email protected]","RESOLVED","FIXED","Open mpi test over RoCEE cause kernel oops in cma_resolve_rocee_route"
1926,"normal","P3","RHEL 4","[email protected]","RESOLVED","FIXED","multicast leakage when unloading ib_ipoib"
1945,"minor","P3","SLES 10","[email protected]","RESOLVED","FIXED","Interactive installer generates faulty ofed.conf"
1951,"critical","P3","All","[email protected]","RESOLVED","FIXED","Opensm fails to move the port to Active (remains Armed)"
1955,"normal","P1","RHEL 5","[email protected]","RESOLVED","FIXED","SDP connection reset"
1959,"critical","P3","RHEL 5","[email protected]","RESOLVED","FIXED","[OFED-1.5.1 -NFSoverRDMA] - NFSoverRDMA client hits kernel panic while running ""iozone test "" on NFS client along with ""interface toggle test "" on NFS server continously in a loop"
1962,"blocker","P1","Other","[email protected]","RESOLVED","FIXED","nfsrdma cause regular NFS mounts to hang (OFED-1.5.1-20100223-0740)"
1964,"blocker","P3","All","[email protected]","RESOLVED","FIXED","cxgb3 fails openmpi branding"
1965,"major","P3","RHEL 5","[email protected]","RESOLVED","FIXED","Bonding mlx4_en: ping does not resume after failover between 10G ports"
1978,"major","P1","RHEL 5","[email protected]","RESOLVED","FIXED","Kernel Panic when unloading ib_srp"
1988,"major","P2","All","[email protected]","RESOLVED","FIXED","[OFED-1.5.1-rc4] - Softirq seen on one of the nodes after ~48 hours while running 2 instances each of IMB-MPI1+ OSU+Presta test together on 64 bit platforms"
1989,"critical","P2","Other","[email protected]","RESOLVED","FIXED","[REG]OFED-1.5.1 : OFED-1.5.1-20100317-0806 installtion fails while building ibutils RPM on RHEL4U8 32 bit"
1995,"critical","P1","Other","[email protected]","RESOLVED","FIXED","[REG]OFED-1.5.1 : OFED-1.5.1-20100318-0600 installtion fails on 2.6.30_32 bit kernel while uninstalling previous version of OFED."
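The bug list above is a Bugzilla CSV export, so it can be summarized with a few lines of script. The following is a minimal sketch, not part of the OFED tooling: it assumes the export is available as text, uses three rows copied from the list as sample data, and the function name severity_counts is made up for illustration.

```python
import csv
import io
from collections import Counter

# Three rows copied from the list above; a real run would read the whole export.
SAMPLE = '''bug_id,"bug_severity","priority","op_sys","assigned_to","bug_status","resolution","short_short_desc"
592,"normal","P3","Other","[email protected]","RESOLVED","FIXED","libsdp memory leak"
1440,"blocker","P3","All","[email protected]","RESOLVED","FIXED","mstvpd hangs on QDR HCAs"
1895,"blocker","P2","All","[email protected]","RESOLVED","FIXED","sdp throws up "" sdp_alloc_fmr "" error while running qperf tests"
'''


def severity_counts(text):
    """Count bugs per value of the bug_severity column (hypothetical helper)."""
    reader = csv.DictReader(io.StringIO(text))
    return Counter(row["bug_severity"] for row in reader)


if __name__ == "__main__":
    for severity, count in severity_counts(SAMPLE).most_common():
        print(severity, count)
```

The csv module also handles the doubled quotes ("") that the export uses inside descriptions, which a plain split on commas would not.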
Open Fabrics Enterprise Distribution (OFED)
Version 1.5.1
Release Notes
March 2010
===============================================================================
Table of Contents
===============================================================================
1. Overview, which includes:
- OFED Distribution Rev 1.5.1 Contents
- Supported Platforms and Operating Systems
- Supported HCA and RNIC Adapter Cards and Firmware Versions
- Tested Switch Platforms
- Third party Test Packages
- OFED sources
2. Main Changes from OFED 1.4.2
3. Main Changes from OFED 1.5
4. Known Issues
===============================================================================
1. Overview
===============================================================================
These are the release notes of OpenFabrics Enterprise Distribution (OFED)
release 1.5.1. The OFED software package is composed of several software
modules and is intended for use on a computer cluster constructed as an
InfiniBand subnet or iWARP network.
Note: If you plan to upgrade the OFED package on your cluster, please upgrade
all of its nodes to this new version.
1.1 OFED 1.5.1 Contents
-----------------------
The OFED package contains the following components:
- OpenFabrics core and ULPs:
- IB HCA drivers (mthca, mlx4, qib, ehca)
- iWARP RNIC driver (cxgb3, nes)
- core
- Upper Layer Protocols: IPoIB, SDP, SRP Initiator and target, iSER
Initiator and target, RDS, uDAPL, qlgc_vnic and NFS-RDMA.
- OpenFabrics utilities:
- OpenSM (OSM): InfiniBand Subnet Manager
- Diagnostic tools
- Performance tests
- MPI:
- OSU MPI stack supporting the InfiniBand and iWARP interface
- Open MPI stack supporting the InfiniBand and iWARP interface
- OSU MVAPICH2 stack supporting the InfiniBand and iWARP interface
- MPI benchmark tests (OSU benchmarks, Intel MPI benchmarks, Presta)
- Extra packages:
- open-iscsi: open-iscsi initiator with iSER support
- ib-bonding: Bonding driver for IPoIB interface
- Sources of all software modules (under conditions mentioned in the modules'
LICENSE files)
- Documentation
Notes:
1. iSER Target and NFS-RDMA are of Beta quality.
2. All other OFED components are of production quality.
3. See release notes for each package in the docs directory.
4. Any Topspin copyright belongs to Cisco Systems, Inc.
1.2 Supported Platforms and Operating Systems
---------------------------------------------
o CPU architectures:
- x86_64
- x86
- ppc64
- ia64
o Linux Operating Systems:
- RedHat EL4 up7 2.6.9-78.ELsmp
- RedHat EL4 up8 2.6.9-89.ELsmp
- RedHat EL5 up3 2.6.18-128.el5
- RedHat EL5 up4 2.6.18-164.el5
- RedHat EL5 up5 (beta) 2.6.18-186.el5
- SLES10 SP2 2.6.16.60-0.21-smp
- SLES10 SP3 2.6.16.60-0.54-smp
- SLES11 2.6.27.19-5-default
- OEL 4 up7 2.6.9-78.ELsmp
- OEL 4 up8 2.6.9-89.ELsmp
- CentOS5.3 2.6.18-128.el5
- CentOS5.4 2.6.18-164.el5
- Fedora Core12 2.6.31.5-127.fc12 *
- OpenSuSE 11.2 2.6.31.5-0.1-default *
- kernel.org 2.6.29, 2.6.30,
2.6.31 and 2.6.32 *
* Minimal QA for these versions
1.3 HCAs and RNICs Supported
----------------------------
This release supports IB HCAs by Mellanox Technologies, Qlogic and IBM as
well as iWARP RNICs by Chelsio Communications and Intel.
o Mellanox Technologies HCAs (SDR, DDR and QDR Modes are Supported):
- InfiniHost (fw-23108 Rev 3.5.000)
- InfiniHost III Ex (MemFree: fw-25218 Rev 5.3.000
with memory: fw-25208 Rev 4.8.200)
- InfiniHost III Lx (fw-25204 Rev 1.2.000)
- ConnectX IB (fw-25408 Rev 2.7.000)
For official firmware versions please see:
http://www.mellanox.com/content/pages.php?pg=firmware_download
o QLogic HCAs:
- QHT7140 QLogic InfiniPath SDR HTX HCA
- QLE7140 QLogic InfiniPath SDR PCIe HCA
- QLE7240 QLogic InfiniPath DDR x8 PCIe HCA
- QLE7280 QLogic InfiniPath DDR x16 PCIe HCA
o IBM HCAs:
- GX Dual-port SDR 4x IB HCA
- GX Dual-port SDR 12x IB HCA
- GX Dual-port DDR 4x IB HCA
- GX Dual-port DDR 12x IB HCA
o Chelsio RNICs:
- S310/S320 10GbE Storage Accelerators
- R310/R320 10GbE iWARP Adapters
o Intel RNICs:
- NE020 10Gb iWARP Adapter
1.4 Switches Supported
----------------------
This release was tested with switches and gateways provided by the following
companies:
- Voltaire
- Qlogic
- Flextronics
- Sun
- Mellanox
1.5 Third Party Packages
------------------------
The following third party packages have been tested with OFED 1.5.1:
- Intel MPI, Version 3.2.2
- Intel MPI, Version 4.0 beta
1.6 OFED Sources
----------------
All sources are located under git://git.openfabrics.org/
Kernel sources: git://git.openfabrics.org/ofed_1_5/linux-2.6.git ofed_kernel_1_5
User-level sources are downloaded from http://www.openfabrics.org/downloads/
at the locations recorded in the BUILD_ID.
The kernel sources are based on Linux 2.6.30 mainline kernel. Its patches
are included in the OFED sources directory.
For details see HOWTO.build_ofed.
===============================================================================
2. Main Changes from OFED 1.4.2
===============================================================================
Note: For details regarding the various changes, please see the release notes
for each package in the docs directory.
2.1 General changes
o Kernel code based on 2.6.30
o libraries location - all userspace libraries can be downloaded from
http://www.openfabrics.org/downloads/
See BUILD_ID for exact locations
o Qlogic moved the low level driver from ipath to qib.
2.2 SDP
o Performance improvements
o Zero copy at Beta level
2.3 uDAPL
o New UCM provider (ofa-v2-mlx4_0-1u) with IB UD-based CM per process.
It is more scalable than rdma_cm (cma) or socket cm (scm).
o Common code base with WinOF 2.1
o Bug fixes
2.4 perftest
o Renamed tests:
ib_rdma_bw -> rdma_bw
ib_rdma_lat -> rdma_lat
2.5 Management
o OpenSM
- Support for Mesh Analysis for LASH routing algorithm
- Reloadable OpenSM configuration (preliminary implementation)
- Routing paths sorted balancing (for UpDown and MinHops)
- Weighted LID matrices calculation (for UpDown, MinHop and DOR)
- I/O nodes connectivity (for FatTree)
2.6 MPI:
a. OSU MVAPICH 1.2.0
b. Open MPI 1.4
c. OSU MVAPICH2 1.4
d. MPI tests 3.2
2.7 iSER:
o Available only on kernel.org 2.6.30, 2.6.31 and 2.6.32
2.8 NFS-RDMA
o Added support for RHEL5.4, SLES10 SP3, kernel.org 2.6.25 and 2.6.30.
Kernels 2.6.26 and 2.6.27 are not supported
o NFS-RDMA is at Beta level
===============================================================================
3. Main Changes from OFED 1.5
===============================================================================
1. Added RoCEE support - see RoCEE_README.txt
2. Added enhanced atomic operations to ConnectX (kernel only).
See mlx4_release_notes.txt.
3. Updated Open MPI to rev 1.4.1-2ofed
4. Updated MVAPICH2 to rev 1.4.1
5. Updated DAPL to rev 2.0.27
6. Updated libnes to rev 1.0.1
7. Updated librdmacm to rev 1.0.11
8. Removed tvflash RPM
9. NFS-RDMA is not supported on SLES10 SP3
10. Fixed IPv6 support and IPv4 routing corner cases for RDMA CM
11. Bug fixes
===============================================================================
4. Known Issues
===============================================================================
The following is a list of general limitations and known issues of the various
components of the OFED 1.5.1 release.
1. When upgrading from an earlier OFED version, the installation script does
not stop the earlier OFED version prior to uninstalling it.
Workaround: Stop the old OFED stack (/etc/init.d/openibd stop) before
upgrading to OFED 1.5.1 or reboot the server after OFED installation.
2. Memory registration by the user is limited according to administrator
setting. See "Pinning (Locking) User Memory Pages" in OFED_tips.txt for
system configuration.
3. Fork support is available from kernel 2.6.12 onward, provided that
applications do not use threads. fork() is supported as long as the
parent process does not run before the child exits or calls exec().
The former can be achieved by calling wait(childpid), and the latter can be
achieved by application-specific means. The POSIX system() call is
supported.
4. The qib driver is supported only on 64-bit platforms.
5. When installing OFED on OpenSuSE or Ubuntu, use the --without-depcheck
option of the install.pl script.
6. IPoIB: brctl utilities do not work on IPoIB interfaces because these
utilities support only devices of type Ethernet.
7. "openibd stop" can sometimes fail with the error:
Unloading ib_cm [FAILED]
ERROR: Module ib_cm is in use by ib_ipoib
Workaround: run "openibd stop" again or remove ib_ipoib aliases from
/etc/modprobe.conf.
8. When working with iSCSI over IPoIB or mlx4_en, you must disable LRO (even
if IPoIB is set to connected mode). This is due to a bug in older kernels
which causes a kernel panic.
9. On SLES11, if the uninstall fails, check the error log and remove
the remaining RPMs manually using 'rpm -e <rpms list>'.
10. On SLES11, set the allow_unsupported_modules parameter to 1 in the file
/etc/modprobe.d/unsupported-modules. Without this, the modules will not
load.
11. iSER is supported on kernel.org 2.6.30, 2.6.31 and 2.6.32 only.
OFED-1.5 will not install iSER on other kernels, and the original iSER
module that comes with the Linux distribution will stop working due to a
mismatch in the symbols version.
12. On SLES10 SP3, the kernel 2.6.16.60-0.54.5 should be updated to
2.6.16.60-0.59.1 or later. The original kernel may cause kernel panic
during '/etc/init.d/openibd restart'.
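Known issue 3 above describes when fork() is safe: the parent must not run before the child exits or calls exec(). The pattern can be sketched as follows. This is only an illustration of the ordering under that assumption, written in Python rather than against the verbs API; the helper name run_and_wait is hypothetical, and the sketch combines both conditions by exec'ing in the child and waiting in the parent.

```python
import os
import sys


def run_and_wait(argv):
    """Fork, exec argv in the child, and block the parent until the child exits."""
    pid = os.fork()
    if pid == 0:
        # Child: replace the process image right away and do nothing else.
        try:
            os.execvp(argv[0], argv)
        finally:
            # Reached only if exec failed; exit without running parent cleanup.
            os._exit(127)
    # Parent: block here so it does not run before the child has exited.
    _, status = os.waitpid(pid, 0)
    return os.WEXITSTATUS(status) if os.WIFEXITED(status) else -1


if __name__ == "__main__":
    code = run_and_wait([sys.executable, "-c", "raise SystemExit(3)"])
    print("child exit code:", code)
```

The parent does nothing between fork() and waitpid(), which is the property the known issue asks for.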
Note: See the release notes of each component for additional issues.
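For known issue 2 above, the limit on pinned (locked) memory that applies to a process can be inspected before registering memory. The sketch below assumes that the administrator setting in question is the locked-memory resource limit (RLIMIT_MEMLOCK, the value shown by "ulimit -l"); OFED_tips.txt remains the reference for how to configure it. The helper names are illustrative only.

```python
import resource


def format_limit(value):
    """Render an rlimit value, treating RLIM_INFINITY as 'unlimited'."""
    if value == resource.RLIM_INFINITY:
        return "unlimited"
    return f"{value // 1024} KiB"


def memlock_limits():
    """Return the (soft, hard) RLIMIT_MEMLOCK values for this process, in bytes."""
    return resource.getrlimit(resource.RLIMIT_MEMLOCK)


if __name__ == "__main__":
    soft, hard = memlock_limits()
    print("max locked memory: soft =", format_limit(soft), "hard =", format_limit(hard))
```

A small soft limit here would be one reason for memory registration to fail for large buffers.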