Hi,
OFED 1.5.2-rc2 is available

Notes:

The tarball is available on:
http://www.openfabrics.org/downloads/OFED/ofed-1.5.2/OFED-1.5.2-rc2.tgz

To get BUILD_ID run ofed_info

Please report any issues in bugzilla https://bugs.openfabrics.org/  for
OFED 1.5.2

Vladimir & Tziporet

========================================================================
Supported Platforms and Operating Systems
---------------------------------------------
    o   CPU architectures:
          - x86_64
          - x86
          - ppc64
          - ia64

    o   Linux Operating Systems:
          - RedHat EL4 up7        2.6.9-78.ELsmp
          - RedHat EL4 up8        2.6.9-89.ELsmp
          - RedHat EL5 up3        2.6.18-128.el5
          - RedHat EL5 up4        2.6.18-164.el5
          - RedHat EL5 up5        2.6.18-194.el5
          - RedHat EL6 beta       2.6.32-19.el6.x86_64 *
          - SLES10 SP2            2.6.16.60-0.21-smp
          - SLES10 SP3            2.6.16.60-0.54-smp
          - SLES11                2.6.27.19-5-default
          - SLES11 SP1 RC1        2.6.32.9-0.5-default *
          - OEL 4 up7             2.6.9-78.ELsmp
          - OEL 4 up8             2.6.9-89.ELsmp
          - CentOS5.3             2.6.18-128.el5
          - CentOS5.4             2.6.18-164.el5
          - Fedora Core12         2.6.31.5-127.fc12    *
          - OpenSuSE 11.2         2.6.31.5-0.1-default *
          - kernel.org            2.6.29, 2.6.30,
                                  2.6.31 and 2.6.32    *

        * Minimal QA for these versions

Main changes from 1.5.2-rc1:
===========================
1. Updated packages:
   libibverbs-1.1.4-0.10.g2f6624c.tar.gz
   librdmacm-1.0.12-0.1.gc9ce9d4.tar.gz
   dapl-2.0.29.tar.gz

   - MPI
   mvapich2-1.5-0.1.rc2.src.rpm

2. New features
   RAW ETH support
commit ccb1c56aa20fa7d68c916b650ec3201ab68b1cf2
Author: Amir Vadai <[email protected]>
Date:   Wed Jun 23 13:43:55 2010 +0300

    sdp: cleanup ssk->rx_sa when aborting incoming SrcAvail
    
    Signed-off-by: Amir Vadai <[email protected]>

commit f0fa91c1c089942ccb31e6d81382b863d3608479
Author: Amir Vadai <[email protected]>
Date:   Wed Jun 23 13:42:39 2010 +0300

    sdp: fix compilation warnings in RH
    
    use %z for size_t in printf's
    
    Signed-off-by: Amir Vadai <[email protected]>

commit 75775355c33e66b89cf7247d188ac0f05be511f2
Author: Amir Vadai <[email protected]>
Date:   Wed Jun 23 11:07:36 2010 +0300

    sdp: sdp_destroy_qp should be protected in destroy work
    
    sdp_destroy_work didn't take a lock before destorying QP
    
    Signed-off-by: Amir Vadai <[email protected]>

commit 430dde60ba3f1037c07384c217657409071b4914
Author: Amir Vadai <[email protected]>
Date:   Tue Jun 22 14:29:31 2010 +0300

    sdp: don't double free fmr
    
    There are flows that try to free ssk->fmr twice.
    
    Signed-off-by: Amir Vadai <[email protected]>

commit c31788f272e631838e792120240572063c0fc4fc
Author: Amir Vadai <[email protected]>
Date:   Tue Jun 22 10:45:58 2010 +0300

    sdp: Limit FMR resources
    
    - Make FMR pool size a module parameter and set default value to a
      smaller value (20 FMR's)
    - Fallback to SendSM silently when  can't allocate FMR
    
    Signed-off-by: Amir Vadai <[email protected]>

commit ab47ceeb606f8f8051a2dcb2d677fe8e87fd8b05
Author: Amir Vadai <[email protected]>
Date:   Tue Jun 22 10:40:54 2010 +0300

    sdp: Fix for hangs/crashes in rare cases
    
    Signed-off-by: Amir Vadai <[email protected]>

commit bc376b5664b41eb2bea3b35ab14d10e78bb7d403
Author: Eldad Zinger <[email protected]>
Date:   Thu Jun 17 09:57:57 2010 +0300

    sdp: Fix for warning message when receiving with MSG_PEEK flag, and free skb
    that is not needed any more after all data was read from it.
    
    'rx_sa->used', unlike 'offset', was not updated when MSG_PEEK flag was up 
and
    that lead to the behavior that bytes considered to be available to copy 
while
    the sequence offset showed that the bytes already consumed.
    The solution is to discard any use of 'rx_sa->used' and use 'offset' 
instead.
    
    An skb of SDP_MID_SRCAVAIL is not needed anymore when all its inline data 
was
    consumed and the RDMA operation was canceled (due to MSG_PEEK flag).
    This fix eliminates the warning message: "Trying to read beyond SKB".
    
    Signed-off-by: Eldad Zinger <[email protected]>

commit d5d74dca47e9d23ee01c584530398b1e8c21bc96
Author: Eldad Zinger <[email protected]>
Date:   Thu Jun 17 10:44:42 2010 +0300

    sdp: On MSG_PEEK, no rdma_rd_complete should be sent.
    
    Signed-off-by: Eldad Zinger <[email protected]>

commit a31343c3ddcc278b797ca372845b2f3dda5ac2b1
Author: Eldad Zinger <[email protected]>
Date:   Mon Jun 14 15:49:19 2010 +0300

    sdp: update for sdp_cma_handler() events debug messages.
    
    Signed-off-by: Eldad Zinger <[email protected]>

commit 3f9e200a65e669cf9a2cf5017f72f0e12f0faf52
Author: Eldad Zinger <[email protected]>
Date:   Tue Jun 15 08:08:22 2010 +0300

    sdp: fix for handling multi iov's in ZCOPY.
    
    Signed-off-by: Eldad Zinger <[email protected]>

commit bf852735c4081433c2d85af9e8f262fee531b227
Author: Amir Vadai <[email protected]>
Date:   Wed Jun 16 11:54:11 2010 +0300

    sdp: Fix iperf multistream hanging
    
    When trying to send and no credits available. Sometime coulnd't get
    credit update, because interrupts are turned off by default.
    Added poll RX CQ to fix in this case.
    
    Signed-off-by: Amir Vadai <[email protected]>

commit c0905871576dde4f3138c0c89991423a46d3845a
Author: Amir Vadai <[email protected]>
Date:   Wed Jun 16 11:46:56 2010 +0300

    sdp: Fix wrong use of ssk->sdp_disconnect
    
    Due to miss-use of sdp_disconnect rx cq is not armed after disconnect is
    sent
    
    Signed-off-by: Amir Vadai <[email protected]>

commit 5e8cc9370813181a5e75f16349e28f27b08a38b6
Author: Amir Vadai <[email protected]>
Date:   Tue Jun 15 16:00:15 2010 +0300

    sdp: protect sdp_auto_moderation from device removal
    
    Signed-off-by: Amir Vadai <[email protected]>

commit e2c8355eeab3d2fd8ea186d5c452bad1e4e2922f
Author: Eldad Zinger <[email protected]>
Date:   Mon Jun 14 13:03:16 2010 +0300

    sdp: sdp_recvmsg() shouldn't handle SDP_MID_DISCONN when MSG_PEEK flag is 
up.
    
    Signed-off-by: Eldad Zinger <[email protected]>

commit 4d5ae7a75acfe5fda3efdd065abc0158e876db31
Author: Eldad Zinger <[email protected]>
Date:   Mon Jun 14 12:01:31 2010 +0300

    sdp: added lock_sock() to sdp_poll()
    
    Signed-off-by: Eldad Zinger <[email protected]>

commit e86a0c91e89229f9c299ad5e10e1c238b45523ef
Author: Amir Vadai <[email protected]>
Date:   Sun Jun 13 16:10:29 2010 +0300

    sdp: Cleanedup some commented lines
    
    Signed-off-by: Amir Vadai <[email protected]>

commit 8f843c0fdcc38de214c5dab5565f289ebac25523
Author: Amir Vadai <[email protected]>
Date:   Thu Jun 10 11:32:50 2010 +0300

    sdp: Fix bad handling of small rcvbuf size in zcopy
    
    - Do not send RdmaRdComp when no buffers
    - Same for SendSm
    - post at least 3 buffers in RX to have the minimal number of credits
    - make purge_tx_ring ignore WR used by RDMA
    - fixed a typo, to reschedule tx_cq_poll timer according to tx queue and
      not to rx queue (!)
    - Allow credit updates when less than half RX Q is filled
    
    Signed-off-by: Amir Vadai <[email protected]>

commit d110c24d79ac27417642647c21b6c742c5fddd93
Author: Amir Vadai <[email protected]>
Date:   Mon Jun 14 13:08:36 2010 +0300

    sdp: fix issues in orphan count
    
    Signed-off-by: Amir Vadai <[email protected]>

commit 266f9dd74208eaa14bc1b66dea245f6f39e2eebb
Author: Amir Vadai <[email protected]>
Date:   Sun Jun 13 15:51:01 2010 +0300

    sdp: protect rx_ring access with a lock
    
    because sdp_poll_rx_cq() is accessed both from interrupt and from user
    context, need to protect it with a lock.
    
    Signed-off-by: Amir Vadai <[email protected]>

commit 6643e33462c8a7a88c0e506d0210944769f50c57
Author: Aleksey Senin <[email protected]>
Date:   Tue Jun 15 16:11:41 2010 +0300

    IB CORE: RAW ETH support
    
    Fix QP type when using iWARP transport.
    
    Signed-off-by: Aleksey Senin <[email protected]>

commit 16bda78b6aa4736148fc4c02c122443ffc05bcc9
Author: Aleksey Senin <[email protected]>
Date:   Tue Jun 8 16:54:01 2010 +0300

    RAW_ETH support
    
    This patch will fix existing NES code to support RAW_ETH instead of RAW_ETY 
type.
    
    Signed-off-by: Aleksey Senin <[email protected]>

commit a31b55f3ba3fdae8f87e52d316ef0b03e131b3db
Author: Aleksey Senin <[email protected]>
Date:   Mon Jun 14 15:40:35 2010 +0300

    MLX: RAW ETH support
    
    This patch add RAW ETH support in mlx4 driver.
    
    Signed-off-by: Aleksey Senin <[email protected]>

commit 25d639fd4264ef911758a567e4df3d2b9d5e1bde
Author: Aleksey Senin <[email protected]>
Date:   Mon Jun 14 15:38:48 2010 +0300

    IB CORE: RAW ETH support
    
    This patch add RAW ETH support to IB core.
    
    Signed-off-by: Aleksey Senin <[email protected]>

commit 0837dd77b6735e244ab1ece7ff1e82d97d2932fa
Author: Jack Morgenstein <[email protected]>
Date:   Sun Jun 13 11:16:52 2010 +0300

    mlx4_ib: Add spinlock to xrc_reg_list changes and scanning in interrupt 
context.
    
    mlx4_ib_qp_event traverses the xrc_reg_list to distribute XRC_RCV_QP events 
to
    all processes registered to that QP.  When changing the list (list_add/del) 
in
    process context, need to have a spinlock so that the event traversal will 
find
    a stable list.
    
    Fixes FM 85289
    
    Signed-off-by: Jack Morgenstein <[email protected]>

commit 21d8e77f5cfda1abba93957d08559f1e8384b6f1
Author: Jack Morgenstein <[email protected]>
Date:   Sun Jun 13 11:15:42 2010 +0300

    mlx4_ib: XRC locking fixes, and prevention of null dereference.
    
    Based on feedback from Roland Dreier in his post (for null deref):
    http://www.spinics.net/lists/linux-rdma/msg03675.html
    (can have a null dereference if the xrc_rcv verbs are invoked for a
     non-xrc_rcv QP by mistake).
    
    Also fixes a kernel Oops seen at a customer site, most likely due to
    insufficient locking on __mlx4_qp_lookup (this is the general QP database).
    (FM 83858).
    
    Signed-off-by: Jack Morgenstein <[email protected]>

commit 17c3298d08816fa2123891078e4a22b8c878c6ec
Merge: f4a08fc 64c7aba
Author: Vladimir Sokolovsky <[email protected]>
Date:   Sun Jun 13 13:43:54 2010 +0300

    Merge remote branch 'eli/ofed_kernel_1_5' into ofed_kernel_1_5

commit f4a08fc1b090fe5d1df1d7e890fbf940ec88df98
Author: Mike Heinz <[email protected]>
Date:   Wed May 26 15:00:58 2010 -0500

    ofa_kernel madeye.c
    
    This is a simple fix. Several of the snoop filters in 
./drivers/infiniband/util/madeye.c don't switch the attribute id to host byte 
order before checking it.
    
    Signed-off-by: Michael Heinz <[email protected]>

commit 64c7aba292a634b8b6230270106965d07a566ae1
Author: Eli Cohen <[email protected]>
Date:   Sun Jun 13 10:14:41 2010 +0300

    Fix backports for previous commit
    
    Signed-off-by: Eli Cohen <[email protected]>

commit 0d5c5a6a2de045911d5d3b17217eb80bc42555e6
Author: Eli Cohen <[email protected]>
Date:   Thu Jun 10 17:17:08 2010 +0300

    mlx4_en: help disperse interrupts workload
    
    make sure that at least once in 1024 completions, the CQ will be armed so an
    EQE will will be issued and eventually an interrupt request. This is 
necessary
    since the irq balancing code takes effect after the handler is run and we 
want
    the irq balance daemon configurations to take effect.
    
    Signed-off-by: Eli Cohen <[email protected]>

commit f99498779bd4684612c2b322a8e5b5c217070a5e
Author: Alexander Schmidt <[email protected]>
Date:   Fri Jun 11 15:04:02 2010 +0200

    ib/ehca: bitmaks handling for lock_hcalls
    
    Signed-off-by: ALexander Schmidt <[email protected]>

commit 362049c6f54667fa5f992a4de6eb6531e94aa3ee
Merge: d080e35 2d390c2
Author: Vladimir Sokolovsky <[email protected]>
Date:   Sat Jun 12 17:49:51 2010 +0300

    Merge branch 'ofed_kernel_1_5' of 
ssh://sofa.openfabrics.org/home/ctung/scm/ofed-1.5 into ofed_kernel_1_5

commit 2d390c24133cc8cda1fd79baf088b88d6236e3f4
Author: Mirek Walukiewicz <[email protected]>
Date:   Fri Jun 11 10:48:55 2010 -0500

    RDMA/NES: fix for wrong device name set for ibdev
    
    The problem caused problems with MPI applications. Now nes is used instead
    of eth name
    
    Signed-off-by: Mirek Walukiewicz <[email protected]>

commit d080e35bc5d9494c25d534d4984edc46778d1cd5
Author: Eli Cohen <[email protected]>
Date:   Thu Jun 10 09:18:29 2010 +0300

    mlx4_ib: Fix failure to set port number
    
    Fix failure to set the port number of a struct gid_entry object causing 
later
    on crashes due to array bounds overflow.
    
    Signed-off-by: Eli Cohen <[email protected]>

commit d9f3ed19d2c6cdef50dd3975cf85781cf8d869c7
Author: Mirek Walukiewicz <[email protected]>
Date:   Wed Jun 9 20:04:12 2010 -0500

    RDMA/nes: Added missing mutex during memory registration
    
    Fix for crash problem when multiple processes make a memory registration
    
    Signed-off-by: Mirek Walukiewicz <[email protected]>

commit 70f02889ae061264711611989173786fe7d6a2d4
Author: Chien Tung <[email protected]>
Date:   Wed Jun 9 20:02:17 2010 -0500

    RDMA/nes: get and print eeprom version number
    
    Signed-off-by: Chien Tung <[email protected]>

commit ba2b58575e39b3c652c5f919be74223be23426a0
Author: Amir Vadai <[email protected]>
Date:   Wed Jun 9 12:40:23 2010 +0300

    sdp: cleanup skb allocations
    
    - Bad sizing of inline data on send sockets had implications on the
      performance.
    - All sent data is placed on the skb itself (unless accumulated by nagle)
    - Do not count sdp header twice when allocating skb
      added some likely/unlikely
    
    Signed-off-by: Amir Vadai <[email protected]>

commit 2df9ff6e04fcb6a7d1cfa1cf172bf6c0aee1fd3a
Merge: b17f431 1423596
Author: Vladimir Sokolovsky <[email protected]>
Date:   Tue Jun 8 17:09:27 2010 +0300

    Merge remote branch 'amirv/ofed_kernel_1_5' into ofed_kernel_1_5

commit 1423596cfc5a6b02b0d4d4f680f95fe43d3ee825
Author: Amir Vadai <[email protected]>
Date:   Mon Jun 7 18:17:59 2010 +0300

    sdp: Reuse buffers in rx ring
    
    To improve latency in small packets - reuse already pinned and allocated
    pages that wasn't used.
    
    Signed-off-by: Amir Vadai <[email protected]>

commit b17f431349fce94113360d2279d122948087dd5b
Author: Steve Wise <[email protected]>
Date:   Thu Mar 12 09:32:14 2009 -0500

    RDS/IW+IB: Set recv ring low water mark to 1/2 full.
    
    Currently the recv ring low water mark is 1/4 the depth.  Performance
    measurements show that this limits iWARP throughput by flow controlling
    the rds-stress senders.  Setting it to 1/2 seems to max the T3
    performance.  I tried even higher levels but that didn't help and it
    started to increase the rds thread cpu utilization.
    
    Signed-off-by: Steve Wise <[email protected]>
    Signed-off-by: Andy Grover <[email protected]>

commit 2208e58c1d59e6ae1d33421d217dd5209c6f2e9b
Author: Andy Grover <[email protected]>
Date:   Mon Mar 29 16:50:54 2010 -0700

    RDS: Fix locking in send on m_rs_lock
    
    Do not nest m_rs_lock under c_lock
    
    Disable interrupts in {rdma,atomic}_send_complete
    
    Signed-off-by: Andy Grover <[email protected]>

commit 2b3d3ebe25139d4e65bc897721fb3545a6ee1cfe
Author: Andy Grover <[email protected]>
Date:   Fri Feb 19 18:01:41 2010 -0800

    RDS: Rewrite rds_send_drop_to() for clarity
    
    This function has been the source of numerous bugs; it's just
    too complicated. Simplified to nest spinlocks cleanly within
    the second loop body, and kick out early if there are no
    rms to drop.
    
    This will be a little slower because conn lock is grabbed for
    each entry instead of "caching" the lock across rms, but this
    should be entirely irrelevant to fastpath performance.
    
    Signed-off-by: Andy Grover <[email protected]>

commit 7a7699a208de4bb5c4b2b6d207fa38f94a7614d9
Author: Tina Yang <[email protected]>
Date:   Thu Apr 1 14:09:00 2010 -0700

    RDS: Fix corrupted rds_mrs
    
    On second look at this bug (OFED #2002), it seems that the
    collision is not with the retransmission queue (packet acked
    by the peer), but with the local send completion.  A theoretical
    sequence of events (from time t0 to t3) is thought to be as
    follows,
    
    Thread #1
    t0:
        sock_release
        rds_release
        rds_send_drop_to /* wait on send completion */
    t2:
        rds_rdma_drop_keys()   /* destroy & free all mrs */
    
    Thread #2
    t1:
        rds_ib_send_cq_comp_handler
        rds_ib_send_unmap_rm
        rds_message_unmapped   /* wake up #1 @ t0 */
    t3:
        rds_message_put
        rds_message_purge
        rds_mr_put   /* memory corruption detected */
    
    The problem with the rds_rdma_drop_keys() is it could
    remove a mr's refcount more than its due (i.e. repeatedly
    as long as it still remains in the tree (mr->r_refcount > 0)).
    Theoretically it should remove only one reference - reference
    by the tree.
    
            /* Release any MRs associated with this socket */
            while ((node = rb_first(&rs->rs_rdma_keys))) {
                    mr = container_of(node, struct rds_mr, r_rb_node);
                    if (mr->r_trans == rs->rs_transport)
                            mr->r_invalidate = 0;
                    rds_mr_put(mr);
            }
    
    I think the correct way of doing it is to remove the mr from
    the tree and rds_destroy_mr it first, then a rds_mr_put()
    to decrement its reference count by one.  Whichever thread
    holds the last reference will free the mr via rds_mr_put().
    
    Signed-off-by: Tina Yang <[email protected]>
    Signed-off-by: Andy Grover <[email protected]>

commit bd824b27e7a44ff15a297ab375a5211fef8382d8
Author: Amir Vadai <[email protected]>
Date:   Sun Jun 6 12:23:55 2010 +0300

    sdp: cpu affinity in sdpstats
    
    Signed-off-by: Amir Vadai <[email protected]>

commit e8e93577d56f199cb31d3145dab05da4d16ecd02
Author: Amir Vadai <[email protected]>
Date:   Tue Jun 1 17:04:59 2010 +0300

    sdp: use polling in rx
    
    poll on RX whenever possible, use interrupts only before sleeping,
    during graceful close or in zcopy.
    
    Signed-off-by: Amir Vadai <[email protected]>

commit d0f5a79834846b37d86f316844124ebf16ddbfbd
Author: Eldad Zinger <[email protected]>
Date:   Sun Jun 6 13:48:32 2010 +0300

    sdp: fix for a bug of lost refcnt in TCP_TIME_WAIT state.
    
    OFED addons break semantics of cancel_delayed_work_sync(), like in SLES 10.
    
    Signed-off-by: Eldad Zinger <[email protected]>

commit 2736caaba79ac27e3e99e1eef7c0609c57c74566
Author: Eldad Zinger <[email protected]>
Date:   Sun Jun 6 12:07:59 2010 +0300

    sdp: BUG2038 - transmission goal size won't exceed SDP_MAX_PAYLOAD
    
    Signed-off-by: Eldad Zinger <[email protected]>

commit 62d9286df8871081eee8f73759954790c64f4182
Author: Eldad Zinger <[email protected]>
Date:   Tue Jun 1 11:42:35 2010 +0300

    sdp: SDP_WARN_ON defined to be used instead of WARN_ON, for better 
compatibility
    
    Signed-off-by: Eldad Zinger <[email protected]>
bug_id,"bug_severity","priority","op_sys","assigned_to","bug_status","resolution","short_short_desc"
1965,"major","P3","RHEL 5","[email protected]","VERIFIED","FIXED","Bonding mlx4_en: ping does not resume after failover between 10G ports"
2038,"major","P3","All","[email protected]","RESOLVED","FIXED","Running 3 instances of ttcp benchmark on PPC64 machines, results in client machine hang"
2040,"major","P3","All","[email protected]","RESOLVED","FIXED","Running 2 instances of ttcp benchmark on PPC64 machines, results in client machine hang"
2049,"critical","P1","RHEL 5","[email protected]","RESOLVED","FIXED","MPI applications that use fork() hang"
2054,"critical","P3","RHEL 4","[email protected]","CLOSED","FIXED","Compilation of OFED 1.5.2 rc1 failed on RH4 up 7"
2059,"critical","P3","RHEL 5","[email protected]","VERIFIED","FIXED","Compilation on OFED 1.5.2-rc1 failed on RH5 up5 (All packages)"

Reply via email to