> 
> On 2/4/2015 6:29 PM, [email protected] wrote:
> > From: Ira Weiny <[email protected]>
> >
> > The following patch series modifies the kernel MAD processing
> > (ib_mad/ib_umad) and related interfaces to send and receive Intel
> > Omni-Path Architecture MADs on devices which support them.
> >
> > In addition to supporting some IBTA management classes, OPA devices
> > use MADs with lengths up to 2K.  These "jumbo" MADs increase the
> > performance of management traffic.
> >
> > To distinguish IBTA MADs from OPA MADs a new Base Version is introduced.
> 
> With your recent changes, I don't think that statement above is strictly true 
> any
> longer. While OPA does use a different base version for it's jumbo MADs, 
> aren't
> OPA MADs distinguished from IBTA MADs by the new OPA MAD device
> capability bit ?
> 

True.

However, OPA MADs with a base version of 0x1 are compatible with and therefore 
can be processed by the same code as IBTA MADs.

If I need to respin the series I will update the comment.

> 
> What performance tests were run in terms of IBTA MADs ?

Sorry for the delay.  As there have been a couple of versions of this series 
since I ran those tests in December I took the time to re-run these tests.  

OpenSM (sweep time) and infiniband-diag (iblinkinfo and saquery) were timed on 
my small cluster with no noticeable change in performance.  But this is not the 
best test as I only have 6 or so nodes on 2 switches.

For example iblinkinfo runs very quickly:

[root@phcppriv12 OPENIB_FF]# time iblinkinfo > /dev/null

real    0m0.072s
user    0m0.002s
sys     0m0.041s


The better test we have at this small scale are a couple of tools (closed 
source) which send SMA and PMA packets as rapidly as possible.

Those showed no difference in performance.

For example I ran these tools with 3 different kernels 1) "stock roland",  2) 
the series in question up to the kmalloc patch,  and  final 3) then the full 
OPA series.

Here is a summary of the results:

                   Roland for-next (ecb7b12)             up to kmalloc patch    
               full OPA patch set

SMA            21072                                                21324       
                                 21381
SMA rcv      17139                                                17329         
                               17303

PMA            24159.4                                             24401.2      
                               24166.5
PMA rcv      24159.4                                             24401.2        
                             24166.5


NOTE: The results shown above are specifically shown without units as I am not 
allowed to publish performance numbers.  However larger numbers equal better 
performance.

The numbers are quite repeatable and as you can see are all close to each other 
from kernel to kernel.
 
The remote node for these tests was running the stock RHEL7 kernel.  All 
software and hardware was held constant except for the kernel patches in 
question.
        
Ira

> 
> -- Hal
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to [email protected]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Reply via email to