Re: [ewg] [PATCH] IB/qib: latest QIB driver fixes

2009-12-02 Thread Or Gerlitz

Ralph Campbell wrote:

> I don't think this is likely to happen since it fixes Roland's original
> concern with exporting struct ib_port

Whatever; it's still a patch to the core that adds a new API, so it needs to
pass maintainer acceptance. It's been two weeks since you sent v2, so
a kind reminder to Roland might do the job. Try it.


Or.

___
ewg mailing list
ewg@lists.openfabrics.org
http://lists.openfabrics.org/cgi-bin/mailman/listinfo/ewg


[ewg] ib_rdma_bw on OFED-1.5rc3

2009-12-02 Thread Higor Aparecido Vieira Alves
Hello, 

I can't find ib_rdma_bw in OFED-1.5rc3. I built it using the --all option.
Do you know if it was removed?

Regards, 
-- 
Higor Aparecido Vieira Alves
Software Engineer
Linux Technology Center 
IBM Systems & Technology Group



Re: [ewg] [PATCH] IB/qib: latest QIB driver fixes

2009-12-02 Thread Ralph Campbell
No reason not to wait unless it goes beyond OFED-1.5
ship date. I don't think waiting until the last minute
is a good idea. Note that this interface is between ib_qib
and ib_core so if the final form accepted upstream is
different, it can be changed without impact since the kernel
modules are all compiled at the same time. I don't think
this is likely to happen since it fixes Roland's original
concern with exporting struct ib_port.

On Tue, 2009-12-01 at 01:48 -0800, Or Gerlitz wrote:
> Ralph Campbell wrote:
> > Vlad, please pull from
> >   
> Ralph, Tziporet, any reason not to wait till this core patch is accepted 
> to the mainline kernel?
> 
> Or.
> 
> > commit 840bbefeda26d21bffae6b7cdc88e981fcfb0a45
> > Author: Ralph Campbell (QLogic) 
> > Date:   Mon Nov 30 14:09:34 2009 -0800
> >
> > IB/core: allow HCAs to create IB port sysfs files
> > This patch adds a new function to sysfs.c so that HCAs can
> > create files in /sys/class/infiniband//ports//.
> > There is no need for an unregister function since the kobject
> > reference will go to zero when ib_unregister_device() is called.
> > 
> > Signed-off-by: Ralph Campbell 
> >
> >   
> 



Re: [ewg] NFSoRDMA: Can't restart NFS server

2009-12-02 Thread Jon Mason
On Wed, Dec 02, 2009 at 07:44:30AM -0600, Jon Mason wrote:
> On Wed, Dec 02, 2009 at 06:00:03PM +0800, PN wrote:
> > Hi,
> > 
> > I'm using CentOS 5.4 with OFED-1.5-rc3 and Mellanox ConnectX cards.
> > I followed the steps in nfs-rdma.release-notes.txt and could use NFS over RDMA
> > successfully the first time.
> > However, after unmounting the file system and restarting the NFS server, it
> > shows the following error:
> > 
> > # /etc/init.d/nfs stop
> > Shutting down NFS mountd:  [  OK  ]
> > Shutting down NFS daemon:  [  OK  ]
> > Shutting down NFS quotas:  [  OK  ]
> > Shutting down NFS services:
> > 
> > # cat /proc/fs/nfsd/portlist
> > rdma 20049
> > tcp 2049
> > udp 2049
> > 
> > # lsmod |grep rdma
> > svcrdma                69184  1
> > sunrpc                245544  14 svcrdma,nfsd,nfs,nfs_acl,auth_rpcgss,lockd
> > rdma_ucm               47872  0
> > rdma_cm                69140  3 svcrdma,rdma_ucm,ib_sdp
> > iw_cm                  43656  1 rdma_cm
> > ib_addr                43016  1 rdma_cm
> > ib_cm                  73000  2 rdma_cm,ib_ipoib
> > ib_sa                  75144  3 rdma_cm,ib_ipoib,ib_cm
> > ib_uverbs              75696  1 rdma_ucm
> > ib_core               109312  15 svcrdma,rdma_ucm,ib_sdp,rdma_cm,iw_cm,ib_ipoib,ib_cm,ib_sa,ib_uverbs,ib_umad,iw_nes,iw_cxgb3,mlx4_ib,ib_mthca,ib_mad
> > 
> > # /etc/init.d/nfs start
> > Starting NFS services: [  OK  ]
> > Starting NFS quotas:   [  OK  ]
> > Starting NFS daemon:   [FAILED]
> 
> You want to run the following to restart:
> # /etc/init.d/nfs restart
> 
> > In /var/log/messages, it shows:
> > Dec  2 17:34:30 mgmt nfsd[5106]: nfssvc: Setting version failed: errno 16
> > (Device or resource busy)
> > Dec  2 17:34:30 mgmt nfsd[5106]: nfssvc: unable to bind UPD socket: errno 98
> > (Address already in use)
> > 
> > On the client side, I can no longer mount the server.
> 
> This could be a bug, I'll look into it.

I can replicate the issue running /etc/init.d/nfs restart on a CentOS
5.3 server and a CentOS 5.4 client.  Please open a bug.

Thanks,
Jon

> 
> > Is there a correct way to restart the NFS server after configuring
> > NFSoRDMA?
> > 
> > Thanks in advance.
> > 
> > PN
> 
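
One way to check the symptom reported in this thread is to look at what nfsd still lists in /proc/fs/nfsd/portlist after "/etc/init.d/nfs stop". The following is only a minimal sketch: the function name leftover_nfsd_ports is hypothetical, and treating any entry that remains after a stop as a listener that was not torn down is an assumption, not something the thread confirms.

```shell
#!/bin/sh
# Hypothetical helper (not part of OFED or nfs-utils): print the transports
# that nfsd still lists, one "transport:port" pair per line.
# The path defaults to the file shown in the thread; pass another path to test.
leftover_nfsd_ports() {
    portlist="${1:-/proc/fs/nfsd/portlist}"
    # If the file is not readable, nfsd is not loaded, so nothing is left over.
    [ -r "$portlist" ] || return 0
    # Each line has the form "<transport> <port>".
    awk 'NF >= 2 { print $1 ":" $2 }' "$portlist"
}
```

Run after stopping the service, any output from leftover_nfsd_ports (for example an rdma entry on port 20049) would match the state PN posted and may be worth including in the bug report.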


Re: [ewg] NFSoRDMA: Can't restart NFS server

2009-12-02 Thread Jon Mason
On Wed, Dec 02, 2009 at 06:00:03PM +0800, PN wrote:
> Hi,
> 
> I'm using CentOS 5.4 with OFED-1.5-rc3 and Mellanox ConnectX cards.
> I followed the steps in nfs-rdma.release-notes.txt and could use NFS over RDMA
> successfully the first time.
> However, after unmounting the file system and restarting the NFS server, it
> shows the following error:
> 
> # /etc/init.d/nfs stop
> Shutting down NFS mountd:  [  OK  ]
> Shutting down NFS daemon:  [  OK  ]
> Shutting down NFS quotas:  [  OK  ]
> Shutting down NFS services:
> 
> # cat /proc/fs/nfsd/portlist
> rdma 20049
> tcp 2049
> udp 2049
> 
> # lsmod |grep rdma
> svcrdma                69184  1
> sunrpc                245544  14 svcrdma,nfsd,nfs,nfs_acl,auth_rpcgss,lockd
> rdma_ucm               47872  0
> rdma_cm                69140  3 svcrdma,rdma_ucm,ib_sdp
> iw_cm                  43656  1 rdma_cm
> ib_addr                43016  1 rdma_cm
> ib_cm                  73000  2 rdma_cm,ib_ipoib
> ib_sa                  75144  3 rdma_cm,ib_ipoib,ib_cm
> ib_uverbs              75696  1 rdma_ucm
> ib_core               109312  15 svcrdma,rdma_ucm,ib_sdp,rdma_cm,iw_cm,ib_ipoib,ib_cm,ib_sa,ib_uverbs,ib_umad,iw_nes,iw_cxgb3,mlx4_ib,ib_mthca,ib_mad
> 
> # /etc/init.d/nfs start
> Starting NFS services: [  OK  ]
> Starting NFS quotas:   [  OK  ]
> Starting NFS daemon:   [FAILED]

You want to run the following to restart:
# /etc/init.d/nfs restart

> In /var/log/messages, it shows:
> Dec  2 17:34:30 mgmt nfsd[5106]: nfssvc: Setting version failed: errno 16
> (Device or resource busy)
> Dec  2 17:34:30 mgmt nfsd[5106]: nfssvc: unable to bind UPD socket: errno 98
> (Address already in use)
> 
> On the client side, I can no longer mount the server.

This could be a bug, I'll look into it.

> Is there a correct way to restart the NFS server after configuring
> NFSoRDMA?
> 
> Thanks in advance.
> 
> PN



[ewg] ofa_1_5_kernel 20091202-0200 daily build status

2009-12-02 Thread Vladimir Sokolovsky (Mellanox)
This email was generated automatically, please do not reply


git_url: git://git.openfabrics.org/ofed_1_5/linux-2.6.git
git_branch: ofed_kernel_1_5

Common build parameters: 

Passed:
Passed on i686 with linux-2.6.18
Passed on i686 with linux-2.6.19
Passed on i686 with linux-2.6.21.1
Passed on i686 with linux-2.6.26
Passed on i686 with linux-2.6.24
Passed on i686 with linux-2.6.22
Passed on i686 with linux-2.6.27
Passed on x86_64 with linux-2.6.16.60-0.54.5-smp
Passed on x86_64 with linux-2.6.16.60-0.21-smp
Passed on x86_64 with linux-2.6.18
Passed on x86_64 with linux-2.6.18-128.el5
Passed on x86_64 with linux-2.6.18-164.el5
Passed on x86_64 with linux-2.6.20
Passed on x86_64 with linux-2.6.19
Passed on x86_64 with linux-2.6.18-93.el5
Passed on x86_64 with linux-2.6.21.1
Passed on x86_64 with linux-2.6.24
Passed on x86_64 with linux-2.6.22
Passed on x86_64 with linux-2.6.26
Passed on x86_64 with linux-2.6.27
Passed on x86_64 with linux-2.6.25
Passed on x86_64 with linux-2.6.27.19-5-smp
Passed on x86_64 with linux-2.6.9-89.ELsmp
Passed on x86_64 with linux-2.6.9-78.ELsmp
Passed on x86_64 with linux-2.6.9-67.ELsmp
Passed on ia64 with linux-2.6.21.1
Passed on ia64 with linux-2.6.19
Passed on ia64 with linux-2.6.18
Passed on ia64 with linux-2.6.23
Passed on ia64 with linux-2.6.24
Passed on ia64 with linux-2.6.22
Passed on ia64 with linux-2.6.26
Passed on ia64 with linux-2.6.25
Passed on ppc64 with linux-2.6.18
Passed on ppc64 with linux-2.6.19

Failed:


Re: [ewg] Open MPI 1.3.4 release in OFED 1.5?

2009-12-02 Thread Tziporet Koren

Jeff Squyres wrote:
> Works for me.  I'll upload it now and update the tag that Vlad pulls
> from.  If others hate this idea, it's easy to back it out.

I think 1.3.4 is better, and we still have two and a half weeks of testing.

Tziporet



[ewg] NFSoRDMA: Can't restart NFS server

2009-12-02 Thread PN
Hi,

I'm using CentOS 5.4 with OFED-1.5-rc3 and Mellanox ConnectX cards.
I followed the steps in nfs-rdma.release-notes.txt and could use NFS over RDMA
successfully the first time.
However, after unmounting the file system and restarting the NFS server, it
shows the following error:

# /etc/init.d/nfs stop
Shutting down NFS mountd:  [  OK  ]
Shutting down NFS daemon:  [  OK  ]
Shutting down NFS quotas:  [  OK  ]
Shutting down NFS services:

# cat /proc/fs/nfsd/portlist
rdma 20049
tcp 2049
udp 2049

# lsmod |grep rdma
svcrdma                69184  1
sunrpc                245544  14 svcrdma,nfsd,nfs,nfs_acl,auth_rpcgss,lockd
rdma_ucm               47872  0
rdma_cm                69140  3 svcrdma,rdma_ucm,ib_sdp
iw_cm                  43656  1 rdma_cm
ib_addr                43016  1 rdma_cm
ib_cm                  73000  2 rdma_cm,ib_ipoib
ib_sa                  75144  3 rdma_cm,ib_ipoib,ib_cm
ib_uverbs              75696  1 rdma_ucm
ib_core               109312  15 svcrdma,rdma_ucm,ib_sdp,rdma_cm,iw_cm,ib_ipoib,ib_cm,ib_sa,ib_uverbs,ib_umad,iw_nes,iw_cxgb3,mlx4_ib,ib_mthca,ib_mad

# /etc/init.d/nfs start
Starting NFS services: [  OK  ]
Starting NFS quotas:   [  OK  ]
Starting NFS daemon:   [FAILED]

In /var/log/messages, it shows:
Dec  2 17:34:30 mgmt nfsd[5106]: nfssvc: Setting version failed: errno 16
(Device or resource busy)
Dec  2 17:34:30 mgmt nfsd[5106]: nfssvc: unable to bind UPD socket: errno 98
(Address already in use)

On the client side, I can no longer mount the server.

Is there a correct way to restart the NFS server after configuring
NFSoRDMA?

Thanks in advance.

PN

Re: [ewg] Strange behavior with IPoIB in today's Dec 1 OFED daily build

2009-12-02 Thread Eli Cohen
Did you check that ib0 is up on c0-6?

On Tue, Dec 01, 2009 at 03:36:12PM -0800, Woodruff, Robert J wrote:
> I loaded today's daily build on RedHat EL 5.4 and
> I am seeing some strange behavior with the route table
> that is generated.
> 
> my ifcfg-ib0 script looks like this
> 
> DEVICE=ib0
> BOOTPROTO=static
> IPADDR=169.254.104.248
> ONBOOT=yes
> 
> On OFED-1.5-rc2, the route table looks like this, which is what I expect.
> 
> 
> [r...@compute-0-7 woody]# netstat -r
> Kernel IP routing table
> Destination     Gateway         Genmask         Flags   MSS Window  irtt Iface
> 192.168.25.0    *               255.255.255.0   U         0 0          0 eth0
> 169.254.0.0     *               255.255.0.0     U         0 0          0 ib0
> default         cst-head.local  0.0.0.0         UG        0 0          0 eth0
> 
> On today's daily build, the route table seems to be routing the IPoIB traffic
> out eth0 instead of ib0.
> 
> [r...@compute-0-7 woody]# ssh c0-6
> r...@c0-6's password: 
> Last login: Tue Dec  1 11:09:10 2009 from compute-0-5.local
> [r...@compute-0-6 ~]# netstat -r
> Kernel IP routing table
> Destination     Gateway         Genmask         Flags   MSS Window  irtt Iface
> 192.168.25.0    *               255.255.255.0   U         0 0          0 eth0
> 169.254.0.0     *               255.255.0.0     U         0 0          0 eth0   <-- should be ib0
> default         cst-head.local  0.0.0.0         UG        0 0          0 eth0
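
The routing symptom in this thread can be checked without reading the table by eye. The following is a minimal sketch, assuming the netstat -r / netstat -rn column layout shown above, where the interface is the last column. The function name route_iface is hypothetical and is not an OFED tool.

```shell
#!/bin/sh
# Hypothetical helper: read a routing table on stdin (netstat -r layout)
# and print the interface used for the given destination.
route_iface() {
    dest="$1"
    # The interface name is the last column of the row whose first
    # column matches the destination exactly.
    awk -v dest="$dest" '$1 == dest { print $NF }'
}
```

For example, "netstat -rn | route_iface 169.254.0.0" would be expected to print ib0 on a node configured like compute-0-7 above. If it prints eth0, checking the link with "ip link show ib0", as Eli suggests, is a reasonable next step.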