Christopher Tanner wrote:
I am setting up a 16-node (homogeneous) cluster running Ubuntu 8.04 server with Mellanox Infiniband cards. I downloaded (from the OpenFabrics website), compiled, and installed the following IB packages on the master node into the /usr/local/lib directory. The /usr/local directory is being shared to all of the nodes via NFS. All packages seemed to compile and install fine.

libibverbs
librdmacm
libibcm
libipathverbs
dapl
compat-dapl
libmlx4
libmthca
libcxgb3
libibcommon
libibumad
libibmad
opensm
infiniband-diags

I have a few questions:
a) Do I need to run 'make install' on each node or just the master node? All of the libraries in /usr/local/lib are visible to all nodes... Stated another way, does 'make install' put files anywhere besides the /usr/local/lib directory? Does it alter OS configuration files to tell the system to look for certain files in /usr/local/lib?


No, running 'make install' on the master node is enough. All the packages above put their files under /usr/local.
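
As a quick sanity check of the NFS-shared install, something like the following could be run on each node. It is only a sketch: the helper name check_ib_libs is hypothetical, the library list is a subset of the packages named above, and the assumption that each one installs a <name>.so* file into the directory may not hold for every build.

```shell
#!/bin/sh
# Hypothetical helper: report which of the expected user-space IB
# libraries are present in a directory (default: /usr/local/lib).
# Returns 0 if all were found, 1 if at least one is missing.
check_ib_libs() {
    dir="${1:-/usr/local/lib}"
    missing=0
    for lib in libibverbs librdmacm libibumad libibmad; do
        # Assumption: each package installs at least one <name>.so* file.
        if ls "$dir/$lib".so* >/dev/null 2>&1; then
            echo "found   $lib"
        else
            echo "MISSING $lib"
            missing=1
        fi
    done
    return "$missing"
}
```

Calling check_ib_libs with no argument checks /usr/local/lib. If the files are visible on a node but programs there still fail to load them, running ldconfig on that node may help, since the dynamic linker cache is normally kept per node rather than on the shared directory.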

b) I know I need to load the IB kernel modules (mlx4_core, mlx4_ib, rdma_ucm, ib_core, ib_mad, ib_mthca, ib_umad, ib_uverbs) in order for the IB cards to work. Are these compiled and installed with the above packages? How does the kernel know where to look for modules? (Sorry, this question is very similar to the first one).


The packages above are user space libraries/binaries. To install kernel
modules you should download the latest version of the ofa_1_4_kernel tgz file 
from:

http://www.openfabrics.org/downloads/ofa_1_4_kernel/
To install, run:
./configure --with-core-mod --with-user_mad-mod --with-user_access-mod --with-addr_trans-mod --with-mthca-mod --with-mthca_debug-mod --with-mlx4-mod --with-mlx4_en-mod --with-mlx4_debug-mod --with-cxgb3-mod --with-ehca-mod --with-ipoib-mod --with-ipoib_debug-mod (... , see --help)
make
make install
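
Once the modules are installed, a small check like the one below can show which of them still need to be loaded on a node. This is a sketch under stated assumptions: the helper name missing_modules is hypothetical, and it only reads a /proc/modules-style listing, where the module name is the first field on each line. modprobe normally resolves module names under /lib/modules/$(uname -r), so the modules should be installed for the kernel each node actually runs.

```shell
#!/bin/sh
# Hypothetical helper: print every wanted module that does not appear
# in a /proc/modules-style file.
# Usage: missing_modules FILE module [module ...]
missing_modules() {
    modfile="$1"; shift
    for mod in "$@"; do
        # Match the module name against the first field only.
        if ! awk -v m="$mod" '$1 == m { found = 1 } END { exit !found }' "$modfile"; then
            echo "$mod"
        fi
    done
}

# Example (as root) for an mlx4-based card; the module list is taken
# from the question above and should be adjusted to the hardware:
#   for m in $(missing_modules /proc/modules mlx4_core mlx4_ib ib_umad ib_uverbs rdma_ucm); do
#       modprobe "$m"
#   done
```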


c) The OFED software stack contains some stuff that isn't available for source download (e.g. ib-bonding, ibsim, libsdp). Are these necessary for the IB network to operate correctly? Since I'm running Ubuntu, obviously the src.rpm file won't work...


All OFED tgz files are available under:
http://www.openfabrics.org/~vlad/ofed_1_4/SOURCES/

The ib-bonding source RPM can be downloaded from the URL below (if needed, you
can unpack it with cpio to get the tgz file):
http://www.openfabrics.org/~monis/ofed_1_4/
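
On a system without rpmbuild, unpacking the source RPM with cpio might look like the sketch below. The helper name extract_srpm is hypothetical, and it assumes the rpm2cpio and cpio tools are installed (on Ubuntu they may need to be added as packages first).

```shell
#!/bin/sh
# Hypothetical helper: unpack a source RPM into a directory.
# Usage: extract_srpm FILE.src.rpm [DEST_DIR]
extract_srpm() {
    srpm="$1"
    dest="${2:-.}"
    [ -f "$srpm" ] || { echo "no such file: $srpm" >&2; return 1; }
    # Assumption: rpm2cpio is available on this machine.
    command -v rpm2cpio >/dev/null 2>&1 || { echo "rpm2cpio not found" >&2; return 127; }
    mkdir -p "$dest" || return 1
    # Make the path absolute before changing directory.
    case "$srpm" in /*) ;; *) srpm="$PWD/$srpm" ;; esac
    # -i extract, -d create directories as needed, -m keep modification times
    ( cd "$dest" && rpm2cpio "$srpm" | cpio -idm )
}
```

For example, extract_srpm ib-bonding-<version>.src.rpm ./ib-bonding-src (the file name here is a placeholder) should leave the source tarball and the spec file in ./ib-bonding-src.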

These packages are not necessary for the IB network to operate correctly, but
whether you need them depends on what you are planning to do.

Regards,
Vladimir

Thanks to all for your help. Previous responses regarding issues with OpenSM worked great.

-------------------------------------------
Chris Tanner
Space Systems Design Lab
Georgia Institute of Technology
[EMAIL PROTECTED]
-------------------------------------------
_______________________________________________
general mailing list
[email protected]
http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general

To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general