Hi all,
I was experimenting with the 1.8.2 patchless client and ran into a problem using InfiniBand.

* Server kickstarted with RHEL 5.3
* Installed lustre-client-1.8.2-2.6.18_164.11.1.el5_lustre.1.8.2.x86_64.rpm
* Installed lustre-client-modules-1.8.2-2.6.18_164.11.1.el5_lustre.1.8.2.x86_64.rpm
* Performed "chkconfig --level 2345 openibd on" so ib0 is working
* Performed "rsync -alP /lib/modules/2.6.18-164.11.1.el5 /lib/modules/2.6.18-128.el5"
* Performed "depmod -a"
* /etc/modprobe.conf has "options lnet networks=o2ib0(ib0),tcp1(eth2),tcp2(eth3)"
* /etc/fstab has
  10.103.34...@o2ib0:/spfs /lustre1_fifo lustre rw,noauto,_netdev 0 0
* Performed "modprobe lnet"
* Performed "lctl net up", which failed with: LNET configure error 100: Network is down
* A bundle of errors appeared in /var/log/messages; a portion of the tail:

  Feb 16 16:20:43 bg8mo33sn kernel: ko2iblnd: disagrees about version of symbol ib_destroy_fmr_pool
  Feb 16 16:20:43 bg8mo33sn kernel: ko2iblnd: Unknown symbol ib_destroy_fmr_pool
  Feb 16 16:20:43 bg8mo33sn modprobe: FATAL: Error inserting ko2iblnd (/lib/modules/2.6.18-128.el5/2.6.18-164.11.1.el5/kernel/net/lustre/ko2iblnd.ko): Unknown symbol in module, or unknown parameter (see dmesg)
  Feb 16 16:20:43 bg8mo33sn kernel: ko2iblnd: disagrees about version of symbol rdma_destroy_id
  Feb 16 16:20:44 bg8mo33sn kernel: ko2iblnd: Unknown symbol rdma_destroy_id
  Feb 16 16:20:44 bg8mo33sn kernel: ko2iblnd: disagrees about version of symbol rdma_accept
  Feb 16 16:20:44 bg8mo33sn kernel: ko2iblnd: Unknown symbol rdma_accept
  Feb 16 16:20:44 bg8mo33sn kernel: ko2iblnd: disagrees about version of symbol ib_dealloc_pd
  Feb 16 16:20:44 bg8mo33sn kernel: ko2iblnd: Unknown symbol ib_dealloc_pd
  Feb 16 16:20:44 bg8mo33sn kernel: ko2iblnd: disagrees about version of symbol ib_fmr_pool_map_phys
  Feb 16 16:20:44 bg8mo33sn kernel: ko2iblnd: Unknown symbol ib_fmr_pool_map_phys
  Feb 16 16:20:44 bg8mo33sn kernel: LustreError: 4768:0:(api-ni.c:1043:lnet_startup_lndnis()) Can't load LND o2ib, module ko2iblnd, rc=256

* In 1.8.1.1 there was kernel-ib-1.4.2-2.6.18_128.7.1.el5.x86_64.rpm, and I had no problem with 1.8.1.1.
* In 1.8.2 there is no specific RPM to handle IB? Using the stock openib from RHEL worked for the patched kernel installation.
* Are there any other tricks to make this work with the patchless client?

Thanks in advance...

Steve

Stephen Chu
AT&T Labs CSO
C5-3C03
200 Laurel Ave
Middletown, NJ
[email protected]
_______________________________________________
Lustre-discuss mailing list
[email protected]
http://lists.lustre.org/mailman/listinfo/lustre-discuss
