Hello,
I tried connecting two endpoints using the new uDAPL library.
The connection fails during the route lookup with an "Invalid argument" error.
Attached is the log captured with DAT_DBG_TYPE=0xffff and DAPL_DBG_TYPE=0xffff.
I traced the failure to the call to ib_at_route_by_ip(), and it seems that
all of its arguments (dst_ip, src_ip, r_qual, ...) are 0x00.

I have also attached my dat.conf, lsmod, /etc/ibhosts, and /etc/hosts.

Could you please point me to where I should look?

Thanks,
Aniruddha

DAT Registry: Started (dat_init)
DAT Registry: static registry file </etc/dat.conf> 

DAT Registry: token
 type  eor
 value <>


DAT Registry: token
 type  string
 value <IB1>


DAT Registry: token
 type  string
 value <u1.2>


DAT Registry: token
 type  string
 value <nonthreadsafe>


DAT Registry: token
 type  string
 value <default>


DAT Registry: token
 type  string
 value </usr/local/lib/libdapl.so>


DAT Registry: token
 type  string
 value <mv_dapl.1.2>


DAT Registry: token
 type  string
 value <mthca0 1>


DAT Registry: token
 type  string
 value < >


DAT Registry: token
 type  eor
 value <>


DAT Registry: entry 
 ia_name IB1
 api_version
     type 0x0
     major.minor 1.2
 is_thread_safe 0
 is_default 1
 lib_path /usr/local/lib/libdapl.so
 provider_version
     id mv_dapl
     major.minor 1.2
 ia_params mthca0 1

DAT Registry: loading provider for IB1

DAT Registry: token
 type  eof
 value <>

DAPL: NOT Setting Loopback
 dapl_ib_init: 
 cm_thread_init(1348)
 at_thread_init(1348)
Initializing xprt library
RPC:      setting up UDP transport...
RPC:      created transport 0x804ad78 with 16 slots
TRACE: Created xpt...
DAT Registry: dat_ia_openv (IB1,1:2,0) called
DAT Registry: IA IB1, trying to load library /usr/local/lib/libdapl.so
 at_thread(1348,0xb7751bb0): ENTER: at_fd 3
DAT Registry: dat_registry_add_provider (IB1,1:2,0)
 cm_thread(1348,0xb7f52bb0): ENTER: cm_fd 4
dapl_ia_open (IB1, 32, 0x804bdac, 0x804bda4)
 open_hca: mthca0 - 0x804c868
 open_hca: Found dev mthca0 0002c901081e7470
 open_hca: GID subnet fe80000000000000 id 0002c901081e7471
 get_hca_addr: ips_by_gid ret 0 at_rec 0xbfad58c0 -> id 72
 ip_comp_handler: at_rec 0xbfad58c0 ->id 72 id 72 rec_num 1 c0a0a0a
 at_thread: callback woke
 cq_thread_init(0x804c868)
 open_hca: mthca0, port 1, AF_INET  10.10.10.12 INLINE_MAX=128
 cq_thread: ENTER hca 0x804c868
 query_hca: mthca0 AF_INET  10.10.10.12
 query_hca: (0.180000) ep 65520 ep_q 65535 evd 65408 evd_q 65535
 query_hca: msg 2147483648 rdma 2147483648 iov 60 lmr 131056 rmr 0
 setup_async_cb: ia 0x8051618 type 0 handle (nil) cb 0xb7f87348 ctx 0x80516a0
 setup_async_cb: ia 0x8051618 type 1 handle (nil) cb 0xb7f87188 ctx 0x80516a0
 setup_async_cb: ia 0x8051618 type 3 handle (nil) cb 0xb7f8727c ctx 0x8051618
dapl_ia_open () returns 0x0
dat_set_handle 0x8051618 to 1
dat_get_ia_handle from 1 to 0x8051618
dapl_ia_query (0x8051618, (nil), 0x7ffffffff, 0x804bdb8, 0x3ffffff, (nil))
 query_hca: mthca0 AF_INET  10.10.10.12
 query_hca: (0.180000) ep 65520 ep_q 65535 evd 65408 evd_q 65535
 query_hca: msg 2147483648 rdma 2147483648 iov 60 lmr 131056 rmr 0
dapl_ia_query () returns 0x0
dat_get_ia_handle from 1 to 0x8051618
dapl_pz_create (0x8051618, 0x804bda8)
 pd_alloc: pd_handle=0x8050cc0
dat_get_ia_handle from 1 to 0x8051618
dat_get_ia_handle from 1 to 0x8051618
dat_get_ia_handle from 1 to 0x8051618
dapl_evd_create (0x8051618, 32, 0x8053308, 0x20, 0x805329c)
dapl_ia_query (0x8051618, (nil), 0x0, (nil), 0x3ffffff, (nil))
dapl_ia_query () returns 0x0
dapls_ib_cq_alloc: evd 0x80533b0 cqlen=32 
dapls_ib_cq_alloc: new_cq 0x80534a8 cqlen=63 
 setup_async_cb: ia 0x8051618 type 2 handle 0x80533b0 cb 0xb7f87624 ctx 0x80533b0
dapl_evd_create () returns 0x0
dat_get_ia_handle from 1 to 0x8051618
dapl_evd_create (0x8051618, 32, (nil), 0xa0, 0x80532a0)
dapl_ia_query (0x8051618, (nil), 0x0, (nil), 0x3ffffff, (nil))
dapl_ia_query () returns 0x0
dapls_ib_cq_alloc: evd 0x8055008 cqlen=32 
dapls_ib_cq_alloc: new_cq 0x8053f40 cqlen=63 
 setup_async_cb: ia 0x8051618 type 2 handle 0x8055008 cb 0xb7f87624 ctx 0x8055008
dapl_evd_create () returns 0x0
dat_get_ia_handle from 1 to 0x8051618
dapl_evd_create (0x8051618, 32, (nil), 0x40, 0x8053298)
dapl_ia_query (0x8051618, (nil), 0x0, (nil), 0x3ffffff, (nil))
dapl_ia_query () returns 0x0
dapl_evd_create () returns 0x0
dat_get_ia_handle from 1 to 0x8051618
dapl_ep_create (0x8051618, 0x8052fb8, 0x80533b0, 0x8055008, 0x8055af8, (nil), 0x80532a4)
 query_hca: MAX msg 2147483648 dto 65535 iov 60 rdma i4,o4
 qp_alloc: ia_ptr 0x8051618 ep_ptr 0x8057218 ep_ctx_ptr 0x8057218
 qp_alloc: qpn 0x2a0016 sq 16,9 rq 16,4
 modify_qp_INIT: pi 0 port 1 acc f
dat_get_ia_handle from 1 to 0x8051618
 mr_register: ia=0x8051618, lmr=0x8057f20 va=0x805a008 ln=1024 pv=0x0
 mr_register: mr=0x8057fd0 h 3 pd 0x8050cc0 ctx 0x805154c ,lkey=0x804003b, rkey=0x804003b priv=400
dapl_ep_connect (0x8057218, {4294967280.4294967295.4294967295.4294967295}, 4A275800, 0, -1, (nil), 0, 0)
 connect: r_SID 1244092416, pdata (nil), plen 0
 connect: at_route ret=-1,Invalid argument req_id 0 GID 0000000000000000 0000000000000000
 ib_at_route_by_ip Invalid argument
 destroy_cm_id: conn 0x80832c8 id 134755424
dapl_ep_connect () returns 0x50000
dapl_evd_dequeue (0x80533b0, 0xbfad5ab0)
dapl_evd_dequeue () returns 0xd0000
dapl_ep_free (0x8057218)
dapl_ep_disconnect (0x8057218, 0)
dapl_ep_disconnect () returns 0x8007002e
dapl_ep_free: Free EP: 0, ep 0x8057218 qp_state 1 qp_handle 80578b0
 qp_free:  ep_ptr 0x8057218 qp 0x80578b0
dapl_evd_free (0x8055af8)
dapl_evd_free () returns 0x0
dapl_evd_free (0x8055008)
dapl_evd_free () returns 0x0
dapl_evd_free (0x80533b0)
dapl_evd_free () returns 0x0
TRACE: >>> °²ú·hÆú·
TRACE: Destroyed xpt...
DAPL: Stopped (dapl_fini)
 dapl_ib_release: 
 at_thread_destroy(1348)
 at_thread_destroy(1348) SIGUSR1 sent
 at_thread_destroy: waiting for at_thread
 at_thread: SIG? ret=Interrupted system call, destroy=1
 at_thread: callback woke
 at_thread(1348) EXIT 
 at_thread_destroy(1348) exit
 cm_thread_destroy(1348)
 cm_thread_destroy(1348) SIGUSR1 sent
 cm_thread_destroy: waiting for cm_thread
 cm_thread(1348): ERR Interrupted system call poll
 cm_thread(1348) EXIT, cm_list=EMPTY
 cm_thread_destroy(1348) exit
DAT Registry: Stopped (dat_fini)
#
# DAT 1.1 and 1.2 configuration file
#
# Each entry should have the following fields:
#
# <ia_name> <api_version> <threadsafety> <default> <lib_path> \
#           <provider_version> <ia_params> <platform_params>
#
# Example for openib using the first Mellanox adapter,  port 1 and port 2

IB1 u1.2 nonthreadsafe default /usr/local/lib/libdapl.so mv_dapl.1.2 "mthca0 1" " "
# Do not remove the following line, or various programs
# that require network functionality will fail.
127.0.0.1               localhost.localdomain localhost
10.10.10.11             filer-ib6a.discolab.rutgers.edu filer-ib6a
10.10.10.12             hora-3-ib0.discolab.rutgers.edu hora-3-ib0
Module                  Size  Used by
ib_uat                 15244  0 
ib_ucm                 19596  1 
ib_cm                  37196  1 ib_ucm
ib_at                  28096  1 ib_uat
kdapl                  11268  0 
ib_uverbs              31632  0 
ib_umad                18200  4 
ib_ipoib               43396  0 
ib_sa                  16532  2 ib_at,ib_ipoib
ipv6                  267264  18 
i2c_dev                13952  0 
i2c_core               25216  1 i2c_dev
sunrpc                145476  1 
dm_mod                 60316  0 
video                  20228  0 
button                 10896  0 
battery                13700  0 
ac                      9092  0 
uhci_hcd               36624  0 
hw_random               9748  0 
ib_mthca              111260  0 
ib_mad                 41248  4 ib_cm,ib_umad,ib_sa,ib_mthca
ib_core                46336  7 ib_cm,ib_uverbs,ib_umad,ib_ipoib,ib_sa,ib_mthca,ib_mad
e1000                 105524  0 
floppy                 63188  0 
filer-ib6a      0xfe80000000000000 0x0001730000008461
hora-3-ib0      0xfe80000000000000 0x0002c901081e7471
_______________________________________________
openib-general mailing list
[email protected]
http://openib.org/mailman/listinfo/openib-general