Hey Arlin,

Have you ever seen this? I hit this 100% of the time trying the 1.2 version of dapltest on an ofed-1.3 system. The debug info below was obtained by builting the src rpm with debug enabled...

(gdb) r -T T -d -s vic11-10g -D chelsio -i 10 client SR 256 server SR 256 client SR 256 server SR 256 Starting program: /usr/bin/dapltest -T T -d -s vic11-10g -D chelsio -i 10 client SR 256 server SR 256 client SR 256 server SR 256
[Thread debugging using libthread_db enabled]
[New Thread 46912498371600 (LWP 6654)]
-------------------------------------
TransCmd.server_name              : vic11-10g
TransCmd.num_iterations           : 10
TransCmd.num_threads              : 1
TransCmd.eps_per_thread           : 1
TransCmd.validate                 : 0
TransCmd.dapl_name                : chelsio
TransCmd.num_ops                  : 4
TransCmd.op[0].transfer_type      : SEND_RECV  (client)
TransCmd.op[0].seg_size           : 256
TransCmd.op[0].num_segs           : 1
TransCmd.op[0].reap_send_on_recv  : 0
TransCmd.op[1].transfer_type      : SEND_RECV  (server)
TransCmd.op[1].seg_size           : 256
TransCmd.op[1].num_segs           : 1
TransCmd.op[1].reap_send_on_recv  : 0
TransCmd.op[2].transfer_type      : SEND_RECV  (client)
TransCmd.op[2].seg_size           : 256
TransCmd.op[2].num_segs           : 1
TransCmd.op[2].reap_send_on_recv  : 0
TransCmd.op[3].transfer_type      : SEND_RECV  (server)
TransCmd.op[3].seg_size           : 256
TransCmd.op[3].num_segs           : 1
TransCmd.op[3].reap_send_on_recv  : 0
Server Name: vic11-10g

Program received signal SIGSEGV, Segmentation fault.
[Switching to Thread 46912498371600 (LWP 6654)]
0x00000032f04760b0 in strlen () from /lib64/libc.so.6
(gdb) bt
#0  0x00000032f04760b0 in strlen () from /lib64/libc.so.6
#1  0x00000032f044602b in vfprintf () from /lib64/libc.so.6
#2  0x00000032f044bdea in printf () from /lib64/libc.so.6
#3 0x0000000000403900 in DT_NetAddrLookupHostAddress (to_netaddr=0x7e16f88, hostname=0x7e1658c "vic11-10g") at cmd/dapl_netaddr.c:136 #4 0x00000000004026cb in DT_Params_Parse (argc=<value optimized out>, argv=<value optimized out>, params_ptr=0x7e16580) at cmd/dapl_params.c:205 #5 0x000000000040211f in dapltest (argc=22, argv=0x7fff48e9b5f8) at cmd/dapl_main.c:88
#6  0x00000032f041d8a4 in __libc_start_main () from /lib64/libc.so.6
#7  0x0000000000401f59 in _start ()
(gdb)

Its hurling in DT_Mdep_printf() here:

134         /* Pull out IP address and print it as a sanity check */
135         DT_Mdep_printf ("Server Name: %s \n", hostname);
136         DT_Mdep_printf ("Server Net Address: %s\n",
137 inet_ntoa(((struct sockaddr_in *)target->ai_addr)->sin_addr));

The ai_addr looks ok though:
(gdb) p/x *((struct sockaddr_in *)target->ai_addr)
$3 = {sin_family = 0x2, sin_port = 0x0, sin_addr = {s_addr = 0x8846a8c0}, sin_zero = {0x0, 0x0, 0x0, 0x0, 0x0, 0x0, 0x0, 0x0}}
(gdb)


Ever seen this?

Steve.
_______________________________________________
general mailing list
[email protected]
http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general

To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general

Reply via email to