Hi Sam,
Kyle sent me the code and I compiled it this morning.
First, I was using mpich2-mx compiled with PVFS2 support. It failed
with the error that MX was already initialized. Both mpich2-mx and
bmi_mx are calling mx_init(). I changed bmi_mx to ignore
MX_ALREADY_INITIALIZED.
Second, I do not see any errors returned in bmi_mx. It fails in
PINT_process_request (see call trace below). The request has segs =
0, bytemax = -1291, and bytes = 0.
It could well be that these values are incorrect due to a bug in
bmi_mx that is not flagging an error, but I have no idea.
Can you take a look at this?
Thanks,
Scott
0: (gdb) b PINT_process_request
0: Breakpoint 2 at 0x4701c8: file src/io/description/pint-request.c,
line 72.
0: (gdb) run -fname pvfs2://mnt/pvfs2/atchley/blah -fsize 1 -timing
0: Continuing.
0: ========= Parameter space dump =========
0: filename: pvfs2://mnt/pvfs2/atchley/blah ionodes
0: file size (MB): 1 buffer size 0
0: vector length: 10 element count: 1 vector count: 0
0: striping factor: 0 striping size: -1 collective buffer size: 0
0: loops: 1 displacement 0
0: ========= Dump done =========
0: #* no verification possible!
0: calling noncontigmem_noncontigfile(pvfs2://mnt/pvfs2/atchley/
blah, 0x0x2aaaaaaab010, 1048560)
0:
0: # testing noncontiguous in memory, noncontiguous in file using
independent I/O
0: # vector count = 26214 - access count = 26214
0: calling MPI_File_open(pvfs2://mnt/pvfs2/atchley/blah)
0: calling MPI_File_set_view()
0: calling MPI_File_seek()
0: calling MPI_File_write()
0: [New Thread 1082132816 (LWP 29290)]
0: [New Thread 1090525520 (LWP 29291)]
0:
0: Breakpoint 2, PINT_process_request (req=0x6aea50, mem=0x6aeb00,
0: rfdata=0x7fffd112b880, result=0x7fffd112b850, mode=2)
0: at src/io/description/pint-request.c:72
0: 72 void *temp_space = NULL; /* temp copy of req state
for size call */
0: (gdb) 0: (gdb) bt
0: #0 PINT_process_request (req=0x6aea50, mem=0x6aeb00,
rfdata=0x7fffd112b880,
0: result=0x7fffd112b850, mode=2) at src/io/description/pint-
request.c:72
0: #1 0x00000000004844e0 in io_find_target_datafiles
(mem_req=0x6ad160,
0: file_req=0x6ae960, file_req_offset=0, dist_p=0x6ae9c0,
fs_id=1825963815,
0: io_type=PVFS_IO_WRITE, input_handle_array=0x6b9510,
input_handle_count=4,
0: handle_index_array=0x6b9240,
handle_index_out_count=0x7fffd112b944,
0: sio_handle_index_array=0x6aea30,
sio_handle_index_count=0x7fffd112b940)
0: at src/client/sysint/sys-io.sm:2320
0: #2 0x0000000000480010 in io_datafile_setup_msgpairs (sm_p=0x6ba4a0,
0: js_p=0x7fffd112b9f0) at src/client/sysint/sys-io.sm:489
0: #3 0x0000000000476a66 in PINT_state_machine_next (s=0x6ba4a0,
0: r=0x7fffd112b9f0) at ./src/common/misc/state-machine-fns.h:158
0: #4 0x0000000000476645 in PINT_client_state_machine_post
(sm_p=0x6ba4a0,
0: pvfs_sys_op=6, op_id=0x7fffd112bb30, user_ptr=0x0)
0: at src/client/sysint/client-state-machine.c:312
0: #5 0x000000000047f9fc in PVFS_isys_io (ref=
0: {handle = 1048563, fs_id = 1825963815, __pad1 = 0},
file_req=0x6ae960,
0: file_req_offset=0, buffer=0x0, mem_req=0x6ad160,
credentials=0x6b8ea0,
0: resp_p=0x7fffd112bba0, io_type=PVFS_IO_WRITE,
op_id=0x7fffd112bb30,
0: user_ptr=0x0) at src/client/sysint/sys-io.sm:328
0: #6 0x000000000047facf in PVFS_sys_io (ref=
0: {handle = 1048563, fs_id = 1825963815, __pad1 = 0},
file_req=0x6ae960,
0: file_req_offset=0, buffer=0x0, mem_req=0x6ad160,
credentials=0x6b8ea0,
0: resp_p=0x7fffd112bba0, io_type=PVFS_IO_WRITE)
0: at src/client/sysint/sys-io.sm:351
0: #7 0x0000000000458cb2 in ADIOI_PVFS2_WriteStrided (fd=0x6b8d00,
0: buf=0x2aaaaaaab010, count=26214, datatype=-1946157050,
file_ptr_type=101,
0: offset=0, status=0x7fffd112be30, error_code=0x7fffd112bd70)
0: at /nfs/home/atchley/projects/mpich2/mpich2-snap-200706132016/
src/mpi/romio/adio/ad_pvfs2/ad_pvfs2_write.c:1001
0: #8 0x000000000041afcb in MPIOI_File_write (mpi_fh=0x6b8d00,
offset=0,
0: file_ptr_type=101, buf=0x2aaaaaaab010, count=26214,
datatype=-1946157050,
0: myname=0x63ac74 "MPI_FILE_WRITE", status=0x7fffd112be30)
0: at /nfs/home/atchley/projects/mpich2/mpich2-snap-200706132016/
src/mpi/romio/mpi-io/write.c:156
0: #9 0x000000000041aafd in PMPI_File_write (mpi_fh=0x6b8d00,
0: buf=0x2aaaaaaab010, count=26214, datatype=-1946157050,
0: status=0x7fffd112be30)
0: at /nfs/home/atchley/projects/mpich2/mpich2-snap-200706132016/
src/mpi/romio/mpi-io/write.c:52
0: #10 0x000000000040461e in noncontigmem_noncontigfile (
0: filename=0x668110 "pvfs2://mnt/pvfs2/atchley/blah",
buf=0x2aaaaaaab010,
0: bufsize=1048560, dtype=-1946157050, offset=0, displs=0,
finfo=-1677721600,
0: veclen=10, elmtcount=1, veccount=26214) at noncontig.c:185
0: #11 0x000000000040738d in main (argc=1, argv=0x7fffd112c608)
0: at noncontig.c:1020
0: (gdb) s
0: 74 PVFS_offset contig_offset = 0; /* temp for offset of
a contig region */
0: (gdb)
0: 78 if (!PINT_IS_MEMREQ(mode))
0: (gdb)
0: 79 gossip_debug(GOSSIP_REQUEST_DEBUG,
0: (gdb)
0: 81 gossip_debug
(GOSSIP_REQUEST_DEBUG,"PINT_process_request\n");
0: (gdb)
0: 83 if (!req)
0: (gdb)
0: 88 if (!result || !result->segmax || !result->bytemax)
0: (gdb) p *result
0: $1 = {offset_array = 0x7fffd112b8a8, size_array = 0x7fffd112b8a0,
segmax = 1,
0: segs = 0, bytemax = -1291, bytes = 0}
_______________________________________________
Pvfs2-developers mailing list
[email protected]
http://www.beowulf-underground.org/mailman/listinfo/pvfs2-developers