hi again, I set "EventLogging: to 'bstream, trove' and i send you the logs
Best regards Christos data server log: ---------------------------------------- PVFS2 Server ready. [D 07/12/2011 11:44:11] dbpf_bstream_rw_list: mem_count: 1, stream_count: 1 [D 07/12/2011 11:44:11] dbpf_bstream_rw_list: mem_offset: 0x4250b000, mem_size: 262144 [D 07/12/2011 11:44:11] dbpf_bstream_rw_list: stream_offset: 0, stream_size: 262144 [D 07/12/2011 11:44:11] DBPF I/O ops in progress: 1 [D 07/12/2011 11:44:11] lio_listio called with the following aiocbs: [D 07/12/2011 11:44:11] aiocb_ptr_array[0]: fd: 12, off: 0, bytes: 262144, buf: 0x4250b000, type: 1 [D 07/12/2011 11:44:11] [alt-aio]: pthread_create completed: id: 0, thread_id: (nil) [D 07/12/2011 11:44:11] issue_or_delay_io_operation: lio_listio posted 0x154be0 (handle 9223372036854775296, ret 0) [D 07/12/2011 11:44:11] [alt-aio]: pwrite: cb_p: 0x157428, fd: 12, bufp: 0x4250b000, size: 262144 off:0 [D 07/12/2011 11:44:11] dbpf_bstream_rw_list: mem_count: 1, stream_count: 1 [D 07/12/2011 11:44:11] dbpf_bstream_rw_list: mem_offset: 0x42d4d000, mem_size: 262144 [D 07/12/2011 11:44:11] dbpf_bstream_rw_list: stream_offset: 262144, stream_size: 262144 [D 07/12/2011 11:44:11] DBPF I/O ops in progress: 2 [D 07/12/2011 11:44:11] lio_listio called with the following aiocbs: [D 07/12/2011 11:44:11] aiocb_ptr_array[0]: fd: 12, off: 262144, bytes: 262144, buf: 0x42d4d000, type: 1 [D 07/12/2011 11:44:11] [alt-aio]: pthread_create completed: id: 0, thread_id: (nil) [D 07/12/2011 11:44:11] issue_or_delay_io_operation: lio_listio posted 0x154d10 (handle 9223372036854775296, ret 0) [D 07/12/2011 11:44:11] --- aio_progress_notification called with handle 9223372036854775296 (0x154be0) [D 07/12/2011 11:44:11] aio_progress_notification: BSTREAM_WRITE_LIST complete: aio_return() says 262144 [fd = 12] [E 07/12/2011 11:44:11] trove_write_callback_fn: I/O error occurred [E 07/12/2011 11:44:11] handle_io_error: flow proto error cleanup started on 0x154938: No such file or directory [D 07/12/2011 11:44:11] *** starting delayed ops if any. [D 07/12/2011 11:44:11] DBPF I/O ops in progress: 1 [D 07/12/2011 11:44:11] [alt-aio]: pwrite: cb_p: 0x159830, fd: 12, bufp: 0x42d4d000, size: 262144 off:262144 [D 07/12/2011 11:44:11] dbpf_dspace_cancel called for id 1395984. [D 07/12/2011 11:44:11] Trove cancellation is not supported for this operation type; ignoring. [E 07/12/2011 11:44:11] handle_io_error: flow proto 0x154938 canceled 2 operations, will clean up. [E 07/12/2011 11:44:11] bmi_recv_callback_fn: I/O error occurred [D 07/12/2011 11:44:11] --- aio_progress_notification called with handle 9223372036854775296 (0x154d10) [D 07/12/2011 11:44:11] aio_progress_notification: BSTREAM_WRITE_LIST complete: aio_return() says 262144 [fd = 12] [E 07/12/2011 11:44:11] trove_write_callback_fn: I/O error occurred [E 07/12/2011 11:44:11] handle_io_error: flow proto 0x154938 error cleanup finished: No such file or directory [D 07/12/2011 11:44:11] *** starting delayed ops if any. [D 07/12/2011 11:44:11] DBPF I/O ops in progress: 0 ---------------------------------------- client log mem_to_bmi_callback_fn: I/O error occurred [E 14:44:12.211306] handle_io_error: flow proto error cleanup started on 0x18568758: Connection reset by peer [E 14:44:12.211316] handle_io_error: flow proto 0x18568758 canceled 0 operations, will clean up. [E 14:44:12.211324] handle_io_error: flow proto 0x18568758 error cleanup finished: Connection reset by peer [E 14:44:14.216852] io_process_context_recv (op_status): No such file or directory [E 14:44:14.216869] server: tcp://nas-0-1:3334 [E 14:44:16.221357] io_process_context_recv (op_status): No such file or directory [E 14:44:16.221372] server: tcp://nas-0-1:3334 [E 14:44:18.228376] io_process_context_recv (op_status): No such file or directory [E 14:44:18.228409] server: tcp://nas-0-1:3334 [E 14:44:20.232403] io_process_context_recv (op_status): No such file or directory [E 14:44:20.232428] server: tcp://nas-0-1:3334 [E 14:44:22.236368] io_process_context_recv (op_status): No such file or directory [E 14:44:22.236383] server: tcp://nas-0-1:3334 PVFS_sys_write: Connection reset by peer (error class: 128) Error in write ---------------------------------- -------------------------------- --------------------------------------------------------- sbin/pvfs2-server pvfs-orange_all.conf -d -a nas-0-1 [S 07/12/2011 11:37:09] PVFS2 Server on node nas-0-1 version 2.8.4-orangefs starting... [D 07/12/2011 11:37:09] dbpf_thread_initialize: initialized [D 07/12/2011 11:37:09] dbpf_collection_lookup of coll: pvfs2-fs [D 07/12/2011 11:37:09] Unlinking old db cache file: //orange5/34a3980f//__db.001 [D 07/12/2011 11:37:09] dbpf using default db cache size. [D 07/12/2011 11:37:09] dbpf using shm key: 1529703770 [D 07/12/2011 11:37:09] dbpf_thread_function started [D 07/12/2011 11:37:09] collection lookup: version is 0.1.5 [D 07/12/2011 11:37:09] dbpf collection 883136527 - Setting handle timeout to 360000000 microseconds [D 07/12/2011 11:37:09] - set handle re-use timeout to 360 seconds (ret=0) [D 07/12/2011 11:37:09] dbpf collection 883136527 - Setting cache keywords of attribute cache to dh, [D 07/12/2011 11:37:09] dbpf collection 883136527 - Setting cache size of attribute cache to 511 [D 07/12/2011 11:37:09] dbpf collection 883136527 - Setting maximum elements of attribute cache to 1024 [D 07/12/2011 11:37:09] dbpf collection 883136527 - Initialize collection attr. cache [D 07/12/2011 11:37:09] dbpf collection 883136527 - Setting collection handle ranges to 4611686018427387905-9223372036854775806 [D 07/12/2011 11:37:09] dbpf collection 883136527 - Setting HIGH_WATERMARK to 8 [D 07/12/2011 11:37:09] dbpf collection 883136527 - Setting LOW_WATERMARK to 1 [D 07/12/2011 11:37:09] dbpf collection 883136527 - Enabling sync mode [D 07/12/2011 11:37:09] dbpf collection 883136527 - Disabling immediate completion [S 07/12/2011 11:37:09] PVFS2 Server ready. ------------------------------- PVFS2 Server on node nas-0-1 version 2.8.4-orangefs starting... [D 07/12/2011 11:35:22] dbpf_initialize failure: storage lookup failed [D 07/12/2011 11:35:22] dbpf_thread_initialize: initialized [D 07/12/2011 11:35:22] dbpf_collection_lookup of coll: pvfs2-fs [D 07/12/2011 11:35:22] dbpf_thread_function started [D 07/12/2011 11:35:22] wrote trove-dbpf version 0.1.5 to collection attribute database [D 07/12/2011 11:35:23] dbpf_collection_lookup of coll: pvfs2-fs [D 07/12/2011 11:35:23] dbpf using default db cache size. [D 07/12/2011 11:35:23] dbpf using shm key: 646567223 [D 07/12/2011 11:35:23] collection lookup: version is 0.1.5 [D 07/12/2011 11:35:23] dbpf collection 883136527 - Setting collection handle ranges to 4611686018427387905-9223372036854775806 [D 07/12/2011 11:35:23] dbpf_thread_function ending [D 07/12/2011 11:35:23] dbpf_thread_finalize: finalized [D 07/12/2011 11:35:23] PVFS2 Server: storage space created. Exiting. --------------------------------- ----- Original Message ----- From: "Michael Moore" <[email protected]> To: "Christos Filippidis" <[email protected]> Cc: [email protected] Sent: Tuesday, July 12, 2011 2:17:36 PM Subject: Re: [Pvfs2-users] trove_write_callback_fn: I/O error occurred I don't have any ideas off-hand, you may be right that it's an architecture issue. Can you re-run the test with EventLogging set to 'bstream, trove' in the server configuration and send those logs? Michael On Tue, Jul 12, 2011 at 4:03 AM, Christos Filippidis < [email protected] > wrote: Hi, I am trying to configure orangefs for testing to the following system: Metadata server /client: Centos 5 :Linux 2.6.18-194.17.4.el5 #1 SMP Mon Oct 25 15:50:53 EDT 2010 x86_64 x86_64 x86_64 GNU/Linux Data server: Debian Lenny :Linux 2.6.17.14 #16 PREEMPT Sat Feb 5 12:45:55 UTC 2011 armv5tejl GNU/Linux Data server configuration options: ./configure --prefix=/shares/internal/PUBLIC/orange Metadata server /client configuration options: ./configure --prefix=/state/partition1/orange --with-kernel=/usr/src/kernels/2.6.18-194.17.4.el5-x86_64/ The compilation and installation was successful. "bin/pvfs2-ping -m /mnt/orange" was also successful. But every time i try to "pvfs2-cp" a file i am receiving a "trove_write_callback_fn: I/O error occurred" at the data server side(the same happens with pvfs-2.8.2) I tried to reconfigure orangefs with many different options but nothing seams to works. I also tried "alt-aio" "null-aio" "directio" but nothing happened. It seams that something strange happens with the data server(ARM processor - 200MHZ,32MB RAM)because both orangefs and pvfs2 works fine when i am using as data servers intel or AMD processors. Do you have any idea what is wrong ? i really need to install orangefs at this ARM processor. Thanks in advance Christos ps: i send you some log files from the data server alt-aio -------- PVFS2 Server version 2.8.4-orangefs starting. [E 07/12/2011 07:38:30] trove_write_callback_fn: I/O error occurred [E 07/12/2011 07:38:30] handle_io_error: flow proto error cleanup started on 0x154a88: No such file or directory [E 07/12/2011 07:38:30] handle_io_error: flow proto 0x154a88 canceled 0 operations, will clean up. [E 07/12/2011 07:38:30] handle_io_error: flow proto 0x154a88 error cleanup finished: No such file or directory directio ----------- PVFS2 Server version 2.8.4-orangefs starting. [E 07/12/2011 07:56:39] dbpf_bstream_direct_write_op_svc: failed to get dspace attr for bstream: (error=-1073742082) [E 07/12/2011 07:57:09] job_time_mgr_expire: job time out: cancelling flow operation, job_id: 505. [E 07/12/2011 07:57:09] fp_multiqueue_cancel: flow proto cancel called on 0x113f00 [E 07/12/2011 07:57:09] fp_multiqueue_cancel: I/O error occurred [E 07/12/2011 07:57:09] handle_io_error: flow proto error cleanup started on 0x113f00: Operation cancelled (possibly due to timeout) bin/pvfs2-ping -m /mnt/orange/ (1) Parsing tab file... (2) Initializing system interface... (3) Initializing each file system found in tab file: /etc/fstab... PVFS2 servers: tcp://nas-0-1:3334 Storage name: pvfs2-fs Local mount point: /mnt/orange /mnt/orange: Ok (4) Searching for /mnt/orange/ in pvfstab... PVFS2 servers: tcp://nas-0-1:3334 Storage name: pvfs2-fs Local mount point: /mnt/orange meta servers: tcp://ikaros:3334 data servers: tcp://nas-0-1:3334 (5) Verifying that all servers are responding... meta servers: tcp://ikaros:3334 Ok data servers: tcp://nas-0-1:3334 Ok (6) Verifying that fsid 883136527 is acceptable to all servers... Ok; all servers understand fs_id 883136527 (7) Verifying that root handle is owned by one server... Root handle: 1048576 Ok; root handle is owned by exactly one server. ============================================================= The PVFS2 filesystem at /mnt/orange/ appears to be correctly configured. -- Christos Filippidis Institute of Nuclear Physics National Center for Scientific Research "Demokritos" Patr. Grigoriou & Neapoleos Str 153 10 Agia Paraskevi Attikis Athens, Greece Tel:+30 2106503425 http://www.inp.demokritos.gr/~filippidisx/ _______________________________________________ Pvfs2-users mailing list [email protected] http://www.beowulf-underground.org/mailman/listinfo/pvfs2-users -- Christos Filippidis Institute of Nuclear Physics National Center for Scientific Research "Demokritos" Patr. Grigoriou & Neapoleos Str 153 10 Agia Paraskevi Attikis Athens, Greece Tel:+30 2106503425 http://www.inp.demokritos.gr/~filippidisx/ _______________________________________________ Pvfs2-users mailing list [email protected] http://www.beowulf-underground.org/mailman/listinfo/pvfs2-users
