hi again,

I set "EventLogging: to 'bstream, trove' and i send you the logs

Best regards
Christos



data server log:
----------------------------------------
PVFS2 Server ready.
[D 07/12/2011 11:44:11] dbpf_bstream_rw_list: mem_count: 1, stream_count: 1
[D 07/12/2011 11:44:11] dbpf_bstream_rw_list: mem_offset: 0x4250b000, mem_size: 
262144
[D 07/12/2011 11:44:11] dbpf_bstream_rw_list: stream_offset: 0, stream_size: 
262144
[D 07/12/2011 11:44:11] DBPF I/O ops in progress: 1
[D 07/12/2011 11:44:11] lio_listio called with the following aiocbs:
[D 07/12/2011 11:44:11] aiocb_ptr_array[0]: fd: 12, off: 0, bytes: 262144, buf: 
0x4250b000, type: 1
[D 07/12/2011 11:44:11] [alt-aio]: pthread_create completed: id: 0, thread_id: 
(nil)
[D 07/12/2011 11:44:11] issue_or_delay_io_operation: lio_listio posted 0x154be0 
(handle 9223372036854775296, ret 0)
[D 07/12/2011 11:44:11] [alt-aio]: pwrite: cb_p: 0x157428, fd: 12, bufp: 
0x4250b000, size: 262144 off:0
[D 07/12/2011 11:44:11] dbpf_bstream_rw_list: mem_count: 1, stream_count: 1
[D 07/12/2011 11:44:11] dbpf_bstream_rw_list: mem_offset: 0x42d4d000, mem_size: 
262144
[D 07/12/2011 11:44:11] dbpf_bstream_rw_list: stream_offset: 262144, 
stream_size: 262144
[D 07/12/2011 11:44:11] DBPF I/O ops in progress: 2
[D 07/12/2011 11:44:11] lio_listio called with the following aiocbs:
[D 07/12/2011 11:44:11] aiocb_ptr_array[0]: fd: 12, off: 262144, bytes: 262144, 
buf: 0x42d4d000, type: 1
[D 07/12/2011 11:44:11] [alt-aio]: pthread_create completed: id: 0, thread_id: 
(nil)
[D 07/12/2011 11:44:11] issue_or_delay_io_operation: lio_listio posted 0x154d10 
(handle 9223372036854775296, ret 0)
[D 07/12/2011 11:44:11]  --- aio_progress_notification called with handle 
9223372036854775296 (0x154be0)
[D 07/12/2011 11:44:11] aio_progress_notification: BSTREAM_WRITE_LIST complete: 
aio_return() says 262144 [fd = 12]
[E 07/12/2011 11:44:11] trove_write_callback_fn: I/O error occurred
[E 07/12/2011 11:44:11] handle_io_error: flow proto error cleanup started on 
0x154938: No such file or directory
[D 07/12/2011 11:44:11] *** starting delayed ops if any.
[D 07/12/2011 11:44:11] DBPF I/O ops in progress: 1
[D 07/12/2011 11:44:11] [alt-aio]: pwrite: cb_p: 0x159830, fd: 12, bufp: 
0x42d4d000, size: 262144 off:262144
[D 07/12/2011 11:44:11] dbpf_dspace_cancel called for id 1395984.
[D 07/12/2011 11:44:11] Trove cancellation is not supported for this operation 
type; ignoring.
[E 07/12/2011 11:44:11] handle_io_error: flow proto 0x154938 canceled 2 
operations, will clean up.
[E 07/12/2011 11:44:11] bmi_recv_callback_fn: I/O error occurred
[D 07/12/2011 11:44:11]  --- aio_progress_notification called with handle 
9223372036854775296 (0x154d10)
[D 07/12/2011 11:44:11] aio_progress_notification: BSTREAM_WRITE_LIST complete: 
aio_return() says 262144 [fd = 12]
[E 07/12/2011 11:44:11] trove_write_callback_fn: I/O error occurred
[E 07/12/2011 11:44:11] handle_io_error: flow proto 0x154938 error cleanup 
finished: No such file or directory
[D 07/12/2011 11:44:11] *** starting delayed ops if any.
[D 07/12/2011 11:44:11] DBPF I/O ops in progress: 0

----------------------------------------
client log

mem_to_bmi_callback_fn: I/O error occurred
[E 14:44:12.211306] handle_io_error: flow proto error cleanup started on 
0x18568758: Connection reset by peer
[E 14:44:12.211316] handle_io_error: flow proto 0x18568758 canceled 0 
operations, will clean up.
[E 14:44:12.211324] handle_io_error: flow proto 0x18568758 error cleanup 
finished: Connection reset by peer
[E 14:44:14.216852] io_process_context_recv (op_status): No such file or 
directory
[E 14:44:14.216869] server: tcp://nas-0-1:3334
[E 14:44:16.221357] io_process_context_recv (op_status): No such file or 
directory
[E 14:44:16.221372] server: tcp://nas-0-1:3334
[E 14:44:18.228376] io_process_context_recv (op_status): No such file or 
directory
[E 14:44:18.228409] server: tcp://nas-0-1:3334
[E 14:44:20.232403] io_process_context_recv (op_status): No such file or 
directory
[E 14:44:20.232428] server: tcp://nas-0-1:3334
[E 14:44:22.236368] io_process_context_recv (op_status): No such file or 
directory
[E 14:44:22.236383] server: tcp://nas-0-1:3334
PVFS_sys_write: Connection reset by peer (error class: 128)
Error in write


----------------------------------

--------------------------------


---------------------------------------------------------
sbin/pvfs2-server pvfs-orange_all.conf  -d -a nas-0-1
[S 07/12/2011 11:37:09] PVFS2 Server on node nas-0-1 version 2.8.4-orangefs 
starting...
[D 07/12/2011 11:37:09] dbpf_thread_initialize: initialized
[D 07/12/2011 11:37:09] dbpf_collection_lookup of coll: pvfs2-fs
[D 07/12/2011 11:37:09] Unlinking old db cache file: 
//orange5/34a3980f//__db.001
[D 07/12/2011 11:37:09] dbpf using default db cache size.
[D 07/12/2011 11:37:09] dbpf using shm key: 1529703770
[D 07/12/2011 11:37:09] dbpf_thread_function started
[D 07/12/2011 11:37:09] collection lookup: version is 0.1.5
[D 07/12/2011 11:37:09] dbpf collection 883136527 - Setting handle timeout to 
360000000 microseconds
[D 07/12/2011 11:37:09] - set handle re-use timeout to 360 seconds (ret=0)
[D 07/12/2011 11:37:09] dbpf collection 883136527 - Setting cache keywords of 
attribute cache to dh,
[D 07/12/2011 11:37:09] dbpf collection 883136527 - Setting cache size of 
attribute cache to 511
[D 07/12/2011 11:37:09] dbpf collection 883136527 - Setting maximum elements of 
attribute cache to 1024
[D 07/12/2011 11:37:09] dbpf collection 883136527 - Initialize collection attr. 
cache
[D 07/12/2011 11:37:09] dbpf collection 883136527 - Setting collection handle 
ranges to 4611686018427387905-9223372036854775806
[D 07/12/2011 11:37:09] dbpf collection 883136527 - Setting HIGH_WATERMARK to 8
[D 07/12/2011 11:37:09] dbpf collection 883136527 - Setting LOW_WATERMARK to 1
[D 07/12/2011 11:37:09] dbpf collection 883136527 - Enabling sync mode
[D 07/12/2011 11:37:09] dbpf collection 883136527 - Disabling immediate 
completion
[S 07/12/2011 11:37:09] PVFS2 Server ready.
-------------------------------
 PVFS2 Server on node nas-0-1 version 2.8.4-orangefs starting...
[D 07/12/2011 11:35:22] dbpf_initialize failure: storage lookup failed
[D 07/12/2011 11:35:22] dbpf_thread_initialize: initialized
[D 07/12/2011 11:35:22] dbpf_collection_lookup of coll: pvfs2-fs
[D 07/12/2011 11:35:22] dbpf_thread_function started
[D 07/12/2011 11:35:22] wrote trove-dbpf version 0.1.5 to collection attribute 
database
[D 07/12/2011 11:35:23] dbpf_collection_lookup of coll: pvfs2-fs
[D 07/12/2011 11:35:23] dbpf using default db cache size.
[D 07/12/2011 11:35:23] dbpf using shm key: 646567223
[D 07/12/2011 11:35:23] collection lookup: version is 0.1.5
[D 07/12/2011 11:35:23] dbpf collection 883136527 - Setting collection handle 
ranges to 4611686018427387905-9223372036854775806
[D 07/12/2011 11:35:23] dbpf_thread_function ending
[D 07/12/2011 11:35:23] dbpf_thread_finalize: finalized
[D 07/12/2011 11:35:23] PVFS2 Server: storage space created. Exiting.
---------------------------------


----- Original Message -----
From: "Michael Moore" <[email protected]>
To: "Christos Filippidis" <[email protected]>
Cc: [email protected]
Sent: Tuesday, July 12, 2011 2:17:36 PM
Subject: Re: [Pvfs2-users] trove_write_callback_fn: I/O error occurred

I don't have any ideas off-hand, you may be right that it's an architecture 
issue. Can you re-run the test with EventLogging set to 'bstream, trove' in the 
server configuration and send those logs? 

Michael 


On Tue, Jul 12, 2011 at 4:03 AM, Christos Filippidis < 
[email protected] > wrote: 


Hi, 

I am trying to configure orangefs for testing to the following system: 

Metadata server /client: 
Centos 5 :Linux 2.6.18-194.17.4.el5 #1 SMP Mon Oct 25 15:50:53 EDT 2010 x86_64 
x86_64 x86_64 GNU/Linux 

Data server: 
Debian Lenny :Linux 2.6.17.14 #16 PREEMPT Sat Feb 5 12:45:55 UTC 2011 armv5tejl 
GNU/Linux 

Data server configuration options: 

./configure --prefix=/shares/internal/PUBLIC/orange 

Metadata server /client configuration options: 

./configure --prefix=/state/partition1/orange 
--with-kernel=/usr/src/kernels/2.6.18-194.17.4.el5-x86_64/ 

The compilation and installation was successful. 
"bin/pvfs2-ping -m /mnt/orange" was also successful. 

But every time i try to "pvfs2-cp" a file i am receiving a 
"trove_write_callback_fn: I/O error occurred" at the data server side(the same 
happens with pvfs-2.8.2) 

I tried to reconfigure orangefs with many different options but nothing seams 
to works. 
I also tried "alt-aio" "null-aio" "directio" but nothing happened. 

It seams that something strange happens with the data server(ARM processor - 
200MHZ,32MB RAM)because both orangefs and pvfs2 works fine when i am using as 
data servers intel or AMD processors. 

Do you have any idea what is wrong ? 
i really need to install orangefs at this ARM processor. 
Thanks in advance 
Christos 

ps: i send you some log files from the data server 

alt-aio 
-------- 
PVFS2 Server version 2.8.4-orangefs starting. 
[E 07/12/2011 07:38:30] trove_write_callback_fn: I/O error occurred 
[E 07/12/2011 07:38:30] handle_io_error: flow proto error cleanup started on 
0x154a88: No such file or directory 
[E 07/12/2011 07:38:30] handle_io_error: flow proto 0x154a88 canceled 0 
operations, will clean up. 
[E 07/12/2011 07:38:30] handle_io_error: flow proto 0x154a88 error cleanup 
finished: No such file or directory 


directio 
----------- 
PVFS2 Server version 2.8.4-orangefs starting. 
[E 07/12/2011 07:56:39] dbpf_bstream_direct_write_op_svc: failed to get dspace 
attr for bstream: (error=-1073742082) 
[E 07/12/2011 07:57:09] job_time_mgr_expire: job time out: cancelling flow 
operation, job_id: 505. 
[E 07/12/2011 07:57:09] fp_multiqueue_cancel: flow proto cancel called on 
0x113f00 
[E 07/12/2011 07:57:09] fp_multiqueue_cancel: I/O error occurred 
[E 07/12/2011 07:57:09] handle_io_error: flow proto error cleanup started on 
0x113f00: Operation cancelled (possibly due to timeout) 



bin/pvfs2-ping -m /mnt/orange/ 

(1) Parsing tab file... 

(2) Initializing system interface... 

(3) Initializing each file system found in tab file: /etc/fstab... 

PVFS2 servers: tcp://nas-0-1:3334 
Storage name: pvfs2-fs 
Local mount point: /mnt/orange 
/mnt/orange: Ok 

(4) Searching for /mnt/orange/ in pvfstab... 

PVFS2 servers: tcp://nas-0-1:3334 
Storage name: pvfs2-fs 
Local mount point: /mnt/orange 

meta servers: 
tcp://ikaros:3334 

data servers: 
tcp://nas-0-1:3334 

(5) Verifying that all servers are responding... 

meta servers: 
tcp://ikaros:3334 Ok 

data servers: 
tcp://nas-0-1:3334 Ok 

(6) Verifying that fsid 883136527 is acceptable to all servers... 

Ok; all servers understand fs_id 883136527 

(7) Verifying that root handle is owned by one server... 

Root handle: 1048576 
Ok; root handle is owned by exactly one server. 

============================================================= 

The PVFS2 filesystem at /mnt/orange/ appears to be correctly configured. 



-- 
Christos Filippidis 
Institute of Nuclear Physics 
National Center for Scientific Research "Demokritos" 
Patr. Grigoriou & Neapoleos Str 
153 10 Agia Paraskevi Attikis 
Athens, Greece 
Tel:+30 2106503425 

http://www.inp.demokritos.gr/~filippidisx/ 
_______________________________________________ 
Pvfs2-users mailing list 
[email protected] 
http://www.beowulf-underground.org/mailman/listinfo/pvfs2-users 

-- 
Christos Filippidis
Institute of Nuclear Physics
National Center for Scientific Research "Demokritos"
Patr. Grigoriou & Neapoleos Str
153 10 Agia Paraskevi Attikis
Athens, Greece        
Tel:+30 2106503425 

http://www.inp.demokritos.gr/~filippidisx/
_______________________________________________
Pvfs2-users mailing list
[email protected]
http://www.beowulf-underground.org/mailman/listinfo/pvfs2-users

Reply via email to