Hi all, 

I have some problem in getting the client connected with few partition of a 
lustre file-system

In particular some days ago, we have serious issues on two raidset. 
After a reboot the partition becomes already available, at least locally to the 
file server. I tried to mount the partition, after an "e2fsck" on the 
partition. The e2fsck found some issue and fixed them. 

It seems that most of the nodes, keep connected to the partition that 
experienced problems, but the nodes on which those partition where 
"deactivated" are not able to re-join affected partitions.

In particular, on the server side I see those error on the logs,

Jan 26 14:43:59 dot1-se-01 kernel: LustreError: 
6542:0:(filter_io_26.c:684:filter_commitrw_write()) error starting transaction: 
rc = -30
Jan 26 14:44:08 dot1-se-01 kernel: LustreError: 
6578:0:(filter_io_26.c:684:filter_commitrw_write()) error starting transaction: 
rc = -30
Jan 26 14:44:18 dot1-se-01 kernel: LustreError: 
6485:0:(filter_io_26.c:684:filter_commitrw_write()) error starting transaction: 
rc = -30
Jan 26 14:44:18 dot1-se-01 kernel: LustreError: 
6555:0:(filter_io_26.c:684:filter_commitrw_write()) error starting transaction: 
rc = -30
Jan 26 14:44:26 dot1-se-01 kernel: LustreError: 
6496:0:(filter_io_26.c:684:filter_commitrw_write()) error starting transaction: 
rc = -30
Jan 26 14:44:28 dot1-se-01 kernel: LustreError: 
6512:0:(filter_io_26.c:684:filter_commitrw_write()) error starting transaction: 
rc = -30

while on the client I see: 

Jan 26 15:24:06 pccms35 kernel: LustreError: 11-0: an error occurred while 
communicating with 212.189.205...@tcp. The ost_connect operation failed with -30
Jan 26 15:24:06 pccms35 kernel: LustreError: Skipped 77 previous similar 
messages
Jan 26 15:25:46 pccms35 kernel: Lustre: 
9624:0:(import.c:508:import_select_connection()) 
lustre-OST0001-osc-ffff81019f1d3800: tried all connections, increasing latency 
to 36s
Jan 26 15:25:46 pccms35 kernel: Lustre: 
9624:0:(import.c:508:import_select_connection()) Skipped 77 previous similar 
messages

The same behavior  is shown also by "new" client joining the cluster. 

Any hint on this kind of issue? 

Best Regards, 
Cheers,
Giacinto

-- 
-- 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Giacinto Donvito    LIBI -- EGEE3 SA1 INFN - Bari ITALY
------------------------------------------------------------------
[email protected]                   | GTalk/GMail: 
[email protected]
tel. +39 080 5443244   Fax  +39 0805442470    | Skype: giacinto_it
VOIP:  +41225481596           | MSN: [email protected]
AIM/iChat: gdonvito1                          | Yahoo: eric1_it 
------------------------------------------------------------------
"At least once in a lifetime
it is convenient to put everything to discussion"
Descartes

Attachment: smime.p7s
Description: S/MIME cryptographic signature

_______________________________________________
Lustre-discuss mailing list
[email protected]
http://lists.lustre.org/mailman/listinfo/lustre-discuss

Reply via email to