I have good news, my problem is fixed.  After having the issues with ADSM/TSM 
restores, I tried to mitigate my issues by ftp'ing the data to another machine.  After 
the ftp, the checksums on the files did not match.  I tried rcp and again, the 
checksums did not match.

So, armed with that information, I contacted IBM, and the problem was with the drivers 
for the gigabit ethernet cards.  The cards in question are 10/100/1000 Base-TX PCI-X 
Adapter (14106902) and the driver at fault was devices.pci.14106902.rte at the 5.1.0.0 
level.  IBM cited two apars, IY35540 at the 5.1.0.1 level and IY38866 at the 5.1.0.3 
level.  IBM specifically said the 5.1.0.3 level was needed to correct the problem.

As this was a production system, and reboots to get the driver and kernal changes was 
not a good option, the work-around was to set two attributes on the card to "no", 
those attributes being chksum_offload and large_send.

Turning off these two attributes, made ftp, rcp and adsm/tsm work correctly again.

Just wanted to pass along the details, if anyone ever searched for a similar problem.

-Tom 



>>Tom Melton sent on 01/15/04 at 15:58

Details (I know - backlevel) -

TSM client 4.2.3 32bit
Client OS AIX 5.1  5100-03

TSM server 4.1.4
Server OS AIX 4.3.2

Tape subsystem - SCSI attached 3590 256 track

An oracle database file was archived last night, and according to the log, it was 
archived fine.  Attempt to retrieve the file today, and the TSM client waits for the 
tape mount, tape gets mounted, and then the retrieve progresses to a point, and stops. 
 Then the session times-out due to commtimeout at 1200 seconds.  

The dsmerror.log is full of the following:

01/15/04   11:55:37 The 103068111th code was found to be out of sequence.
The code (3432) was greater than (2259), the next available slot in the string table.
01/15/04   11:55:37 The 103068112th code was found to be out of sequence.
The code (3914) was greater than (2260), the next available slot in the string table.
01/15/04   11:55:37 The 103068114th code was found to be out of sequence.
The code (2472) was greater than (2262), the next available slot in the string table.
01/15/04   11:55:37 The 103068118th code was found to be out of sequence.
The code (3407) was greater than (2266), the next available slot in the string table.
01/15/04   11:55:37 The 103068120th code was found to be out of sequence.
The code (3463) was greater than (2268), the next available slot in the string table.
01/15/04   11:55:37 The 103068123th code was found to be out of sequence.
The code (2906) was greater than (2271), the next available slot in the string table.

I have retrieved other files off the tape.  The file that cannot be retrieved was 
compressed at the client.  The file that was retrieved successfully was NOT compressed 
at the client (grew during archive - then retried).

Any idea what this actually means?  I searched the archives at adsm.org, and saw 
several other posts, but no real explanation.  Any help would be appreciated.  

Thanks..

-Tom

Reply via email to