Thanks!
I do have a question, though. Feature level 1340 I believe is equivalent
to GPFS version 3.5.0.11. Feature level 1502 is GPFS 4.2 if I understand
correctly. That suggests to me there are 3.5 and 4.2 nodes in the same
cluster? Or at least 4.2 nodes in a cluster where the max feature level
is 1340. I didn't think either of those are supported configurations? Am
I missing something?
-Aaron
On 12/7/16 11:56 AM, Sander Kuusemets wrote:
It might have been some kind of a bug only we got, but I thought I'd
share, just in case.
The email when they said they opened a ticket for this bug's fix was
quite exactly a month ago, so I doubt they've fixed it, as they said it
might take a while.
I don't know if this is of any help, but a paragraph from the explanation:
The assert "msgLen >= (sizeof(Pad32) + 0)" is from routine
PIT_HelperGetWorkMH(). There are two RPC structures used in this routine
- PitHelperWorkReport
- PitInodeListPacket
The problematic one is the 'PitInodeListPacket' subrpc which is a part
of an "interesting inode" code change. Looking at the dumps its
evident that node 'stage3' which sent the RPC is not capable of
supporting interesting inode (max feature level is 1340) and node
tank1 which is receiving it is trying to interpret the RPC beyond the
valid region (as its feature level 1502 supports PIT interesting
inodes). This is resulting in the assert you see. As a short term
measure bringing all the nodes to the same feature level should make
the problem go away. But since we support backward compatibility, we
are opening an APAR to create a code fix. It's unfortunately going to
be a tricky fix, which is going to take a significant amount of time.
Therefore I don't expect the team will be able to provide an efix
anytime soon. We recommend you bring all nodes in all clusters up the
latest level 4.2.0.4 and run the "mmchconfig release=latest" and
"mmchfs -V full" commands that will ensure all daemon levels and fs
levels are at the necessary level that supports the 1502 RPC feature
level.
Best regards,
--
Aaron Knister
NASA Center for Climate Simulation (Code 606.2)
Goddard Space Flight Center
(301) 286-2776
_______________________________________________
gpfsug-discuss mailing list
gpfsug-discuss at spectrumscale.org
http://gpfsug.org/mailman/listinfo/gpfsug-discuss