Fellow HDFS developers,
Want to bring this issue up to your attention: HDFS-10935 
<https://issues.apache.org/jira/browse/HDFS-10935> (Java-based EC codec does 
not reconstruct blocks correctly)

Initially I found 9 out of 30 TestFileChecksum tests failed consistently on my 
Mac IntelliJ. More specifically, they failed with error “Checksum mismatch”.
Because jenkins job never picked up this failure, I thought it must have been 
some kind of environment issue.
However, after consulting a number of fellow HDFS developers (Kai, Sammi, Eddy, 
Sean, Andrew and John), it turns out that this set of tests fail without native 
ISA-L library.

I was able to set up two local Hadoop repo on a Linux box, one is built with 
native ISA-L lib and the other isn’t. And the one with native lib always 
succeeds and the other always fails.

If I understand them correctly, these tests verifies that blocks are 
reconstructed correctly after one datanode is shutdown, and therefore, the 
checksum mismatch w/o native ISA-L lib seems to suggest that Java-based EC 
codec does not reconstruct strips correctly. I do not have sufficient insight 
into EC reconstruction so I maybe wrong to interpret the failures this way. But 
I think this is severe enough to become a blocker for the next Hadoop 3 alpha 
release.

Best,
Wei-Chiu Chuang
A very happy Clouderan

Reply via email to