Hi, I have binary files in HDFS that I am writing an InputFormat (and RecordReader) for. The binary format is [X, 4 bytes long][Y, X bytes long], where X evaluates to an int, and the pattern repeats as XYXYXYXY. I use X (the size) to know the length of the next record (Y) to read.
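
For illustration, a minimal sketch of reading this layout sequentially from any InputStream (for example, one opened on an HDFS file). It assumes X is stored as a big-endian signed int, which the post does not state; the class and method names (LengthPrefixedReader, readAll) are made up for the example and are not part of Hadoop.

```java
import java.io.DataInputStream;
import java.io.EOFException;
import java.io.IOException;
import java.io.InputStream;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper: reads [X: 4-byte length][Y: X bytes] records until end of stream.
class LengthPrefixedReader {

    static List<byte[]> readAll(InputStream in) throws IOException {
        DataInputStream data = new DataInputStream(in);
        List<byte[]> records = new ArrayList<>();
        while (true) {
            // Peek at the first header byte so a clean end of stream
            // can be told apart from a truncated header.
            int first = data.read();
            if (first < 0) {
                break; // no more records
            }
            // Assumption: X is big-endian. The remaining 3 header bytes must exist,
            // otherwise readUnsignedByte throws EOFException.
            int length = (first << 24)
                    | (data.readUnsignedByte() << 16)
                    | (data.readUnsignedByte() << 8)
                    | data.readUnsignedByte();
            if (length < 0) {
                throw new IOException("Negative record length: " + length);
            }
            byte[] payload = new byte[length];
            // readFully throws EOFException if the payload is cut short.
            data.readFully(payload);
            records.add(payload);
        }
        return records;
    }
}
```

Note that this only works when reading starts exactly at a record boundary, such as the beginning of the file. Starting at an arbitrary byte offset, nothing in the layout itself marks where the next X begins.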

Does that mean I cannot support isSplitable() == true, because the records are variable length? Are there any tips or best practices for reading binary file formats?

Thanks,
Will