Hi,

I have binary files in the HDFS that I am creating a InputFormat (and
RecordReader) for. The binary format is something like [X of length 4
bytes][Y of X size], where X evaluates to an int, and the pattern
continues as XYXYXYXY. I use X (size) to know the length of the next
record to read (Y).

Does that mean I then cannot support isSplitable() == true because the
records are variable length?

Are there any tips or best practices in reading in binary file formats?

Thanks,
Will

Reply via email to