Tim Sears created BEAM-683: ------------------------------ Summary: Make BZIP compressed files splittable Key: BEAM-683 URL: https://issues.apache.org/jira/browse/BEAM-683 Project: Beam Issue Type: Wish Reporter: Tim Sears Priority: Minor
Bzip2 is compressed as blocks, so it should be possible to do dynamic splitting. To do this: Seek to a location in the bzip, then keep seeking until you find the 6 byte block-start sequence 0x314159265359 (which is the 12 digit approximation of pi). You can use a bzip2 decompressor from that point onwards. -- This message was sent by Atlassian JIRA (v6.3.4#6332)