I was just looking at this. It looks relatively easy to simply create a new codec and register it in the config files.
I have to say, btw, that the source tree structure of this project is pretty ornate and not very parallel. I needed to add 10 source roots in IntelliJ to get a clean compile. In this process, I noticed some circular dependencies. Would the committers be open to some small set of changes to remove cyclic dependencies? -----Original Message----- From: Milind Bhandarkar [mailto:[EMAIL PROTECTED] Sent: Fri 8/31/2007 11:53 AM To: [email protected] Subject: Re: Compression using Hadoop... On 8/31/07 10:43 AM, "Doug Cutting" <[EMAIL PROTECTED]> wrote: > > We really need someone to contribute an InputFormat for bzip files. > This has come up before: bzip is a standard compression format that is > splittable. +1 - milind -- Milind Bhandarkar 408-349-2136 ([EMAIL PROTECTED])
