Goel, Ankur wrote:
Hi All,
Is there an input format available for reading from tarballs
(.tar.gz files)?

Not at present. There is support for reading .gz files, but not .tar files. One problem is that there's no way to read a chunk of such an archive without reading everything that precedes that chunk. So, even if such an InputFormat were written, it would be unable to efficiently split the processing of a single archive among map tasks.
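
To illustrate the point about chunks, here is a minimal sketch in Python (standard library only, not Hadoop code). The helper names read_chunk_sequentially and can_start_decoding_at are made up for this example. It shows that reaching an offset in the uncompressed data means decompressing everything before it, and that a gzip decoder handed a slice that does not begin at the stream header rejects it.

```python
import gzip
import io
import zlib


def read_chunk_sequentially(gz_bytes: bytes, offset: int, length: int) -> bytes:
    """Return `length` uncompressed bytes starting at uncompressed `offset`.

    The only way to get there is to decompress (and throw away) all of the
    data that precedes the requested chunk.
    """
    with gzip.GzipFile(fileobj=io.BytesIO(gz_bytes)) as stream:
        stream.read(offset)  # decompress and discard the preceding bytes
        return stream.read(length)


def can_start_decoding_at(gz_bytes: bytes, compressed_offset: int) -> bool:
    """Try to begin gzip decoding at an arbitrary byte of the compressed data."""
    decoder = zlib.decompressobj(wbits=31)  # wbits=31: expect a gzip header
    try:
        decoder.decompress(gz_bytes[compressed_offset:])
        return True
    except zlib.error:
        return False


if __name__ == "__main__":
    payload = b"".join(f"record {i}\n".encode() for i in range(10000))
    archive = gzip.compress(payload)

    # Works, but only by decompressing the first 50000 bytes as well.
    print(read_chunk_sequentially(archive, 50000, 20))

    # Decoding from the start succeeds; a slice that skips the header does not.
    print(can_start_decoding_at(archive, 0))
    print(can_start_decoding_at(archive, 1))
```

A map task assigned the second half of a .gz file would be in the position of the second call: it would have to begin at byte 0 anyway, so splitting the file saves nothing.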

Doug
