[jira] Updated: (HADOOP-5281) GzipCodec fails second time it is used in a process

Chris Douglas (JIRA) Mon, 16 Mar 2009 18:24:17 -0700

     [ https://issues.apache.org/jira/browse/HADOOP-5281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]


Chris Douglas updated HADOOP-5281:
----------------------------------

    Attachment: 5281-0.patch
                GZt.java

When GzipCodec is specified, it's used only for the data, not the index (which 
always uses DefaultCodec). When the two codecs match, there's no problem.

It's clear that the index is being read with GzipCodec after being written with 
DefaultCodec (so the decompressor expects a gzip header, but doesn't find one). 
Still trying to determine how that's happening...
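
To illustrate the kind of mismatch described above, here is a minimal sketch 
using only java.util.zip as a stand-in; it does not use Hadoop's GzipCodec or 
DefaultCodec, and it does not reproduce the bug itself. It assumes the index is 
a zlib-format stream (what DeflaterOutputStream writes) and shows that a gzip 
reader rejects it at the header. The class and method names are hypothetical.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;
import java.util.zip.DeflaterOutputStream;
import java.util.zip.GZIPInputStream;
import java.util.zip.InflaterInputStream;

// Hypothetical illustration class; java.util.zip stands in for Hadoop's codecs.
class HeaderMismatchSketch {

    // Compress with a zlib-format writer (no gzip header is emitted).
    static byte[] writeZlib(byte[] data) throws IOException {
        ByteArrayOutputStream buf = new ByteArrayOutputStream();
        try (DeflaterOutputStream out = new DeflaterOutputStream(buf)) {
            out.write(data);
        }
        return buf.toByteArray();
    }

    // Read a stream to the end and return its bytes.
    static byte[] drain(InputStream in) throws IOException {
        ByteArrayOutputStream buf = new ByteArrayOutputStream();
        byte[] chunk = new byte[4096];
        int n;
        while ((n = in.read(chunk)) != -1) {
            buf.write(chunk, 0, n);
        }
        return buf.toByteArray();
    }

    // Matching reader: zlib-format data read back with a zlib-format reader.
    static byte[] readZlib(byte[] compressed) throws IOException {
        try (InputStream in =
                 new InflaterInputStream(new ByteArrayInputStream(compressed))) {
            return drain(in);
        }
    }

    // Mismatched reader: returns true if the gzip reader fails on the data.
    static boolean gzipReaderRejects(byte[] compressed) {
        try (InputStream in =
                 new GZIPInputStream(new ByteArrayInputStream(compressed))) {
            drain(in);
            return false;
        } catch (IOException e) {
            // The gzip reader checks for its own header and gives up here.
            return true;
        }
    }

    public static void main(String[] args) throws IOException {
        byte[] entry = "index entry".getBytes(StandardCharsets.UTF_8);
        byte[] stored = writeZlib(entry);
        System.out.println("matching reader ok: "
            + Arrays.equals(entry, readZlib(stored)));
        System.out.println("gzip reader rejects: " + gzipReaderRejects(stored));
    }
}
```

The point of the sketch is only that the failure happens at the header check, 
before any payload is decoded, which is consistent with the trace failing on 
the first read of the index.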

This didn't work with previous versions of Hadoop, right?

> GzipCodec fails second time it is used in a process
> ---------------------------------------------------
>
>                 Key: HADOOP-5281
>                 URL: https://issues.apache.org/jira/browse/HADOOP-5281
>             Project: Hadoop Core
>          Issue Type: Bug
>    Affects Versions: 0.19.0
>         Environment: 0.19.0 / Linux / amd64
>            Reporter: Ben Maurer
>            Priority: Blocker
>         Attachments: 5281-0.patch, GZt.java, GZt.java
>
>
> The attached code (GZt.java) raises:
> {noformat}
> java.io.IOException: incorrect header check
>       at org.apache.hadoop.io.compress.zlib.ZlibDecompressor.inflateBytesDirect(Native Method)
>       at org.apache.hadoop.io.compress.zlib.ZlibDecompressor.decompress(ZlibDecompressor.java:221)
>       at org.apache.hadoop.io.compress.DecompressorStream.decompress(DecompressorStream.java:80)
>       at org.apache.hadoop.io.compress.DecompressorStream.read(DecompressorStream.java:74)
>       at org.apache.hadoop.io.compress.DecompressorStream.read(DecompressorStream.java:62)
>       at java.io.DataInputStream.readByte(DataInputStream.java:248)
>       at org.apache.hadoop.io.WritableUtils.readVLong(WritableUtils.java:325)
>       at org.apache.hadoop.io.WritableUtils.readVInt(WritableUtils.java:346)
>       at org.apache.hadoop.io.SequenceFile$Reader.next(SequenceFile.java:1853)
>       at org.apache.hadoop.io.SequenceFile$Reader.next(SequenceFile.java:1876)
>       at org.apache.hadoop.io.MapFile$Reader.readIndex(MapFile.java:319)
>       at org.apache.hadoop.io.MapFile$Reader.seekInternal(MapFile.java:435)
>       at org.apache.hadoop.io.MapFile$Reader.seekInternal(MapFile.java:417)
>       at org.apache.hadoop.io.MapFile$Reader.seek(MapFile.java:404)
>       at org.apache.hadoop.io.MapFile$Reader.get(MapFile.java:523)
> {noformat}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
