Hello,

I have a sequence of MR Jobs that are using the SequenceFile for their output 
and input format. If I run them without any compression enabled they work fine. 
If I use the LzoCodec they also work just fine (but then the output is not Lzop 
compatible which is inconvenient).

If I try using the LzopCodec, then the first MR job (which reads from a 
TextFile and outputs to a SequenceFile) runs OK, but when the second job tries 
to read what the first job wrote, I get the following exception:

java.io.EOFException: Premature EOF from inputStream
        at 
com.hadoop.compression.lzo.LzopInputStream.readFully(LzopInputStream.java:75)
        at 
com.hadoop.compression.lzo.LzopInputStream.readHeader(LzopInputStream.java:114)
        at 
com.hadoop.compression.lzo.LzopInputStream.<init>(LzopInputStream.java:54)
        at 
com.hadoop.compression.lzo.LzopCodec.createInputStream(LzopCodec.java:83)
        at org.apache.hadoop.io.SequenceFile$Reader.init(SequenceFile.java:1591)
        at 
org.apache.hadoop.io.SequenceFile$Reader.<init>(SequenceFile.java:1493)
        at 
org.apache.hadoop.io.SequenceFile$Reader.<init>(SequenceFile.java:1480)
        at 
org.apache.hadoop.io.SequenceFile$Reader.<init>(SequenceFile.java:1475)
        at 
org.apache.hadoop.mapreduce.lib.input.SequenceFileRecordReader.initialize(SequenceFileRecordReader.java:50)
        at 
org.apache.hadoop.mapred.MapTask$NewTrackingRecordReader.initialize(MapTask.java:451)
        at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:646)
        at org.apache.hadoop.mapred.MapTask.run(MapTask.java:323)
        at org.apache.ha

Does anyone know why this could be happening? I'm using the latest's Couldera 
CDH3 distribution and I'm configuring the compression through the 
mapred.output.compression.codec property in the mapred-site.xml file.

Thanks!
Ximo.

________________________________
Este mensaje se dirige exclusivamente a su destinatario. Puede consultar 
nuestra pol?tica de env?o y recepci?n de correo electr?nico en el enlace 
situado m?s abajo.
This message is intended exclusively for its addressee. We only send and 
receive email on the basis of the terms set out at
http://www.tid.es/ES/PAGINAS/disclaimer.aspx

Reply via email to