Zhijie Shen created YARN-872:
--------------------------------

             Summary: BlockDecompressorStream#decompress will throw 
EOFException instead of return -1 when EOF
                 Key: YARN-872
                 URL: https://issues.apache.org/jira/browse/YARN-872
             Project: Hadoop YARN
          Issue Type: Bug
            Reporter: Zhijie Shen
            Assignee: Zhijie Shen
            Priority: Critical


BlockDecompressorStream#decompress ultimately calls rawReadInt, which will 
throw EOFException instead of return -1 when encountering end of a stream. 
Then, decompress will be called by read. However, InputStream#read is supposed 
to return -1 instead of throwing EOFException to indicate the end of a stream. 
This explains why in LineReader,
{code}
      if (bufferPosn >= bufferLength) {
        startPosn = bufferPosn = 0;
        if (prevCharCR)
          ++bytesConsumed; //account for CR from previous read
        bufferLength = in.read(buffer);
        if (bufferLength <= 0)
          break; // EOF
      }
{code}
-1 is checked instead of catching EOFException.

Now the problem will occur with SnappyCodec. If an input file is compressed 
with SnappyCodec, it needs to be decompressed through BlockDecompressorStream 
when it is read. Then, if it empty, EOFException will been thrown from 
rawReadInt and break LineReader.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to