[
https://issues.apache.org/jira/browse/HADOOP-6307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12782220#action_12782220
]
Tsz Wo (Nicholas), SZE commented on HADOOP-6307:
------------------------------------------------
> SequenceFile.Reader actually do not need the file length. ...
Thanks Chris and Arun pointing out that the file length (i.e.
SequenceFile.Reader.end) cannot be removed. Otherwise, SequenceFile.Sorter
won't work.
I guess we have to introduce a new public constructor, which takes length as a
parameter. So, that user applications could possibly pass the correct length
when creating a new Reader.
> Support reading on un-closed SequenceFile
> -----------------------------------------
>
> Key: HADOOP-6307
> URL: https://issues.apache.org/jira/browse/HADOOP-6307
> Project: Hadoop Common
> Issue Type: Improvement
> Components: io
> Reporter: Tsz Wo (Nicholas), SZE
>
> When a SequenceFile.Reader is constructed, it calls
> fs.getFileStatus(file).getLen(). However, fs.getFileStatus(file).getLen()
> does not return the hflushed length for un-closed file since the Namenode
> does not know the hflushed length. DFSClient have to ask a datanode for the
> length last block which is being written; see also HDFS-570.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.