[ https://issues.apache.org/jira/browse/HADOOP-928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Hairong Kuang updated HADOOP-928:
---------------------------------

    Attachment: checksum1.patch

This patch incorporates most of Doug's comments. I finally decided to keep the FSInputStream interface, the raw file system input stream. The reason is that every filesystem needs to implement the raw stream interface in order to provide a data input stream, i.e. FSDataInputStream. Currently the default implementation of FSDataInputStream is a wrapper over the raw stream, providing a buffered, seekable, and position readable data stream.

> make checksums optional per FileSystem
> --------------------------------------
>
>                 Key: HADOOP-928
>                 URL: https://issues.apache.org/jira/browse/HADOOP-928
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: fs
>            Reporter: Doug Cutting
>         Assigned To: Hairong Kuang
>         Attachments: checksum.patch, checksum1.patch
>
>
> Checksumming is currently built into the base FileSystem class. It should
> instead be optional, with each FileSystem implementation electing whether to
> use the Hadoop-provided checksum system, or to disable it, or to implement
> its own custom checksum system.
> To implement this, a ChecksumFileSystem implementation can be provided that
> wraps another FileSystem implementation, implementing checksums as in
> Hadoop's current mandatory implementation (i.e., as a separate crc file per
> file that's elided from directory listings). The 'raw' FileSystem methods
> would be removed. FSDataInputStream and FSDataOutputStream would be made
> interfaces.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
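
For illustration only, the wrapping approach proposed in the issue description (a checksumming file system layered over a raw one, with a separate crc file per data file that is hidden from listings) might be sketched roughly as follows. This is not the code in checksum1.patch and it does not use Hadoop's real classes: SimpleFs, InMemoryFs, ChecksumFs, the ".name.crc" sidecar naming, and the use of java.util.zip.CRC32 over the whole file are all assumptions made for this sketch.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.NoSuchElementException;
import java.util.TreeMap;
import java.util.zip.CRC32;

/** Hypothetical minimal file system interface; NOT Hadoop's FileSystem API. */
interface SimpleFs {
    void write(String name, byte[] data);
    byte[] read(String name);
    List<String> list();
}

/** A "raw" file system with no checksumming, kept in memory for the sketch. */
class InMemoryFs implements SimpleFs {
    private final Map<String, byte[]> files = new TreeMap<>();

    public void write(String name, byte[] data) {
        files.put(name, data.clone());
    }

    public byte[] read(String name) {
        byte[] data = files.get(name);
        if (data == null) {
            throw new NoSuchElementException(name);
        }
        return data.clone();
    }

    public List<String> list() {
        return new ArrayList<>(files.keySet());
    }
}

/** Wraps any SimpleFs and adds checksums without changing the wrapped class. */
class ChecksumFs implements SimpleFs {
    private final SimpleFs raw;

    ChecksumFs(SimpleFs raw) {
        this.raw = raw;
    }

    // Sidecar naming is an assumption of this sketch (flat names only).
    static String crcName(String name) {
        return "." + name + ".crc";
    }

    static boolean isCrcName(String name) {
        return name.startsWith(".") && name.endsWith(".crc");
    }

    private static byte[] crcOf(byte[] data) {
        CRC32 crc = new CRC32();
        crc.update(data, 0, data.length);
        return Long.toString(crc.getValue()).getBytes(StandardCharsets.UTF_8);
    }

    /** Writes the data file and its checksum sidecar to the raw file system. */
    public void write(String name, byte[] data) {
        raw.write(name, data);
        raw.write(crcName(name), crcOf(data));
    }

    /** Reads the data file and verifies it against the stored checksum. */
    public byte[] read(String name) {
        byte[] data = raw.read(name);
        byte[] expected = raw.read(crcName(name));
        if (!Arrays.equals(expected, crcOf(data))) {
            throw new IllegalStateException("Checksum mismatch: " + name);
        }
        return data;
    }

    /** Lists files while eliding the checksum sidecars. */
    public List<String> list() {
        List<String> visible = new ArrayList<>();
        for (String name : raw.list()) {
            if (!isCrcName(name)) {
                visible.add(name);
            }
        }
        return visible;
    }
}
```

With this shape, a file system that wants checksums is constructed as new ChecksumFs(new InMemoryFs()), while one that does not simply uses the raw implementation directly, which is the kind of per-FileSystem choice the issue asks for.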