[ https://issues.apache.org/jira/browse/HADOOP-1381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12496435 ]
Doug Cutting commented on HADOOP-1381: -------------------------------------- > reduce the overhead by a factor of 500 But if the overhead is already considerably less than 1%, is this really important? We want to keep it substantially smaller than 1% of what we ever expect the FileSystem block size to be. 1MB seems to be pushing that boundary. > The distance between sync blocks in SequenceFiles should be configurable > rather than hard coded to 2000 bytes > ------------------------------------------------------------------------------------------------------------- > > Key: HADOOP-1381 > URL: https://issues.apache.org/jira/browse/HADOOP-1381 > Project: Hadoop > Issue Type: Improvement > Components: io > Reporter: Owen O'Malley > Fix For: 0.14.0 > > > Currently SequenceFiles put in sync blocks every 2000 bytes. It would be much > better if it was configurable with a much higher default (1mb or so?). -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.