It seems RFile has a preference for the Hadoop configuration object holding Accumulo configuration over Accumulo per-table configuration in ZooKeeper.
See RFileOperations.openWriter(...). The affected configuration properties are: table.file.replication table.file.blocksize table.file.compress.blocksize table.file.compress.blocksize.index table.file.compress.type Furthermore, when they appear in Hadoop configuration, they cannot contain the Accumulo shortcuts for specifying byte sizes (like "1G"). Is this a bug, or a feature? It seems like there's a potential for it to be a feature, particularly in AccumuloFileOutputFormat, so one can specify the property in Hadoop, but it could also be a bug if it shows up in the Hadoop configuration files... especially since we don't prefix these configuration properties with something unique, like "accumulo." Thoughts? -- Christopher L Tubbs II http://gravatar.com/ctubbsii
