Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change notification.

The "Hive/LanguageManual/Archiving" page has been changed by PaulYang.
http://wiki.apache.org/hadoop/Hive/LanguageManual/Archiving?action=diff&rev1=1&rev2=2

--------------------------------------------------

  Hive comes with the HiveHarFileSystem class that addresses some of these issues, and is by default the value for {{{fs.har.impl}}}. Keep this in mind if you're rolling your own version of HarFileSystem.

-  * The default HiveHarFileSystem.getFileBlockLocations() has NO LOCALITY. That means it may introduce higher network loads or reduced performance.
+  * The default HiveHarFileSystem.getFileBlockLocations() has '''no locality'''. That means it may introduce higher network loads or reduced performance.

   * Archived partitions cannot be overwritten with INSERT OVERWRITE ... The partition must be unarchived first.
+
+  * If two processes attempt to archive the same partition at the same time, bad things can happen. (Need to implement concurrency support.)

  == Under the hood ==
+ Internally, when a partition is archived, t
+
