[ https://issues.apache.org/jira/browse/HIVE-8808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14207183#comment-14207183 ]
Josh Elser commented on HIVE-8808:
----------------------------------

Thanks for looping me in, [~sushanth].

As far as I can recall, Accumulo's InputFormat classes are stateless, relying on the state to be provided through the JobConf/InputSplits as you described. I know we have some "annoyances" where multiple calls to the InputFormat which alter the JobConf are not idempotent (they typically throw an error if things are re-set). I work around most of that pain in the StorageHandler impl.

Nothing is coming to mind that would be fundamentally broken if we get a re-used instance of the input format. Happy to help test/evaluate this too.

> HiveInputFormat caching cannot work with all input formats
> ----------------------------------------------------------
>
>                 Key: HIVE-8808
>                 URL: https://issues.apache.org/jira/browse/HIVE-8808
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Brock Noland
>
> In {{HiveInputFormat}} we implement instance caching (see
> {{getInputFormatFromCache}}). In HS2, this assumes that InputFormats are
> stateless but I don't think this assumption is true, especially with regard
> to HBase.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
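
To make the two points in this thread concrete, here is a minimal Java sketch of (a) a per-class instance cache in the spirit of what the issue describes for {{getInputFormatFromCache}}, and (b) a guard around a set-once configuration call, similar in spirit to the workaround mentioned for the StorageHandler. Everything in it is a hypothetical stand-in: `Conf`, `SetOnceFormat`, `getFromCache`, and `setTableIfAbsent` are invented for illustration and are not Hive, Hadoop, or Accumulo APIs.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

public class InputFormatCacheSketch {

    /** Hypothetical stand-in for a job configuration; NOT Hadoop's JobConf. */
    static final class Conf {
        private final Map<String, String> values = new HashMap<>();
        String get(String key) { return values.get(key); }
        void set(String key, String value) { values.put(key, value); }
    }

    /** Hypothetical input format: holds no fields, so a reused instance is safe. */
    static final class SetOnceFormat {
        static final String KEY = "sketch.table.name";

        // Non-idempotent setter: a second call on the same Conf throws,
        // mimicking the "annoyance" described in the comment.
        static void setTable(Conf conf, String table) {
            if (conf.get(KEY) != null) {
                throw new IllegalStateException("table already set on this Conf");
            }
            conf.set(KEY, table);
        }

        // All state comes from the Conf that is passed in, none from the instance.
        String describe(Conf conf) { return "scan " + conf.get(KEY); }
    }

    // One cached instance per class, shared by every caller in the process.
    private static final Map<Class<?>, Object> CACHE = new ConcurrentHashMap<>();

    static <T> T getFromCache(Class<T> cls) {
        Object instance = CACHE.computeIfAbsent(cls, c -> {
            try {
                return c.getDeclaredConstructor().newInstance();
            } catch (ReflectiveOperationException e) {
                throw new IllegalStateException("cannot instantiate " + c, e);
            }
        });
        return cls.cast(instance);
    }

    // Guard: only invoke the set-once setter when the key is still absent,
    // so repeated configuration passes do not trigger the exception.
    static void setTableIfAbsent(Conf conf, String table) {
        if (conf.get(SetOnceFormat.KEY) == null) {
            SetOnceFormat.setTable(conf, table);
        }
    }

    public static void main(String[] args) {
        SetOnceFormat first = getFromCache(SetOnceFormat.class);
        SetOnceFormat second = getFromCache(SetOnceFormat.class);
        System.out.println("same instance reused: " + (first == second));

        Conf conf = new Conf();
        setTableIfAbsent(conf, "events");
        setTableIfAbsent(conf, "events"); // second pass is a no-op, no exception
        System.out.println(first.describe(conf));
    }
}
```

The reused instance is harmless here only because `SetOnceFormat` keeps nothing in instance fields. A format that stored, say, a connection or table handle in a field would leak that state between queries sharing the cached instance, which is the failure mode the issue is worried about.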