Eugene Koifman created HIVE-17296:
Summary: Acid tests with multiple splits
Issue Type: Test
Affects Versions: 3.0.0
Reporter: Eugene Koifman
Assignee: Eugene Koifman
data files in an Acid table are ORC files which may have multiple stripes
for such files in base/ or delta/ (and original files with non acid to acid
conversion) are split by OrcInputFormat into multiple (stripe sized) chunks.
There is additional logic in in OrcRawRecordMerger
(discoverKeyBounds/discoverOriginalKeyBounds) that is not tested by any E2E
tests since none of the have enough data to generate multiple stripes in a
in TestOrcRawRecordMerger has some logic to test this but it really needs e2e
With ORC-228 it will be possible to write such tests.
This message was sent by Atlassian JIRA