[
https://issues.apache.org/jira/browse/HIVE-26482?focusedWorklogId=802477&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-802477
]
ASF GitHub Bot logged work on HIVE-26482:
-----------------------------------------
Author: ASF GitHub Bot
Created on: 22/Aug/22 13:32
Start Date: 22/Aug/22 13:32
Worklog Time Spent: 10m
Work Description: InvisibleProgrammer opened a new pull request, #3532:
URL: https://github.com/apache/hive/pull/3532
There is no test about the output directory names after running compaction
on partitions
Compaction output directories' writeIds only reflect the writeIds of the
deltas it compacts, and not the max write id of the table
Example:
```
Pre-compaction...
Partition p=1 contains:
delta_1_1
delta_2_2
partition p=2 contains
delta_3_3
delta_4_4
After minor compaction...
Partition p=1 contains:
delta_1_2
partition p=2 contains
delta_3_4
```
### What changes were proposed in this pull request?
New test added: `testCompactionOutputDirectoryNamesOnPartitions`
### Does this PR introduce _any_ user-facing change?
No
### How was this patch tested?
New test added: `testCompactionOutputDirectoryNamesOnPartitions`
Issue Time Tracking
-------------------
Worklog Id: (was: 802477)
Time Spent: 0.5h (was: 20m)
> Create a unit test checking compaction output file names on a partitioned
> table
> -------------------------------------------------------------------------------
>
> Key: HIVE-26482
> URL: https://issues.apache.org/jira/browse/HIVE-26482
> Project: Hive
> Issue Type: Test
> Components: Hive
> Reporter: Zsolt Miskolczi
> Assignee: Zsolt Miskolczi
> Priority: Major
> Labels: pull-request-available
> Time Spent: 0.5h
> Remaining Estimate: 0h
>
> Compaction output directories' writeIds only reflect the writeIds of the
> deltas it compacts, and not the max write id of the table
> Example:
> Pre-compaction...
> {code:java}
> Partition p=1 contains:
> delta_1_1
> delta_2_2
> partition p=2 contains
> delta_3_3
> delta_4_4
> {code}
> After minor compaction...
> {code:java}
> Partition p=1 contains:
> delta_1_2
> partition p=2 contains
> delta_3_4
> {code}
> AFAIK there are no unit tests that reflect this.
> TestTxnCommands2#testFullACIDAbortWithManyPartitions is a good template to
> start with.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)