[ https://issues.apache.org/jira/browse/HIVE-14511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15421717#comment-15421717 ]
Sergey Shelukhin commented on HIVE-14511:
-----------------------------------------
Shouldn't the table schema inform the correct partition directory structure?
So, in the above case, if the table has a p1 partition column, the partition
should be added and file1 should follow the setting (ignore/fail); likewise if
it doesn't.
I actually wonder whether the patch should be updated to look for a specific
level. I.e., if the table is partitioned on a and b, adding an a=1/000000_0
file makes no sense.
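To make the "specific level" idea concrete, here is a rough sketch in Python
(not Hive's Java, and not taken from the patch). The function name
check_partition_path and its arguments are hypothetical. It assumes partition
directories are named col=value and that column names compare
case-insensitively.

```python
from pathlib import PurePosixPath


def check_partition_path(relative_path, partition_cols):
    """Validate a file path (relative to the table root) against the
    table's partition columns.

    Returns the partition spec as a dict when the directories above the
    file match partition_cols exactly and in order. Raises ValueError
    otherwise. Hypothetical helper, for illustration only.
    """
    parts = PurePosixPath(relative_path).parts
    if not parts:
        raise ValueError("empty path")

    # Everything except the last component is treated as a directory.
    dirs = parts[:-1]

    # The file must sit exactly at the depth given by the table schema.
    if len(dirs) != len(partition_cols):
        raise ValueError(
            f"expected {len(partition_cols)} partition level(s), "
            f"found {len(dirs)} in {relative_path!r}"
        )

    spec = {}
    for col, directory in zip(partition_cols, dirs):
        name, sep, value = directory.partition("=")
        # Assumption: column names are compared case-insensitively.
        if sep != "=" or name.lower() != col.lower() or not value:
            raise ValueError(
                f"directory {directory!r} does not match partition column {col!r}"
            )
        spec[col] = value
    return spec
```

With a table partitioned on a and b, check_partition_path("a=1/b=2/000000_0",
["a", "b"]) returns the partition spec, while "a=1/000000_0" raises
ValueError. The caller could then apply the ignore/fail setting to the error.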
This brings it back to using the right tool for the right job. msck needs to
be strict, as it's primarily intended for repair and its use for ETL is
incidental. If we need a "load my partitions" command that is more flexible
for ETL, it should be a separate feature...
> Improve MSCK for partitioned table to deal with special cases
> -------------------------------------------------------------
>
> Key: HIVE-14511
> URL: https://issues.apache.org/jira/browse/HIVE-14511
> Project: Hive
> Issue Type: Sub-task
> Reporter: Pengcheng Xiong
> Assignee: Pengcheng Xiong
> Attachments: HIVE-14511.01.patch
>
>
> Some users will have a folder rather than a file under the last partition
> folder. However, msck is going to search for the leaf folder rather than the
> last partition folder. We need to improve that.