Taraka Rama Rao Lethavadla created HIVE-27072:
-------------------------------------------------

             Summary: create an sql query to validate a given table for 
partitions and list out any discrepancies in files/folders, list out empty 
files etc
                 Key: HIVE-27072
                 URL: https://issues.apache.org/jira/browse/HIVE-27072
             Project: Hive
          Issue Type: New Feature
          Components: HiveServer2
            Reporter: Taraka Rama Rao Lethavadla


There are a couple of issues that arise when partitions are corrupted or 
contain additional unwanted files: these interfere with query execution and 
cause it to fail. 

If we run a query like "validate table table_name [partition(partition=a,..)]", 
the output should list:
 * any unwanted files, such as empty files or metadata files (_SUCCESS etc.)
 * any unwanted folders not conforming to the partition naming convention, 
such as test_folder where an actual partition name looks like test=23
 * too many staging directories; finding many of them suggests that cleanup 
is not happening properly after query execution
 * any file permission related issues, such as the table having one owner 
and a partition having another (optional)
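
The checks listed above could be prototyped outside Hive before any SQL 
syntax exists. The following is only a rough sketch under stated assumptions, 
not a proposed implementation: it walks a local directory standing in for the 
table location (a real check would go through the Hadoop FileSystem API), the 
function name validate_table_dir and its parameters are hypothetical, and it 
assumes staging directories keep the default ".hive-staging" prefix.

```python
import re
from pathlib import Path

# Assumption: partition directories are named key=value, e.g. test=23.
PARTITION_DIR = re.compile(r"^[^=]+=.+$")
# Assumption: staging directories keep the default ".hive-staging" prefix.
STAGING_PREFIX = ".hive-staging"


def validate_table_dir(table_dir, partition_depth=1, max_staging_dirs=5):
    """Return a list of (issue, detail) tuples for a table directory.

    Hypothetical helper: scans a local directory tree, not HDFS.
    """
    root = Path(table_dir)
    table_owner = root.stat().st_uid
    issues = []
    staging_count = 0

    def scan(directory, depth):
        nonlocal staging_count
        for entry in sorted(directory.iterdir()):
            rel = entry.relative_to(root).as_posix()
            if entry.is_dir():
                if entry.name.startswith(STAGING_PREFIX):
                    # Counted here, reported once after the scan.
                    staging_count += 1
                elif depth >= partition_depth or not PARTITION_DIR.match(entry.name):
                    # Folder that is not a valid key=value partition directory.
                    issues.append(("unexpected_dir", rel))
                else:
                    # Optional check: partition owner differs from table owner.
                    if entry.stat().st_uid != table_owner:
                        issues.append(("owner_mismatch", rel))
                    scan(entry, depth + 1)
            elif entry.name.startswith(("_", ".")):
                # Marker/metadata files such as _SUCCESS.
                issues.append(("metadata_file", rel))
            elif entry.stat().st_size == 0:
                issues.append(("empty_file", rel))

    scan(root, 0)
    if staging_count > max_staging_dirs:
        issues.append(("too_many_staging_dirs", str(staging_count)))
    return issues
```

Calling validate_table_dir on a table directory with partition_depth set to 
the number of partition columns returns one tuple per discrepancy, which is 
roughly the row set the proposed "validate table" query would print.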



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
