[ https://issues.apache.org/jira/browse/HIVE-1950?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12993852#comment-12993852 ]
Ning Zhang commented on HIVE-1950: ---------------------------------- Yongqiang, the patch doesn't compile. Below are some initial reviews from me: QTestUtil.java: 334: you may want to add those index tables that you want to keep in srcTables. Otherewise indexes that are created inside a test will not be cleaned -- side-effect. StatsTask: a StatsTask is added in DDLSemanticAnalyzer for the mege task but why set it to do nothing? ExecDriver: jobExecHelper is constructed in both the constructors and initialize(). Is there a reason? checkFatalError: why removed some code? Why remove METASTOREPWD? DDLTask: move semantics checking (index & archive checking etc.) to DDLSemanticAnalyzer. Execution time should only raise exception if there are runtime exceptions. In another word, explain plan of the query shoull throw an exception if there are indexes or table is archived. > Block merge for RCFile > ---------------------- > > Key: HIVE-1950 > URL: https://issues.apache.org/jira/browse/HIVE-1950 > Project: Hive > Issue Type: New Feature > Reporter: He Yongqiang > Assignee: He Yongqiang > Attachments: HIVE-1950.1.patch, HIVE-1950.2.patch, HIVE-1950.3.patch > > > In our env, there are a lot of small files inside one partition/table. In > order to reduce the namenode load, we have one dedicated housekeeping job > running to merge these file. Right now the merge is an 'insert overwrite' in > hive, and requires decompress the data and compress it. This jira is to add a > command in Hive to do the merge without decompress and recompress the data. > Something like "alter table tbl_name [partition ()] merge files". In this > jira the new command will only support RCFile, since there need some new APIs > to the fileformat. -- This message is automatically generated by JIRA. - For more information on JIRA, see: http://www.atlassian.com/software/jira