Sergey Shelukhin created HIVE-6190: -------------------------------------- Summary: redundant columns in metastore schema for stats Key: HIVE-6190 URL: https://issues.apache.org/jira/browse/HIVE-6190 Project: Hive Issue Type: Bug Components: Metastore Reporter: Sergey Shelukhin Assignee: Sergey Shelukhin Priority: Minor
package.jdo has: {noformat} <field name ="dbName"> <column name="DB_NAME" length="128" jdbc-type="VARCHAR" allows-null="false"/> </field> <field name="tableName"> <column name="TABLE_NAME" length="128" jdbc-type="VARCHAR" allows-null="false"/> </field> <field name="partitionName"> <column name="PARTITION_NAME" length="767" jdbc-type="VARCHAR" allows-null="false"/> </field> <field name="partition"> <column name="PART_ID"/> </field> {noformat} Only PART_ID is enough, the other fields are unnecessary and may potentially cause bugs; similarly for table stats. One could argue that they were intended for perf (denormalization), but stats retrieval currently is very slow on much deeper level so it's not really justified. -- This message was sent by Atlassian JIRA (v6.1.5#6160)