Yeah, the metastore still holds the definition for external tables. As you mentioned, for an external table, hive doesn't delete the data when you drop the table nor renames the directory when the table is renamed. Also, external tables can't be archived. In general, hive will not do any operations that affect the underlying files if the tables is declared external.
From: Pradeep Kamath [mailto:[email protected]] Sent: Wednesday, July 21, 2010 10:10 AM To: [email protected] Subject: Managed Vs External tables Hi, I am trying to understand the differences between managed Vs external tables. From http://wiki.apache.org/hadoop/Hive/StorageHandlers#Terminology: "A managed table is one for which the definition is primarily managed in Hive's metastore, and for whose data storage Hive is responsible. An external table is one whose definition is managed in some external catalog, and whose data Hive does not own (i.e. it will not be deleted when the table is dropped)." I am a little confused by the "external table is one whose definition is managed in some external catalog" - I thought the definition for external tables is still managed by the metastore (and not an external catalog) no? I thought the only difference between managed and external tables is that the data is not dropped when you drop an external table - are there any other differences? Thanks, Pradeep
