[
https://issues.apache.org/jira/browse/HIVE-126?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Johan Oskarsson updated HIVE-126:
---------------------------------
Attachment: HIVE-126.patch
First attempt at a patch using my preferred solution, removing the code that
reads partition information from the HDFS entirely and instead relying on the
MetaStore for accurate information.
Although I assume the code was put in there for a reason so I'd love to hear
more about it. Perhaps a fsck type command could be implemented to compare on
disk data with the MetaStore?
The other solution I can think about to allow HIVE-91 to move forward is to
only get partition information from HDFS if it's not an external table.
> Don't fetch information on Partitions from HDFS instead of MetaStore
> --------------------------------------------------------------------
>
> Key: HIVE-126
> URL: https://issues.apache.org/jira/browse/HIVE-126
> Project: Hadoop Hive
> Issue Type: Improvement
> Components: Metastore
> Affects Versions: 0.19.0
> Reporter: Johan Oskarsson
> Assignee: Johan Oskarsson
> Fix For: 0.19.0
>
> Attachments: HIVE-126.patch
>
>
> When investigating HIVE-91 an issue came up where the information on what
> partitions a table contains is loaded by listing the directories in the table
> directory on HDFS. This is then used to overrule what is in the MetaStore if
> any difference is found.
> * Would it not be preferable if MetaStore is the one authority on what the
> table contains?
> * It will also be a major hassle (or impossible?) to retrieve this
> information from HDFS with external tables that have non standard partition
> names (HIVE-91), such as: table/2008/01/08/portugal where "2008/01/08" is one
> partition value and "portugal" is another.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.