GitHub user vkorukanti opened a pull request:

    https://github.com/apache/drill/pull/215

    DRILL-3739: Fix issues in reading Hive tables with StorageHandler 
configuration (eg. Hive-HBase tables)

    Issue is for Hive tables with custom storage handlers (such as HBase backed 
Hive tables), InputFormat class is not stored in StorageDescriptor in Hive 
metastore. Instead it is retrieved from the StorageHandler.getInputFormatClass. 
This is new change made in Hive after Hive 0.13.
    
    Fix is: if we can't find the InputFormat class in metastore, create 
StorageHandler instance of the table and get the InputFormat from the instance. 
If the StorageHandler doesn't exists throw an exception. Behavior here is 
similar to Hive.
    
    In order for Hive-HBase tables to work following config properties need to 
be added to Hive storage plugin config section:
        "hbase.zookeeper.quorum": "zkhost1,zkhost2,zkhost3",
        "hbase.zookeeper.property.clientPort": "2181" // ZooKeeper port
    
    These properties are expected by HBaseStorageHandler to discover the HBase 
services.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/vkorukanti/drill DRILL-3739

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/drill/pull/215.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #215
    
----
commit 60dfac886a01f53ed94ab146bedbb16fdc90427f
Author: vkorukanti <[email protected]>
Date:   2015-10-19T18:35:09Z

    DRILL-3938: Support reading from Hive tables that have schema altered after 
the creation
    
    Also:
    + Remove "redoRecord" logic which is not needed after "automatic 
reallocation" (DRILL-1960) changes.
    + Remove HiveTestRecordReader. This is incomplete in implementation and not 
used anywhere. It is currently just
      a burden to maintain with changes in its superclass HiveRecordReader

commit 2b28eab82f6c34bc7a27c96ddd3caf7371529f7f
Author: vkorukanti <[email protected]>
Date:   2015-10-20T23:21:09Z

    DRILL-3893: Change Hive metadata cache invalidation policy to "1 min after 
last write".

commit 5d35df1b085f7c4add207c3017a08a531da65dee
Author: vkorukanti <[email protected]>
Date:   2015-10-21T18:01:23Z

    DRILL-3739: Fix issues in reading Hive tables with StorageHandler 
configuration (eg. Hive-HBase tables)

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---

Reply via email to