Nitin Kak created SPARK-9042:
--------------------------------

             Summary: Spark SQL incompatibility with Apache Sentry
                 Key: SPARK-9042
                 URL: https://issues.apache.org/jira/browse/SPARK-9042
             Project: Spark
          Issue Type: Bug
          Components: SQL
    Affects Versions: 1.2.0
            Reporter: Nitin Kak


Hive queries executed from Spark using HiveContext use CLI to create the query 
plan and then access the Hive table directories(under /user/hive/warehouse/) 
directly. This gives AccessContolException if Apache Sentry is installed:

org.apache.hadoop.security.AccessControlException: Permission denied: 
user=kakn, access=READ_EXECUTE, 
inode="/user/hive/warehouse/mastering.db/sample_table":hive:hive:drwxrwx--t 

With Apache Sentry, only "hive" user(created only for Sentry) has the 
permissions to access the hive warehouse directory. After Sentry installations 
all the queries are directed to HiveServer2 which translates the changes the 
invoking user to "hive" and then access the hive warehouse directory. However, 
HiveContext does not execute the query through HiveServer2 which is leading to 
the issue. Here is an example of executing hive query through HiveContext.

val hqlContext = new HiveContext(sc) // Create context to run Hive queries 
val pairRDD = hqlContext.sql(hql) // where hql is the string with hive query 





--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to