Uma Maheswara Rao G created HIVE-24342:
------------------------------------------

             Summary: isPathEncrypted should make sure resolved path also from 
HDFS
                 Key: HIVE-24342
                 URL: https://issues.apache.org/jira/browse/HIVE-24342
             Project: Hive
          Issue Type: Bug
          Components: HiveServer2, Shims
    Affects Versions: 3.1.2
            Reporter: Uma Maheswara Rao G
            Assignee: Uma Maheswara Rao G


Currently isPathEncrypted will make sure path is from hdfs by check the path 
scheme is "hdfs"

In the case if mounted ViewFileSystem based files systems like 
ViewFSOverloadScheme or ViewHDFS (HDFS-15289) may need o check resolved path is 
really hdfs.

In ViewHDFS case, we can mount hdfs://ns1/test ---> o3fs://b.v.ozone1/test

When user calling queries with the path hdfs://ns1/test, isPathEncrypted will 
think the path is from hdfs only as its checking path scheme.

 
{code:java}
0: jdbc:hive2://umag-1.umag.root.xxx.site:218> select * from test30;
Error: Error while compiling statement: FAILED: SemanticException Unable to 
determine if hdfs://ns1/test is encrypted: 
java.lang.UnsupportedOperationException: This API:getEZForPath is specific to 
DFS. Can't run on other fs:o3fs://bucket.volume.ozone1 (state=42000,code=40000)
0: jdbc:hive2://umag-1.umag.root.xxx.site:218> cd Closing: 0: 
jdbc:hive2://umag-1.umag.root.xxx.site:2181,umag-2.umag.root.xxx.site:2181,umag-5.umag.root.xxx.site:2181/default;password=root;principal=hive/umag-5.umag.root.xxx.s...@root.hwx.site;retries=5;serviceDiscoveryMode=zooKeeper;user=root;zooKeeperNamespace=hiveserver2
{code}
 

So, here we should use resolvePath to make sure the resolved path really in 
hdfs. If the resolved path is not from hdfs (in above case, it o3fs path), then 
it will return false.

After fixing this, the query is passing.:

 
{code:java}
0: jdbc:hive2://umag-1.umag.root.xxx.site:218> select * from test30;
INFO  : Compiling 
command(queryId=hive_20201031002253_1691548f-6fa8-4ea9-9cd4-87b70fe8f6bb): 
select * from test30
INFO  : No Stats for default@test30, Columns: item, user_id, state, order_id
INFO  : Semantic Analysis Completed (retrial = false)
INFO  : Created Hive schema: 
Schema(fieldSchemas:[FieldSchema(name:test30.order_id, type:bigint, 
comment:null), FieldSchema(name:test30.user_id, type:string, comment:null), 
FieldSchema(name:test30.item, type:string, comment:null), 
FieldSchema(name:test30.state, type:string, comment:null)], properties:null)
INFO  : Completed compiling 
command(queryId=hive_20201031002253_1691548f-6fa8-4ea9-9cd4-87b70fe8f6bb); Time 
taken: 4.47 seconds
INFO  : Executing 
command(queryId=hive_20201031002253_1691548f-6fa8-4ea9-9cd4-87b70fe8f6bb): 
select * from test30
INFO  : Completed executing 
command(queryId=hive_20201031002253_1691548f-6fa8-4ea9-9cd4-87b70fe8f6bb); Time 
taken: 0.09 seconds
INFO  : OK
+------------------+-----------------+--------------+---------------+
| test30.order_id  | test30.user_id  | test30.item  | test30.state  |
+------------------+-----------------+--------------+---------------+
| 1234             | u1              | iphone7      | CA            |
| 2345             | u1              | ipad         | CA            |
| 3456             | u2              | desktop      | NY            |
 
 
+------------------+-----------------+--------------+---------------+
11 rows selected (6.975 seconds)
{code}
 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to