[ 
https://issues.apache.org/jira/browse/HIVE-11527?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15137936#comment-15137936
 ] 

Jing Zhao commented on HIVE-11527:
----------------------------------

So IIUC, after replacing hdfs with webhdfs, the new URI will still be resolved 
by FileSystem and finally a WebHdfsFileSystem instance will be created and 
used? In that case, I think we can:
# If there is both hostname and port contained in the original hdfs URI, both 
"hdfs" and port need to be replaced
# If there is no port in the original URI, this can be a logical URI (for 
NameNode HA setup). Since WebHdfsFileSystem can also correctly handle logical  
URI, replacing "hdfs" with "webhdfs" should be good enough
# It is also possible the URI only contains host name but no port, and the 
default port will be loaded from configuration for either hdfs/webhdfs. In that 
case replacing "hdfs" with "webhdfs" should also work.

> bypass HiveServer2 thrift interface for query results
> -----------------------------------------------------
>
>                 Key: HIVE-11527
>                 URL: https://issues.apache.org/jira/browse/HIVE-11527
>             Project: Hive
>          Issue Type: Improvement
>          Components: HiveServer2
>            Reporter: Sergey Shelukhin
>            Assignee: Takanobu Asanuma
>         Attachments: HIVE-11527.WIP.patch
>
>
> Right now, HS2 reads query results and returns them to the caller via its 
> thrift API.
> There should be an option for HS2 to return some pointer to results (an HDFS 
> link?) and for the user to read the results directly off HDFS inside the 
> cluster, or via something like WebHDFS outside the cluster
> Review board link: https://reviews.apache.org/r/40867



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to