[ 
https://issues.apache.org/jira/browse/HDFS-13894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16603735#comment-16603735
 ] 

Íñigo Goiri commented on HDFS-13894:
------------------------------------

I can try to abstract some of this out of HDFS and HttpFS, make it more 
generic, and move it to Hadoop Common.

In addition, I should add some documentation.
I can add a pointer to {{hadoop-hdfs-httpfs/src/site/markdown/index.md}}.

> Access HDFS through a proxy and natively
> ----------------------------------------
>
>                 Key: HDFS-13894
>                 URL: https://issues.apache.org/jira/browse/HDFS-13894
>             Project: Hadoop HDFS
>          Issue Type: New Feature
>            Reporter: Íñigo Goiri
>            Assignee: Íñigo Goiri
>            Priority: Major
>         Attachments: HDFS-13894.000.patch
>
>
> HDFS deployments are usually behind a firewall where one can access the 
> NameNode but not the DataNodes. To mitigate this situation there are proxies 
> that relay the DataNode requests (e.g., HttpFS). However, if a user submits a 
> job using the HttpFS endpoint, all the workers will use that endpoint too, 
> which will usually become a bottleneck.
> We should create a new filesystem that supports accessing both:
> * HttpFS for submission from outside the firewall
> * HDFS from within the cluster
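
The message does not say how the proposed filesystem would decide which path to use, so the following is only a minimal sketch of one possible heuristic: probe a DataNode address and fall back to the HttpFS proxy when it is unreachable. The function names, hostnames, and ports are hypothetical and are not taken from the attached patch.

```python
import socket


def can_connect(host, port, timeout=1.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def choose_endpoint(native_uri, proxy_uri, datanode_addr, probe=can_connect):
    """Pick the native HDFS URI when a DataNode is reachable, else the proxy URI.

    datanode_addr is a (host, port) tuple for any DataNode in the cluster.
    This heuristic is an assumption made for illustration only.
    """
    host, port = datanode_addr
    if probe(host, port):
        # Inside the firewall: DataNodes are reachable, so talk to HDFS directly.
        return native_uri
    # Outside the firewall: only the proxy can reach the DataNodes.
    return proxy_uri


if __name__ == "__main__":
    # Example values only; replace with addresses from a real deployment.
    print(choose_endpoint(
        "hdfs://nn.example.com:8020",
        "webhdfs://httpfs.example.com:14000",
        ("dn1.example.com", 9866),
    ))
```

With a selection step like this, a client outside the firewall would submit through HttpFS, while workers inside the cluster would resolve the same logical path to HDFS and avoid funnelling traffic through the proxy.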



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
