[ https://issues.apache.org/jira/browse/BEAM-3099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Udi Meiri resolved BEAM-3099.
-----------------------------
Resolution: Implemented
Fix Version/s: 2.5.0
> Implement HDFS FileSystem for Python SDK
> ----------------------------------------
>
> Key: BEAM-3099
> URL: https://issues.apache.org/jira/browse/BEAM-3099
> Project: Beam
> Issue Type: New Feature
> Components: sdk-py-core
> Reporter: Chamikara Jayalath
> Assignee: Udi Meiri
> Priority: Major
> Fix For: 2.5.0
>
> Time Spent: 1h 50m
> Remaining Estimate: 0h
>
> Currently the Java SDK has HDFS support but the Python SDK does not. With the
> current portability efforts, other runners may soon be able to use the Python
> SDK. Having HDFS support will allow these runners to execute large-scale jobs
> without using GCS.
> The following post suggests some libraries that can be used to connect to HDFS
> from Python:
> http://wesmckinney.com/blog/python-hdfs-interfaces/