[ https://issues.apache.org/jira/browse/BEAM-371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Neville Li updated BEAM-371:
----------------------------
Description:
Right now there is a {{beam-sdks-java-io-hdfs}} module but only
{{HDFSFileSource}} is implemented and there's a known issue with reading Avro
files.
https://github.com/GoogleCloudPlatform/DataflowJavaSDK/issues/102
We at Spotify have implemented HDFS sinks, specialized source/sink for Avro and
simple authentication and would like to port it back to Beam.
https://github.com/apache/incubator-beam/pull/485
was:
Right now there is a {{beam-sdks-java-io-hdfs}} module but only
{{HDFSFileSource}} is implemented and there's a known issue with reading Avro
files.
https://github.com/GoogleCloudPlatform/DataflowJavaSDK/issues/102
We at Spotify have implemented HDFS sinks, specialized source/sink for Avro and
simple authentication and would like to port it back to Beam.
> Backport HDFS IO enhancements from Scio
> ---------------------------------------
>
> Key: BEAM-371
> URL: https://issues.apache.org/jira/browse/BEAM-371
> Project: Beam
> Issue Type: Improvement
> Components: sdk-java-extensions
> Affects Versions: 0.1.0-incubating
> Reporter: Neville Li
> Assignee: James Malone
> Priority: Minor
>
> Right now there is a {{beam-sdks-java-io-hdfs}} module but only
> {{HDFSFileSource}} is implemented and there's a known issue with reading Avro
> files.
> https://github.com/GoogleCloudPlatform/DataflowJavaSDK/issues/102
> We at Spotify have implemented HDFS sinks, specialized source/sink for Avro
> and simple authentication and would like to port it back to Beam.
> https://github.com/apache/incubator-beam/pull/485