[ 
https://issues.apache.org/jira/browse/BEAM-2500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16270310#comment-16270310
 ] 

ASF GitHub Bot commented on BEAM-2500:
--------------------------------------

jacobmarble commented on issue #4080: [BEAM-2500] Add S3 FileSystem to Java SDK
URL: https://github.com/apache/beam/pull/4080#issuecomment-347774873
 
 
   There are a couple of TODOs remaining. Here are some notes from tonight:
   - rebased to master and squashed
   - WordCount works, reading from and writing to S3
   - need to do some bigger testing
   - haven't looked at coverage yet
   - Kinesis tests fail for me both before and after this PR, so I made a fix 
and we'll see what Jenkins does with it. If that doesn't work, AWS SDK version 
1.11.24 is the last version that should pass without any change in Beam.
   - there are two open questions to @jkff , mentioned inline
   - tried to do more tonight, but fixing the content encoding bug turned out 
to be more complex than I anticipated.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


> Add support for S3 as a Apache Beam FileSystem
> ----------------------------------------------
>
>                 Key: BEAM-2500
>                 URL: https://issues.apache.org/jira/browse/BEAM-2500
>             Project: Beam
>          Issue Type: Improvement
>          Components: sdk-java-extensions
>            Reporter: Luke Cwik
>            Assignee: Jacob Marble
>            Priority: Minor
>         Attachments: hadoop_fs_patch.patch
>
>
> Note that this is for providing direct integration with S3 as an Apache Beam 
> FileSystem.
> There is already support for using the Hadoop S3 connector by depending on 
> the Hadoop File System module[1], configuring HadoopFileSystemOptions[2] with 
> a S3 configuration[3].
> 1: https://github.com/apache/beam/tree/master/sdks/java/io/hadoop-file-system
> 2: 
> https://github.com/apache/beam/blob/master/sdks/java/io/hadoop-file-system/src/main/java/org/apache/beam/sdk/io/hdfs/HadoopFileSystemOptions.java#L53
> 3: https://wiki.apache.org/hadoop/AmazonS3



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to