[ 
https://issues.apache.org/jira/browse/FLUME-2437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14330238#comment-14330238
 ] 

Otis Gospodnetic commented on FLUME-2437:
-----------------------------------------

Sounds like Johny's implementation handles all the offset stuff... but Ashish, 
I was going to point to http://aws.amazon.com/cloudtrail/ as likely the most 
common source of data in S3.  I *think* CloudTrail may include timestamps in 
file names ... ah, yes, it's all here: 
http://docs.aws.amazon.com/awscloudtrail/latest/userguide/getting_log_files_top_level.html
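
To illustrate how a source could use those timestamps, here is a rough sketch 
(not anything from the patch). It assumes the file name layout described in 
the CloudTrail guide linked above, i.e. 
AccountID_CloudTrail_Region_YYYYMMDDTHHmmZ_UniqueString.json.gz. The helper 
names cloudtrail_timestamp and newer_than are made up for this example.

```python
import re
from datetime import datetime, timezone

# Assumed layout: AccountID_CloudTrail_Region_YYYYMMDDTHHmmZ_UniqueString.json.gz
_NAME = re.compile(
    r"^(?P<account>\d{12})_CloudTrail_(?P<region>[a-z0-9-]+)_"
    r"(?P<ts>\d{8}T\d{4}Z)_(?P<unique>[A-Za-z0-9]+)\.json\.gz$"
)


def cloudtrail_timestamp(key):
    """Return the UTC timestamp embedded in an object key, or None if no match."""
    name = key.rsplit("/", 1)[-1]  # drop any S3 prefix, keep the file name
    match = _NAME.match(name)
    if match is None:
        return None
    parsed = datetime.strptime(match.group("ts"), "%Y%m%dT%H%MZ")
    return parsed.replace(tzinfo=timezone.utc)


def newer_than(keys, last_seen):
    """Return keys stamped strictly after last_seen, oldest first."""
    stamped = ((cloudtrail_timestamp(k), k) for k in keys)
    fresh = [(ts, k) for ts, k in stamped if ts is not None and ts > last_seen]
    return [k for _, k in sorted(fresh)]
```

A source could remember the newest timestamp it has processed and pass it as 
last_seen on the next listing, so already-consumed files get skipped without 
tracking per-file state.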

bq. Do we prefer Amazon SDK or jets3t for this implementation?
http://www.jets3t.org/downloads.html#latest seems to be released only very 
sporadically.  Maybe because S3 APIs don't change very often?  But I would 
think AWS itself has the best support for its own stuff.  
http://stackoverflow.com/questions/12661768/jets3t-vs-aws-apis

> S3 Source
> ---------
>
>                 Key: FLUME-2437
>                 URL: https://issues.apache.org/jira/browse/FLUME-2437
>             Project: Flume
>          Issue Type: New Feature
>            Reporter: Jonathan Natkins
>            Assignee: Ashish Paliwal
>
> There have been multiple requests on the mailing list for an S3 source



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
