[
https://issues.apache.org/jira/browse/FLUME-2437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14330238#comment-14330238
]
Otis Gospodnetic commented on FLUME-2437:
-----------------------------------------
Sounds like Johny's implementation handles all the offset stuff.... but Ashish,
I was going to point http://aws.amazon.com/cloudtrail/ as the likely the most
common source of data in S3. I *think* CloudTrail may include timestamps in
file names ... ah, yes, it's all here:
http://docs.aws.amazon.com/awscloudtrail/latest/userguide/getting_log_files_top_level.html
bq. Do we prefer Amazon SDK or jets3t for this implementation?
http://www.jets3t.org/downloads.html#latest seems to be released only very
sporadically. Maybe because S3 APIs don't change very often? But I would
think AWS itself has the best support for its own stuff.
http://stackoverflow.com/questions/12661768/jets3t-vs-aws-apis
> S3 Source
> ---------
>
> Key: FLUME-2437
> URL: https://issues.apache.org/jira/browse/FLUME-2437
> Project: Flume
> Issue Type: New Feature
> Reporter: Jonathan Natkins
> Assignee: Ashish Paliwal
>
> There have been multiple requests on the mailing list for an S3 source
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)