The key thing to realize is that S3 stores objects. You can update, replace, or delete an object, but you cannot append to it.

So you have to have the complete object available before you start the upload to S3. This means that you _must_ buffer the logs somewhere before you send them into S3

now, for hdfs and similar, you have a case where updates are written to a temporary location, and then periodically those updates get merged into the other files, at which point those other files get completely re-written. As long as these temporary copies are not in S3, having the main files in S3 is just fine.

Unfortunantly, I don't think this really helps you much.

David Lang

On Tue, 13 Aug 2013, Erik Steffl wrote:

 We have this overall architecture:

 - all hosts run rsyslog and send logs to load balancer over TCP using RELP

- load balancer has one or more log aggregator hosts that run rsyslog that accept logs (also RELP protocol of course)

 - on log aggregator: write logs to S3 for storage

 - on log aggregator: write logs to e.g. RedShift (amazon distributed DB)

What is the best way to get logs to S3? Looked around and it seems that the best way is to write them to files, on rotation transfer them to S3 using custom scripts. Seems fairly reasonable, is there anything better?

Looks like omhdfs might be able to handle S3 (Flume also uses hdfs module to write to S3) but not sure if it makes much sense. How does it write logs to different hdfs files? Looking at does at http://www.rsyslog.com/doc/omhdfs.html it seems that it would just write one file (forever?).

Any thoughts on DB output? Redshift can use Postgresql client, seems like there is ompgsql (see it mentioned at http://www.rsyslog.com/doc/rsyslog_conf_modules.html) but don't see any links to docs.

 Thanks!

        erik
_______________________________________________
rsyslog mailing list
http://lists.adiscon.net/mailman/listinfo/rsyslog
http://www.rsyslog.com/professional-services/
What's up with rsyslog? Follow https://twitter.com/rgerhards
NOTE WELL: This is a PUBLIC mailing list, posts are ARCHIVED by a myriad of sites beyond our control. PLEASE UNSUBSCRIBE and DO NOT POST if you DON'T LIKE THAT.

_______________________________________________
rsyslog mailing list
http://lists.adiscon.net/mailman/listinfo/rsyslog
http://www.rsyslog.com/professional-services/
What's up with rsyslog? Follow https://twitter.com/rgerhards
NOTE WELL: This is a PUBLIC mailing list, posts are ARCHIVED by a myriad of 
sites beyond our control. PLEASE UNSUBSCRIBE and DO NOT POST if you DON'T LIKE 
THAT.

Reply via email to