Great. Thanks for sharing. I added it to our ecosystem
<https://cwiki.apache.org/confluence/display/KAFKA/Ecosystem> wiki.

On Tue, Nov 25, 2014 at 9:58 AM, yazgoo <yaz...@gmail.com> wrote:

> Hi,
>
> First I'd like to thank kafka developers for writing kafka.
>
> This is an announcement for the first release of a file system logging
> agent based on kafka.
>
> It is written for collecting logs from servers running all kind of
> software,
> as a generic way to collect logs without needing to know about each logger.
>
> Home:
> https://github.com/yazgoo/fuse_kafka
>
> Here are some functionnalities:
>
>    - sends all writes to given directories to kafka
>    - passes through FS syscalls to underlying directory
>    - captures the pid, gid, uid, user, group, command line doing the write
>    - you can add metadata to identify from where the message comes from
>    (e.g. ip-address, ...)
>    - you can configure kafka destination cluster either by giving a broker
>    list or a zookeeper list
>    - you can specify a bandwidth quota: fuse_kafka won't send data if a
>    file is written more than a given size per second (useful for preventing
>    floods caused by core files dumped or log rotations in directories
> watched
>    by fuse_kafka)
>
> It is based on:
>
>    - FUSE (filesystem in userspace), to capture writes done under a given
>    directory
>    - kafka (messaging queue), as the event transport system
>    - logstash: events are written to kafka in logstash format (except
>    messages and commands which are stored in base64)
>
> It is written in C and python.
>
> Packages are provided for various distros, see installing section in
> README.md.
> FUSE adds an overhead, so it should not be used on filesystems where high
> throughput is necessary.
> Here are benchmarks:
>
> http://htmlpreview.github.io/?https://raw.githubusercontent.com/yazgoo/fuse_kafka/master/benchs/benchmarks.html
>
> Contributions are welcome, of course!
>
> Regards
>

Reply via email to