[ 
https://issues.apache.org/jira/browse/NIFI-856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mike de Rhino  updated NIFI-856:
--------------------------------
    Description: 
It would be great if NIFI could support the [lumberjack 
protocol|https://github.com/elastic/logstash-forwarder/blob/master/PROTOCOL.md] 
so to enable the use of logstash forwarder as a source of data.

A lot of non Java shops tend to avoid installing Java at data producing nodes 
and instead of Flume they end up using things like kafka, heka, fluentd or 
logstash-forwarded as data shipping mechanisms. 

Kafka is great but its architecture seem to be better focused on multi-DC 
environments instead of multi-branch scenarios (imagine having to manager 80 
Zookeeper quorum, one for each country where you operate?)

[Heka|https://github.com/mozilla-services/heka] is fine, it has decent 
backpressure buffering but no concept of acknowledgement on the receiving side 
of a TCP stream. If the other end of a TCP stream is capable of listening but 
gets stuck with its messages it will keep spitting data through the pipe, 
oblivious to the woes at the other end.

Logstash forwarded in the other hand, is a quite simple tool, with a reasonable 
implementation of acknowledgments on the receiving side but... it depends on 
Logstash(and logstash has its own issues).

It would be great if NIFI could serve as a middle man, receiving lumberjack 
messages and offloading some of the hard work Logstash seems to struggle with 
(e.g. using NIFI to save to HDFS while a downstream Logstash writes into ES).

  was:
It would be great if NIFI could support the lumberjack protocol so to enable 
the use of logstash forwarder as a source of data.

A lot of non Java shops tend to avoid installing Java at data producing nodes 
and instead of Flume they end up using things like kafka, heka, fluentd or 
logstash-forwarded as data shipping mechanisms. 

Kafka is great but its architecture seem to be better focused on multi-DC 
environments instead of multi-branch scenarios (imagine having to manager 80 
Zookeeper quorum, one for each country where you operate?)

[Heka|https://github.com/mozilla-services/heka] is fine, it has decent 
backpressure buffering but no concept of acknowledgement on the receiving side 
of a TCP stream. If the other end of a TCP stream is capable of listening but 
gets stuck with its messages it will keep spitting data through the pipe, 
oblivious to the woes at the other end.

Logstash forwarded in the other hand, is a quite simple tool, with a reasonable 
implementation of acknowledgments on the receiving side but... it depends on 
Logstash(and logstash has its own issues).

It would be great if NIFI could serve as a middle man, receiving [lumberjack 
messages|https://github.com/elastic/logstash-forwarder/blob/master/PROTOCOL.md] 
and offloading some of the hard work Logstash seems to struggle with (e.g. 
using NIFI to save to HDFS while a downstream Logstash writes into ES).

        Summary: Add Processor for Lumberjack protocol  (was: Add suport to 
Lumberjack protocol)

> Add Processor for Lumberjack protocol
> -------------------------------------
>
>                 Key: NIFI-856
>                 URL: https://issues.apache.org/jira/browse/NIFI-856
>             Project: Apache NiFi
>          Issue Type: New Feature
>            Reporter: Mike de Rhino 
>              Labels: features
>
> It would be great if NIFI could support the [lumberjack 
> protocol|https://github.com/elastic/logstash-forwarder/blob/master/PROTOCOL.md]
>  so to enable the use of logstash forwarder as a source of data.
> A lot of non Java shops tend to avoid installing Java at data producing nodes 
> and instead of Flume they end up using things like kafka, heka, fluentd or 
> logstash-forwarded as data shipping mechanisms. 
> Kafka is great but its architecture seem to be better focused on multi-DC 
> environments instead of multi-branch scenarios (imagine having to manager 80 
> Zookeeper quorum, one for each country where you operate?)
> [Heka|https://github.com/mozilla-services/heka] is fine, it has decent 
> backpressure buffering but no concept of acknowledgement on the receiving 
> side of a TCP stream. If the other end of a TCP stream is capable of 
> listening but gets stuck with its messages it will keep spitting data through 
> the pipe, oblivious to the woes at the other end.
> Logstash forwarded in the other hand, is a quite simple tool, with a 
> reasonable implementation of acknowledgments on the receiving side but... it 
> depends on Logstash(and logstash has its own issues).
> It would be great if NIFI could serve as a middle man, receiving lumberjack 
> messages and offloading some of the hard work Logstash seems to struggle with 
> (e.g. using NIFI to save to HDFS while a downstream Logstash writes into ES).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to