[ 
https://issues.apache.org/jira/browse/FLUME-2523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14191375#comment-14191375
 ] 

Hari Shreedharan commented on FLUME-2523:
-----------------------------------------

Thanks for writing these docs!

A few minor comments:
- The example shows kafka-1:9092 etc. That should be a full hostname, correct? 
(for both ZK and broker list).
- Mention that class that is used when parseAsFlumeEvent is true. This would be 
required if we just want Flume to write to Kafka and not remove it from the 
channel - so other consumers are able to remove the data (Log4j->Flume->Kafka 
etc). The class is org.apache.flume.source.avro.AvroFlumeEvent provided by the 
flume-ng-sdk artifact.
- Worth mentioning how multiple channels must use the same topic and group to 
ensure that when one agent fails another can remove the data.
- readSmallestOffset matters only if parseAsFlumeEvent is false. We should also 
mention why it is better to keep it to false (or not set it at all) if 
parseAsFlumeEvent is false.
- Mention that producer.type will always be sync and other properties which we 
override anyway - even if the customer has overriden it.
- Mention that by using Kafka Channel -> HDFS/HBase/Solr sink, Kafka+Flume 
provides low latency, fault-tolerant writes to the corresponding system.
 


> Document Kafka channel
> ----------------------
>
>                 Key: FLUME-2523
>                 URL: https://issues.apache.org/jira/browse/FLUME-2523
>             Project: Flume
>          Issue Type: Task
>          Components: Docs
>            Reporter: Gwen Shapira
>            Assignee: Gwen Shapira
>         Attachments: FLUME-2523.0.patch
>
>
> FLUME-2500 adds a Kafka channel. We need to document its usage.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to