[ 
https://issues.apache.org/jira/browse/HADOOP-13633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15511850#comment-15511850
 ] 

Huafeng Wang commented on HADOOP-13633:
---------------------------------------

Hi Allen, thanks for the reminding and we do know there is a hadoop-kafka 
module, which contains only one class. I think it would be better to integrate 
that kafka metric sink with the new hadoop-kafka module.

> Introduce Apache Kafka as a Service into Hadoop
> -----------------------------------------------
>
>                 Key: HADOOP-13633
>                 URL: https://issues.apache.org/jira/browse/HADOOP-13633
>             Project: Hadoop Common
>          Issue Type: New Feature
>            Reporter: Huafeng Wang
>            Assignee: Huafeng Wang
>
> In HDFS-7343 we want to develop a comprehensive storage management solution 
> originated from community discussions, in order for allowing convenient, 
> intelligent and effective utilization of various HDFS facilities such as 
> erasure coding, HDFS cache, HSM offering, and etc. based on valuable insights 
> from events and data collected from namenodes, datanodes, frameworks and 
> applications via a pub-sub messaging system. In HDFS-8940 it was discussed 
> that the proposed large scale inotify feature would be better to be 
> implemented via Kafka system to allowing thousands of consumers or inotify 
> clients.
> Apache Kafka is a distributed messaging system that aims to provide a 
> unified, high-throughput, low-latency platform for handling real-time data 
> feeds, and currently it’s widely used in real-time streaming process field. 
> Considering the above two important use cases desired in Hadoop, we’d like to 
> propose to introduce Kafka as a fundamental event pub-sub service into Hadoop 
> platform. Like FileSystem offering, we’d like to provide MessagingSystem in 
> Hadoop style and conforming Hadoop security, backed by an internal or 
> external existing Kafka cluster. Generally the new service is very convenient 
> to use, and can be used to distribute and exchange various types of events 
> across IO, storage, and computation that produced by Hadoop itself, 
> frameworks or applications on top of it. Then on this basis valuable events 
> can be analyzed in a centralized way so that meaningful applications and 
> usages can be developed.
> The design document is under-going and will be submitted in a week. Feedback 
> are very welcome. Thanks!



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: common-issues-h...@hadoop.apache.org

Reply via email to