[
https://issues.apache.org/jira/browse/KAFKA-2004?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14352237#comment-14352237
]
Neha Narkhede commented on KAFKA-2004:
--------------------------------------
Kafka uses local storage for latency and replication guarantees. Though of the
popular ways of copying data from Kafka into HDFS is Camus. You can take a look
at that here - http://confluent.io/docs/current/camus/docs/index.html. Is there
anything in particular you are looking for with the Storm/Spark streaming
approach?
> Write Kafka messages directly to HDFS
> -------------------------------------
>
> Key: KAFKA-2004
> URL: https://issues.apache.org/jira/browse/KAFKA-2004
> Project: Kafka
> Issue Type: Bug
> Components: consumer, core, producer
> Affects Versions: 0.8.1.1
> Reporter: sutanu das
> Assignee: Neha Narkhede
> Priority: Critical
>
> 1. Is there a way to write Kafka messages directly to HDFS without writing
> any consumer code?
> 2. Is there anyway to integrate Kafka with Storm or Spark so messages goes
> directly from Kafka consumers to HDFS sync?
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)