[ 
https://issues.apache.org/jira/browse/NUTCH-2631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16632496#comment-16632496
 ] 

ASF GitHub Bot commented on NUTCH-2631:
---------------------------------------

r0ann3l commented on a change in pull request #373: fix for NUTCH-2631 
contributed by AyalCiobotaru
URL: https://github.com/apache/nutch/pull/373#discussion_r221314429
 
 

 ##########
 File path: conf/index-writers.xml.template
 ##########
 @@ -161,4 +161,19 @@
       <remove />
     </mapping>
   </writer>
+  <writer id="indexer_kafka_1" 
class="org.apache.nutch.indexwriter.kafka.KafkaIndexWriter">
+    <parameters>
+      <param name="host" value=""/>
+      <param name="port" value=""/>
+      <param name="index" value=""/>
 
 Review comment:
   index parameter is not used to configure the producer. So, I think it should 
be removed from here or used.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> KafkaIndexWriter
> ----------------
>
>                 Key: NUTCH-2631
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2631
>             Project: Nutch
>          Issue Type: Improvement
>          Components: indexer
>            Reporter: Ayal Ciobotaru
>            Priority: Minor
>              Labels: patch
>   Original Estimate: 168h
>  Remaining Estimate: 168h
>
> There is no current way to index directly into Kafka in order to have a full 
> message based system controlled by Kafka. Created a KafkaIndexWriter in order 
> to produce the crawled documents into Kafka and have Kafka distribute the 
> messages as necessary.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to