[ 
https://issues.apache.org/jira/browse/NUTCH-2631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16743131#comment-16743131
 ] 

ASF GitHub Bot commented on NUTCH-2631:
---------------------------------------

r0ann3l commented on pull request #373: fix for NUTCH-2631 contributed by 
AyalCiobotaru
URL: https://github.com/apache/nutch/pull/373#discussion_r247923127
 
 

 ##########
 File path: conf/index-writers.xml.template
 ##########
 @@ -161,4 +161,21 @@
       <remove />
     </mapping>
   </writer>
+  <writer id="indexer_kafka_1" 
class="org.apache.nutch.indexwriter.kafka.KafkaIndexWriter">
+    <parameters>
+      <param name="host" value=""/>
+      <param name="port" value=""/>
+      <param name="topic" value=""/>
+      <param name="key.serializer" value=""/>
+      <param name="value.serializer" value=""/>
+      <param name="max.doc.count" value=""/>
 
 Review comment:
   ```suggestion
         <param name="max.doc.count" value="100"/>
   ```
 
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> KafkaIndexWriter
> ----------------
>
>                 Key: NUTCH-2631
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2631
>             Project: Nutch
>          Issue Type: Improvement
>          Components: indexer
>            Reporter: Ayal Ciobotaru
>            Priority: Minor
>              Labels: patch
>   Original Estimate: 168h
>  Remaining Estimate: 168h
>
> There is no current way to index directly into Kafka in order to have a full 
> message based system controlled by Kafka. Created a KafkaIndexWriter in order 
> to produce the crawled documents into Kafka and have Kafka distribute the 
> messages as necessary.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to