[ 
https://issues.apache.org/jira/browse/NUTCH-1631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Furkan KAMACI updated NUTCH-1631:
---------------------------------

    Attachment: NUTCH-1631.patch

This patch counts the documents added to the Solr server and writes the count 
to the context as a Hadoop counter, so one can watch how many documents have 
been added while the job is running on the Hadoop Map/Reduce Administration 
page. In addition, SolrWriter currently logs how many documents are added in 
each batch (at most the commit size) but does not log the total count at the 
end of the indexing process. This patch also logs the total document count, in 
addition to writing it to the Hadoop context as a counter.
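
The counting described above can be sketched roughly as follows. This is not 
the code in NUTCH-1631.patch and does not use the Hadoop or SolrJ APIs: the 
class CountingBatchWriter, the counter name "DOCUMENTS_ADDED" and the plain 
Map standing in for Hadoop counters are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical stand-in for a batching Solr writer; not Nutch's SolrWriter. */
class CountingBatchWriter {
    // Assumed counter name, chosen only for this sketch.
    static final String DOCUMENTS_ADDED = "DOCUMENTS_ADDED";

    // Plain map standing in for Hadoop job counters (an assumption).
    private final Map<String, Long> counters = new HashMap<>();
    private final List<String> batch = new ArrayList<>();
    private final int commitSize;
    private long totalAdded = 0;

    CountingBatchWriter(int commitSize) {
        this.commitSize = commitSize;
    }

    /** Buffers a document and flushes once the batch reaches the commit size. */
    void write(String doc) {
        batch.add(doc);
        if (batch.size() >= commitSize) {
            flush();
        }
    }

    /** Flushes any remaining documents and logs the overall total. */
    void close() {
        flush();
        System.out.println("Total " + totalAdded + " documents added");
    }

    private void flush() {
        if (batch.isEmpty()) {
            return;
        }
        int n = batch.size();
        // A real writer would send the batch to the Solr server here.
        System.out.println("Adding " + n + " documents");
        totalAdded += n;
        // Increment the counter per batch so progress is visible during the job.
        counters.merge(DOCUMENTS_ADDED, (long) n, Long::sum);
        batch.clear();
    }

    long getTotalAdded() {
        return totalAdded;
    }

    long getCounter(String name) {
        return counters.getOrDefault(name, 0L);
    }

    public static void main(String[] args) {
        CountingBatchWriter writer = new CountingBatchWriter(2);
        for (int i = 0; i < 5; i++) {
            writer.write("doc-" + i);
        }
        writer.close();
    }
}
```

Updating the counter at each flush, rather than only at close, is what would 
make the running count visible while the job is still in progress.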
                
> Display Document Count Added To Solr Server
> -------------------------------------------
>
>                 Key: NUTCH-1631
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1631
>             Project: Nutch
>          Issue Type: Improvement
>          Components: indexer
>    Affects Versions: 2.1, 2.2, 2.2.1
>            Reporter: Furkan KAMACI
>            Priority: Minor
>             Fix For: 2.3
>
>         Attachments: NUTCH-1631.patch
>
>
> Currently there is no way to see how many documents are added to the Solr 
> server. One should be able to see how many documents have been added while 
> indexing is running (as a Hadoop counter), and after all documents are added 
> the total document count should be logged too.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
