[
https://issues.apache.org/jira/browse/NUTCH-2388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16021527#comment-16021527
]
Hudson commented on NUTCH-2388:
-------------------------------
SUCCESS: Integrated in Jenkins build Nutch-nutchgora #1584 (See
[https://builds.apache.org/job/Nutch-nutchgora/1584/])
NUTCH-2388 bin/crawl indexing only webpages containing batchID instead
(kaidulislam90:
[https://github.com/apache/nutch/commit/32a57b52a67cd5c2cb637c6fbae2dfce5a2c27b5])
* (edit) src/bin/crawl
> bin/crawl indexing only webpages containing batchID instead of all in 2.x
> -------------------------------------------------------------------------
>
> Key: NUTCH-2388
> URL: https://issues.apache.org/jira/browse/NUTCH-2388
> Project: Nutch
> Issue Type: Bug
> Components: bin
> Affects Versions: 2.3
> Reporter: Kaidul Islam
> Assignee: Kaidul Islam
> Priority: Trivial
> Fix For: 2.4
>
> Original Estimate: 24h
> Remaining Estimate: 24h
>
> During each iteration, after generating, fetching, parsing and updating the
> current batch into DB, the indexer is supposed to index the current batch
> too. But its indexing all currently.
> {code}
> __bin_nutch index $commonOptions -D solr.server.url=$SOLRURL -all -crawlId
> "$CRAWL_ID"
> {code}
> It should be like below i guess -
> {code}
> __bin_nutch index $commonOptions -D solr.server.url=$SOLRURL $batchId
> -crawlId "$CRAWL_ID"
> {code}
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)