Hi Mambe,
you should check out this article from Lucid Imagination:
http://www.lucidimagination.com/blog/2009/03/09/nutch-solr/
Regards,
Hannes

http://de.linkedin.com/in/hannescarlmeyer
http://www.xing.com/profile/HannesCarl_Meyer

On Sat, Mar 20, 2010 at 3:27 PM, Mambe Churchill Nanje <
mambena...@afrovisiongroup.com> wrote:

> I would like to know if anybody has used Nutch for crawling and indexing
> with Solr.
>  I am currently doing intranet crawling with the nutch/bin crawl
> functionality, and I use the nutch/bin solrindex function to push the Nutch
> segments, linkdb and crawldb to the Solr index.
>  I am wondering: if I delete the crawl folder and do a full crawl at a
> later date, and push the new content to the Solr index, will only the
> recent documents be added to the Solr index? I want only recent documents
> to be indexed. I was thinking that a Nutch crawl followed by a push to Solr
> would add only the new documents, but I am not sure because I don't have
> experience.
>  Can someone tell me if this will work? I have been searching for a way to
> re-crawl with Nutch for a long time, and I think running nutch crawl and
> solrindex over and over could be a workaround for recrawling.
>
> thanks
>
> Mambe Churchill Nanje
> 237 77545907,
> AfroVisioN Founder, President,CEO
> http://www.afrovisiongroup.com
> http://mambenanje.blogspot.com
> skypeID: mambenanje
>
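
A rough sketch of what repeated crawl-and-push cycles would probably do,
assuming the Solr schema uses the page URL as its unique key (check your own
schema.xml to confirm). This is a plain Python dict standing in for the index,
not Nutch or Solr code; the function name and the URLs are made up for
illustration.

```python
# Toy model: an index keyed by a unique field (assumed here to be the page URL).
# Pushing a document whose key already exists overwrites the old entry.

def push_to_index(index, crawled_docs):
    """Add or overwrite one entry per URL; return (added, updated) URL lists."""
    added, updated = [], []
    for url, content in crawled_docs.items():
        if url in index:
            updated.append(url)
        else:
            added.append(url)
        index[url] = content
    return added, updated


index = {}

# First full crawl, pushed to the empty index.
first_crawl = {"http://intranet/a": "v1", "http://intranet/b": "v1"}
push_to_index(index, first_crawl)

# Crawl folder deleted, second full crawl: page a changed, b is gone, c is new.
second_crawl = {"http://intranet/a": "v2", "http://intranet/c": "v1"}
added, updated = push_to_index(index, second_crawl)

# Entries the second crawl never touched are still sitting in the index.
stale = sorted(set(index) - set(second_crawl))

print("added:", added)
print("updated:", updated)
print("stale:", stale)
```

Under that assumption, a fresh crawl pushed to the same index adds the new
pages and overwrites the changed ones without creating duplicates, but pages
that have disappeared from the site stay in the index until they are deleted
separately.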
