Hi Mambe,

You should check out this article from Lucid Imagination:
http://www.lucidimagination.com/blog/2009/03/09/nutch-solr/

Regards,
Hannes
http://de.linkedin.com/in/hannescarlmeyer
http://www.xing.com/profile/HannesCarl_Meyer

On Sat, Mar 20, 2010 at 3:27 PM, Mambe Churchill Nanje <mambena...@afrovisiongroup.com> wrote:

> I would like to know if anybody has used Nutch for crawling and indexing
> with Solr.
> I am currently doing intranet crawling with the nutch/bin crawl
> functionality, and I use the nutch/bin solrindex function to push the Nutch
> segments, linkdb and crawldb to the Solr index.
> I am wondering: if I delete the crawl folder and do a full crawl at a later
> date, so that it gets the new content and pushes it to the Solr index, will
> this help me get only recent documents added on the Solr end? I want only
> recent documents to be indexed. So I was thinking that running the Nutch
> crawl and then pushing to Solr will only add the new documents, but I am not
> sure because I don't have experience.
> Can someone tell me if this will work? I have been searching for a way to
> re-crawl with Nutch for ages, and I think running Nutch crawl and Solr index
> over and over could be a workaround for recrawling.
>
> thanks
>
> Mambe Churchill Nanje
> 237 77545907,
> AfroVisioN Founder, President, CEO
> http://www.afrovisiongroup.com
> http://mambenanje.blogspot.com
> skypeID: mambenanje
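
For anyone following along, the crawl-then-push cycle described in the question could be sketched roughly as below. This is only a sketch under assumptions: it assumes the Nutch 1.x command-line layout (bin/nutch crawl and bin/nutch solrindex), and the install path, seed directory, crawl directory and Solr URL are made-up placeholders, not values from this thread. The helper names are hypothetical too. It defaults to a dry run that only prints the commands.

```python
import glob
import subprocess


def crawl_command(nutch_home, seed_dir, crawl_dir, depth=3, top_n=50):
    """Build the one-shot `bin/nutch crawl` invocation as an argument list."""
    return [f"{nutch_home}/bin/nutch", "crawl", seed_dir,
            "-dir", crawl_dir, "-depth", str(depth), "-topN", str(top_n)]


def solrindex_command(nutch_home, solr_url, crawl_dir, segments):
    """Build the `bin/nutch solrindex` invocation: crawldb, linkdb, then segments."""
    return [f"{nutch_home}/bin/nutch", "solrindex", solr_url,
            f"{crawl_dir}/crawldb", f"{crawl_dir}/linkdb", *segments]


def run_cycle(nutch_home, seed_dir, crawl_dir, solr_url, dry_run=True):
    """Crawl, then push every segment found under crawl_dir to Solr."""
    crawl = crawl_command(nutch_home, seed_dir, crawl_dir)
    if dry_run:
        print(" ".join(crawl))
    else:
        subprocess.run(crawl, check=True)

    # Segments only exist after the crawl has run, so list them afterwards.
    # No shell is involved, so the wildcard is expanded here in Python.
    segments = sorted(glob.glob(f"{crawl_dir}/segments/*"))
    index = solrindex_command(nutch_home, solr_url, crawl_dir, segments)
    if dry_run:
        print(" ".join(index))
    else:
        subprocess.run(index, check=True)
    return crawl, index


if __name__ == "__main__":
    # Placeholder values (assumptions); adjust to the local setup.
    run_cycle("/opt/nutch", "urls", "crawl", "http://localhost:8983/solr/")
```

Two caveats, as far as I know: deleting the crawl folder also throws away the crawldb, so the next run fetches everything again instead of only new pages; and Solr replaces an existing document when one with the same unique key is pushed again, so repeating the cycle should update pages rather than duplicate them, but it does not by itself limit the index to recent documents.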