Hi,

I have configured nutch 1.2 in Eclipse project. 

I need to run crawl from java code to follow it with debug.

 

This is the script in linux that I execute for crawl.

 

.         bin/nutch inject /home/administrator/nutch/albanian_crawl/crawldb
my_urls

.         bin/nutch generate
/home/administrator/nutch/albanian_crawl/crawldb
/home/administrator/nutch/albanian_crawl/segments

.         segment=`ls -d
/home/administrator/nutch/albanian_crawl/segments/2* | tail -1`

.         bin/nutch fetch $segment

.         bin/nutch updatedb
/home/administrator/nutch/albanian_crawl/crawldb $segment

.         bin/nutch mergesegs
/home/administrator/nutch/albanian_crawl/segments
/home/administrator/nutch/albanian_crawl/segments/*

.         bin/nutch invertlinks
/home/administrator/nutch/albanian_crawl/linkdb
/home/administrator/nutch/albanian_crawl/segments/*

.         bin/nutch index /home/administrator/nutch/albanian_crawl/indexes
/home/administrator/nutch/albanian_crawl/crawldb
/home/administrator/nutch/albanian_crawl/linkdb
/home/administrator/nutch/albanian_crawl/segments/*

.         bin/nutch dedup /home/administrator/nutch/albanian_crawl/indexes

 

Can anybody help to translate it in java.

 

 

Thanks in advance ,

Marseld.

 

Reply via email to