Unable to resolve the url-blah-blah, skipping
---------------------------------------------
Key: NUTCH-805
URL: https://issues.apache.org/jira/browse/NUTCH-805
Project: Nutch
Issue Type: Bug
Components: fetcher
Affects Versions: 0.9.0
Environment: CentOS, Nutch -0.9, jdk1.6.0_18
Reporter: P Kaustubh
I configured the nutch-0.9 as well as nutch-1.0 to crawl intranet website. The
machine access the internet/intranet using proxy i had made this setup in
nutch-default.xml
everything works well untill i run script, when fetcher tries to access the
urls from seed gives error as
unable to resolve www.urladdres.com , skipping
QueueFeeder finished: total 1 records.
-finishing thread FetcherThread, activeThreads=0
-finishing thread FetcherThread, activeThreads=0
-finishing thread FetcherThread, activeThreads=0
-finishing thread FetcherThread, activeThreads=0
-finishing thread FetcherThread, activeThreads=0
-activeThreads=0, spinWaiting=0, fetchQueues.totalSize=0
-activeThreads=0
Fetcher: done
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.