If you use local dns server for resolving, you should write nameservers in resolv.conf which nutch working servers.
You should be sure nutch's server can resolve this site. If you use console you ycan use lynx for checking Talat 3 Nis 2014 08:02 tarihinde "John Lafitte" <[email protected]> yazdı: > reddibabu, > > I cannot resolve wiki.ibm.com so I'm guessing nutch can't either. Is that > an internal dns record? > > > On Wed, Apr 2, 2014 at 11:54 PM, reddibabu <[email protected]> wrote: > > > Hi All, > > > > I am using Apache Nutch 1.7. I can able to crawl and index all most all > > sites except "wiki" pages. > > While trying to crawl wiki pages it is saying that "fetch of > > http://wiki.ibm.com/ failed with: java.net.UnknownHostException: > > wiki.ibm.com". > > > > Is it require any additional configuration for crawling wiki pages. > > Anyone assist me on the same would be helpful a lot. > > > > > > Thanks in advance. > > Reddi Babu > > > > > > > > -- > > View this message in context: > > > http://lucene.472066.n3.nabble.com/Unable-to-crawl-wiki-pages-through-Nutch-tp4128772.html > > Sent from the Nutch - User mailing list archive at Nabble.com. > > >

