If you use local dns server for resolving, you should write nameservers in
resolv.conf which nutch working servers.

You should be sure nutch's server can resolve this site. If you use console
you ycan use lynx for checking

Talat
3 Nis 2014 08:02 tarihinde "John Lafitte" <[email protected]> yazdı:

> reddibabu,
>
> I cannot resolve wiki.ibm.com so I'm guessing nutch can't either.  Is that
> an internal dns record?
>
>
> On Wed, Apr 2, 2014 at 11:54 PM, reddibabu <[email protected]> wrote:
>
> > Hi All,
> >
> > I am using Apache Nutch 1.7. I can able to crawl and index all most all
> > sites  except "wiki" pages.
> > While trying to crawl wiki pages it is saying that "fetch of
> > http://wiki.ibm.com/ failed with: java.net.UnknownHostException:
> > wiki.ibm.com".
> >
> > Is it require any additional configuration for crawling wiki pages.
> > Anyone assist me on the same would be helpful a lot.
> >
> >
> > Thanks in advance.
> > Reddi Babu
> >
> >
> >
> > --
> > View this message in context:
> >
> http://lucene.472066.n3.nabble.com/Unable-to-crawl-wiki-pages-through-Nutch-tp4128772.html
> > Sent from the Nutch - User mailing list archive at Nabble.com.
> >
>

Reply via email to