Hi, I am getting UnknownHostException while crawling the secured sites. This the error message i am getting.
INFO fetcher.Fetcher - fetch of https://secure.americanexpress.com/navigation failed with: org.apache.nutch.protocol.http.api.HttpException: java.net.UnknownHostException: secure.americanexpress.com 2008-12-05 16:49:13,442 INFO api.RobotRulesParser - Couldn't get robots.txt for http://home.americanexpress.com/home/mt_personal.shtml: javax.net.ssl.SSLException: Unrecognized SSL message, plaintext connection? 2008-12-05 16:49:13,817 INFO httpclient.HttpMethodDirector - Redirect requested but followRedirects is disabled 2008-12-05 16:49:14,051 INFO api.RobotRulesParser - Couldn't get robots.txt for http://home.americanexpress.com/home/mt_personal.shtml: javax.net.ssl.SSLException: Unrecognized SSL message, plaintext connection? 2008-12-05 16:49:14,051 ERROR httpclient.Http - org.apache.nutch.protocol.http.api.HttpException: java.net.UnknownHostException: home.americanexpress.com 2008-12-05 16:49:14,051 ERROR httpclient.Http - at org.apache.nutch.protocol.http.api.HttpBase.blockAddr(HttpBase.java:340) 2008-12-05 16:49:14,051 ERROR httpclient.Http - at org.apache.nutch.protocol.http.api.HttpBase.getProtocolOutput(HttpBase.java:212) 2008-12-05 16:49:14,051 ERROR httpclient.Http - at org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:145) 2008-12-05 16:49:14,051 ERROR httpclient.Http - Caused by: java.net.UnknownHostException: home.americanexpress.com 2008-12-05 16:49:14,051 ERROR httpclient.Http - at java.net.InetAddress.getAllByName0(Unknown Source) 2008-12-05 16:49:14,051 ERROR httpclient.Http - at java.net.InetAddress.getAllByName(Unknown Source) 2008-12-05 16:49:14,051 ERROR httpclient.Http - at java.net.InetAddress.getAllByName(Unknown Source) 2008-12-05 16:49:14,051 ERROR httpclient.Http - at java.net.InetAddress.getByName(Unknown Source) 2008-12-05 16:49:14,051 ERROR httpclient.Http - at org.apache.nutch.protocol.http.api.HttpBase.blockAddr(HttpBase.java:336) 2008-12-05 16:49:14,051 ERROR httpclient.Http - ... 2 more 2008-12-05 16:49:14,051 Vimal Varghese =====-----=====-----===== Notice: The information contained in this e-mail message and/or attachments to it may contain confidential or privileged information. If you are not the intended recipient, any dissemination, use, review, distribution, printing or copying of the information contained in this e-mail message and/or attachments to it are strictly prohibited. If you have received this communication in error, please notify us by reply e-mail or telephone and immediately and permanently delete the message and any attachments. Thank you
