I was hoping that someone on the list might be able to assist me. I've been successfully using wget 1.10.2 on an internal intranet to spider IIS 6.0 and SharePoint sites. Here's an example of a verbose, debug-enabled run:

C:\wget>wget -v -d --http-user="DOMAIN\FIRST_LAST" --http-password="XXXXX" --no-proxy http://some.sharepointsite.com
Setting --http-user (httpuser) to "DOMAIN\FIRST_LAST"
Setting --http-password (httppassword) to XXXXX
Setting --proxy (useproxy) to 0
DEBUG output created by Wget 1.10.2 on Windows.

--01:30:28--  http://some.sharepointsite.com/
           => `index.html'
Resolving some.sharepointsite.com... seconds 0.00, XXX.XXX.XXX.XXX
Caching some.sharepointsite.com => XXX.XXX.XXX.XXX
Connecting to some.sharepointsite.com|XXX.XXX.XXX.XXX|:80... seconds 0.00, connected.
Created socket 1892.
Releasing 0x003d49a8 (new refcount 1).

---request begin---
GET / HTTP/1.0
User-Agent: Wget/1.10.2
Accept: */*
Authorization: Basic YW1lcmljYXNcYnJ5YW5faG9mZnBhdWlyOlNBUzE2QDliOGo3Nmg=
Host: some.sharepointsite.com
Connection: Keep-Alive
---request end---
HTTP request sent, awaiting response...

---response begin---
HTTP/1.1 401 Access denied
Connection: close
Date: Sun, 02 Nov 2008 06:30:28 GMT
Server: Microsoft-IIS/6.0
WWW-Authenticate: NTLM
MicrosoftSharePointTeamServices: 12.0.0.6315
X-Powered-By: ASP.NET
Content-type: text/html
---response end---
401 Access denied
Closed fd 1892
Empty NTLM message, starting transaction.
Creating a type-1 NTLM message.
Found some.sharepointsite.com in host_name_addresses_map (003D49A8)
Connecting to some.sharepointsite.com|XXX.XXX.XXX.XXX|:80... seconds 0.00, connected.
Created socket 1892.
Releasing 0x003d49a8 (new refcount 1).

---request begin---
GET / HTTP/1.0
User-Agent: Wget/1.10.2
Accept: */*
Authorization: NTLM TlRMTVNTUAABAAAAAgIAAAAAAAAgAAAAAAAAACAAAAA=
Host: some.sharepointsite.com
Connection: Keep-Alive
---request end---
HTTP request sent, awaiting response...

---response begin---
HTTP/1.1 401 Unauthorized
Content-Length: 1539
Content-Type: text/html
Server: Microsoft-IIS/6.0
WWW-Authenticate: NTLM TlRMTVNTUAACAAAAAAAAADgAAAACAgACazTDgoeCGb4AAAAAAAAAAAAA AA4AAAABQLODgAAAA8=
MicrosoftSharePointTeamServices: 12.0.0.6315
X-Powered-By: ASP.NET
Date: Sun, 02 Nov 2008 06:30:28 GMT
Connection: keep-alive
---response end---
401 Unauthorized
Registered socket 1892 for persistent reuse.
Skipping 1539 bytes of body: [<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN" "http://www.w3.org/TR/html4/strict.dtd">
<HTML><HEAD><TITLE>You are not authorized to view this page</TITLE>
<META HTTP-EQUIV="Content-Type" Content="text/html; charset=Windows-1252">
<STYLE type="text/css">
  BODY { font: 8pt/12pt verdana }
  H1 { font: 13pt/15pt verdana }
  H2 { font: 8pt/12pt verdana }
  A:link { color: red }
  A:visited { color: maroon }
</STYLE>
</HEAD><BODY><TABLE width=500 border=0 cellspacing=10><TR><TD>
<h1>You are not authorized to view this page</h1>
You do not have permission to view this directory or page using the credentials that you supplied.
<hr>
<p>Please try the following:</p>
<ul>
<li>Contact the Web site administrator if you believe you should be able to view this directory or page.</li>
<li>Click the <a href="javascript:location.reload()">Refresh</a> button to try again with different credentials.</li>
</ul>
<h2>HTTP Error 401.1 - Unauthorized: Access is denied due to invalid credentials.<br>Internet Information Services (IIS)</h2>
<hr>
<p>Technical Information (for support personnel)</p>
<ul>
<li>Go to <a href="http://go.microsoft.com/fwlink/?linkid=8180">Microsoft Product Support Services</a> and perform a title search for the words <b>HTTP</b> and <b>401</b>.</li>
<li>Open <b>IIS Help</b>, which is accessible in IIS Manager (inetmgr), and search for topics titled <b>Authentication</b>, <b>Access Control</b>, and <b>About Custom Error Messages</b>.</li>
</ul>
</TD></TR></TABLE></BODY></HTML>
] done.
Received a type-2 NTLM message.
Creating a type-3 NTLM message.
Reusing existing connection to some.sharepointsite.com:80.
Reusing fd 1892.

---request begin---
GET / HTTP/1.0
User-Agent: Wget/1.10.2
Accept: */*
Authorization: NTLM TlRMTVNTUAADAAAAGAAYAFcAAAAYABgAbwAAAAgACABAAAAADwAPAEgAAAA AAAAVwAAAAAAAACHAAAAAYIAAGFtZXJpY2FzYnJ5YW5faG9mZnBhdWlyZYGWc7NKTESDADIU3oGrrRH hZk7fx6hFn4RRpBS29cxvACr4AXt2Tb1Mib9Ldcv
Host: some.sharepointsite.com
Connection: Keep-Alive
---request end---
HTTP request sent, awaiting response...

---response begin---
HTTP/1.1 302 Object Moved
Content-Length: 162
Content-Type: text/html
Location: http://some.sharepointsite.com/Pages/default.aspx
Server: Microsoft-IIS/6.0
MicrosoftSharePointTeamServices: 12.0.0.6315
X-Powered-By: ASP.NET
Date: Sun, 02 Nov 2008 06:30:28 GMT
Connection: keep-alive
---response end---
302 Object Moved
Location: http://some.sharepointsite.com/Pages/default.aspx [following]
Skipping 162 bytes of body: [<head><title>Document Moved</title></head>
<body><h1>Object Moved</h1>This document may be found <a HREF="http://moss.dell com/Pages/default.aspx">here</a></body>] done.

--01:30:28--  http://some.sharepointsite.com/Pages/default.aspx
           => `default.aspx'
Reusing existing connection to some.sharepointsite.com:80.
Reusing fd 1892.

---request begin---
GET /Pages/default.aspx HTTP/1.0
User-Agent: Wget/1.10.2
Accept: */*
Host: some.sharepointsite.com
Connection: Keep-Alive
---request end---
HTTP request sent, awaiting response...

---response begin---
HTTP/1.1 200 OK
Connection: keep-alive
Date: Sun, 02 Nov 2008 06:30:29 GMT
Server: Microsoft-IIS/6.0
MicrosoftSharePointTeamServices: 12.0.0.6315
X-Powered-By: ASP.NET
X-AspNet-Version: 2.0.50727
Set-Cookie: WSS_KeepSessionAuthenticated=80; path=/
Cache-Control: private, max-age=0
Expires: Sat, 18 Oct 2008 06:30:28 GMT
Last-Modified: Sun, 02 Nov 2008 06:30:28 GMT
Content-Type: text/html; charset=utf-8
Content-Length: 28419
---response end---
200 OK
Stored cookie some.sharepointsite.com -1 (ANY) / <session> <insecure> [expiry none] WSS_KeepSessionAuthenticated 80
Length: 28,419 (28K) [text/html]
100%[====================================>] 28,419 --.--K/s
01:30:29 (112.46 MB/s) - `default.aspx' saved [28419/28419]

Recently, I downloaded the new 1.11.4 wget from the mirrors and wanted to use some of the new features (like --ignore-case). But now, when I attempt to run the exact same command, I get this:

C:\wget1.11.4>wget -v -d --http-user="DOMAIN\First_Last" --http-password="[EMAIL PROTECTED]" --no-proxy http://some.sharepointsite.com
Setting --http-user (httpuser) to DOMAIN\First_Last
Setting --http-password (httppassword) to [EMAIL PROTECTED]
Setting --proxy (useproxy) to 0
DEBUG output created by Wget 1.11.4 on Windows-MSVC.

--2008-11-02 01:38:24--  http://some.sharepointsite.com/
Host `some.sharepointsite.com' has not issued a general basic challenge.
Resolving some.sharepointsite.com... seconds 0.00, XXX.XXX.XXX.XXX
Caching some.sharepointsite.com => XXX.XXX.XXX.XXX
Connecting to some.sharepointsite.com|XXX.XXX.XXX.XXX|:80... seconds 0.00, connected.
Created socket 1892.
Releasing 0x003e2b68 (new refcount 1).

---request begin---
GET / HTTP/1.0
User-Agent: Wget/1.11.4
Accept: */*
Host: some.sharepointsite.com
Connection: Keep-Alive
---request end---
HTTP request sent, awaiting response...

---response begin---
HTTP/1.1 401 Access denied
Connection: close
Date: Sun, 02 Nov 2008 06:38:24 GMT
Server: Microsoft-IIS/6.0
WWW-Authenticate: NTLM
MicrosoftSharePointTeamServices: 12.0.0.6315
X-Powered-By: ASP.NET
Content-type: text/html
---response end---
401 Access denied
Closed fd 1892
Unknown authentication scheme.
Authorization failed.

It seems that the new 1.11.4 build does not complete the multi-step NTLM handshake the way 1.10.2 does: after the server's initial "WWW-Authenticate: NTLM" challenge it reports "Unknown authentication scheme" and never sends a type-1 message. Has anyone else seen similar behavior, or could anyone suggest a few troubleshooting steps to point me in the right direction?

I did find a discussion here, but I was unable to validate it:
http://www.experts-exchange.com/Networking/Linux_Networking/Q_23371057.html?sfQueryTermInfo=1+ntlm+wget

Also, as you can tell from the examples, I've modified the username, but it is an Active Directory account, so it does have the "\" between the DOMAIN and First_Last. In addition, the password I am using contains the special characters "$" and "@". Both work with the 1.10.2 build, but not with the new 1.11.4.

Thanks in advance!

BJ Hoffpauir
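
P.S. For anyone comparing the two traces, here is a rough Python sketch for checking which step of the NTLM handshake a given "Authorization: NTLM ..." or "WWW-Authenticate: NTLM ..." token represents. It is only an illustration, not anything wget itself does: the helper name parse_ntlm_token is made up, and the layout it assumes (an 8-byte "NTLMSSP\0" signature, a little-endian 32-bit message type, and, for type-1 messages, a 32-bit flags field) is the commonly documented NTLMSSP header. The token is the type-1 message copied from the 1.10.2 request above; the type-2 and type-3 tokens in the log were damaged by line wrapping, so they are not decoded here.

```python
import base64
import struct

# Every NTLMSSP message is assumed to start with this 8-byte signature.
NTLMSSP_SIGNATURE = b"NTLMSSP\x00"


def parse_ntlm_token(token):
    """Return (message_type, flags) for a base64-encoded NTLM token.

    parse_ntlm_token is a hypothetical helper written for this post.
    flags is only read for type-1 (negotiate) messages, where it
    directly follows the message type; otherwise it is None.
    """
    raw = base64.b64decode(token)
    if raw[:8] != NTLMSSP_SIGNATURE:
        raise ValueError("not an NTLMSSP message")
    # Message type: 32-bit little-endian integer at offset 8.
    (message_type,) = struct.unpack_from("<I", raw, 8)
    flags = None
    if message_type == 1:
        # Negotiate flags: 32-bit little-endian integer at offset 12.
        (flags,) = struct.unpack_from("<I", raw, 12)
    return message_type, flags


# Type-1 token sent by wget 1.10.2 in the trace above.
TYPE1_FROM_LOG = "TlRMTVNTUAABAAAAAgIAAAAAAAAgAAAAAAAAACAAAAA="

if __name__ == "__main__":
    message_type, flags = parse_ntlm_token(TYPE1_FROM_LOG)
    print("NTLM message type:", message_type)
    print("negotiate flags: 0x%08x" % flags)
```

Running it against the 1.10.2 token should identify it as a type-1 (negotiate) message. The 1.11.4 trace contains no such token at all, which matches the observation that the newer build stops before the handshake begins.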