What's the URL? I think someone else had a similar problem and it turned out to that the URL produced a redirect to URL containing a query string. Since Nutch was configured not to fetch URLs with query strings, it just failed.
Jake. -----Original Message----- From: Jon Shoberg [mailto:[EMAIL PROTECTED] Sent: Friday, September 23, 2005 12:27 PM To: [email protected] Subject: No external command defined for contentType: Anyone else get the message "No external command defined for contentType:" without any sort of MIME content type declaration? I can see HTML, PDF, and other documents getting fetched but failing on the parse with the above message. When I go directly to the server and manually get the document I see a valid MIME header for content type returned in the HTTP response header. Anyone else seen this? I'm fetching content but not parsing it reliably. -j
