Bugs item #978614, was opened at 2004-06-24 01:26
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=491356&aid=978614&group_id=59548

Category: fetcher
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Lars Aronsson (aronsson)
Assigned to: Nobody/Anonymous (nobody)
Summary: Redirect to local URL "MalformedURLException: no protocol"

Initial Comment:
This website makes a redirect to a local URL
("/filename"), but the nutch crawler wants a protocol
at the beginning of the redirect URL (i.e.
"http://domain/filename";). From nutch's output:

040624 011812 fetching
http://susning.nu/Kurt_Vonnegut/Slakthus_5
040624 011812 fetch of
http://susning.nu/Kurt_Vonnegut/Slakthus_5 failed with:
java.net.MalformedURLException: no protocol:
/susning.fcgi?action=browse&id=Bok/Slakthus_5&oldid=Kurt_Vonnegut/Slakthus_5

That's my website. You can try that URL for a test case.

This happened with nutch-0.4 and the command line
"nutch crawl urls -delay 3 -depth 3"

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=491356&aid=978614&group_id=59548


-------------------------------------------------------
This SF.Net email sponsored by Black Hat Briefings & Training.
Attend Black Hat Briefings & Training, Las Vegas July 24-29 - 
digital self defense, top technical experts, no vendor pitches, 
unmatched networking opportunities. Visit www.blackhat.com
_______________________________________________
Nutch-developers mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to