Hey all,

has anyone tried crawling pages with URL parameters? I got stuck on a page (let's call it '/dir/page.jsp') that contains a link like

<a href="?param=value">my link text</a>

In the browser everything works fine: clicking the link opens the URL '/dir/page.jsp?param=value'. The Nutch crawler interprets the link differently, though. Nutch's result looks like '/dir/?param=value', so it tries to open the href target in the current directory instead of appending it to the current page as a browser would.

So the question is: who is wrong, all the browsers or the Nutch crawler/link interpreter? :)

Cheers,
mana
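
For comparison, here is a minimal sketch of how a query-only link resolves under RFC 3986 (section 5.2.2), using `urljoin` from Python's standard library rather than anything from Nutch. The host `example.com` and the helper name `resolve_link` are made up for illustration; only the path and the href come from the case described above.

```python
from urllib.parse import urljoin


def resolve_link(base_url: str, href: str) -> str:
    """Resolve an href against the URL of the page it appears on."""
    # For a reference with an empty path, urljoin keeps the base path
    # and swaps in the reference's query string.
    return urljoin(base_url, href)


# Hypothetical host; the path and href are the ones from the question.
page = "http://example.com/dir/page.jsp"
print(resolve_link(page, "?param=value"))
# -> http://example.com/dir/page.jsp?param=value

# The same case from the RFC 3986 reference examples (section 5.4.1).
print(resolve_link("http://a/b/c/d;p?q", "?y"))
# -> http://a/b/c/d;p?y
```

This matches what the browsers do: the last path segment is kept and only the query is replaced.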

