Hello,

I would like to have Nutch crawl web sites that listen on ports other than port 80. To that end, I changed the regex-urlfilter.txt file so that it allows an optional port number in the URL. The URLs with high port numbers show up as candidates from my seed list, but they are never actually fetched. Could anybody help me understand what I should do?

Thanks,
Lee
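
For reference, here is a minimal sketch of the kind of rule described above, checked outside of Nutch. The pattern, the name `ACCEPT_RULE`, and the example.com URLs are assumptions made for illustration, not the actual rule from the poster's regex-urlfilter.txt. The pattern uses only syntax that Python and Java regular expressions share, and in the filter file it would sit on a line prefixed with `+`.

```python
import re

# Hypothetical accept rule: any http(s) host, optionally followed by ":<port>".
# Assumption for illustration only; not the poster's actual filter line.
ACCEPT_RULE = r"^https?://[a-zA-Z0-9.-]+(:[0-9]+)?(/|$)"


def accepts(url: str) -> bool:
    """Return True if the URL matches the hypothetical accept rule."""
    return re.search(ACCEPT_RULE, url) is not None


if __name__ == "__main__":
    # Sample URLs (made up) with and without an explicit port.
    for url in (
        "http://example.com/",
        "http://example.com:8080/index.html",
        "ftp://example.com/",
    ):
        print(url, accepts(url))
```

This only confirms that a pattern of this shape lets URLs with a port through; it does not show why the fetch step would skip them afterwards.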
- crawing for content on port 8080 Moore, Lee C
- Re: crawing for content on port 8080 Dennis Kubes
