Marko
Do you crawl the intranet or do you crawl the web? If you crawl the
web then you must edit the urlfilter-regex.txt and not the crawl-
urlfilter.txt.
In your first mail you said you get an exception like
"org.apache.nutch.net.URLFilter not found". Does the exception still
occur?
- Re: URL containing "?", "&" and &q... Marko Bauhardt
- Re: URL containing "?", "&" a... Vertical Search
- Re: URL containing "?", "&" a... Vertical Search
- Re: URL containing "?", "&" a... Vertical Search
- crawling etiquette Howie Wang
