I'm using Apache Nutch version 1.5. I've just hosted the site in my local environment and am trying to index it. I added the entries to C:\nutch\local\conf\regex-urlfilter.txt and C:\nutch\local\conf\crawl-urlfilter.txt as specified in my post above.
-- View this message in context: http://lucene.472066.n3.nabble.com/Malformed-URL-skipping-java-net-MalformedURLException-tp3590159p4005804.html Sent from the Nutch - User mailing list archive at Nabble.com.

