Thanks for the reply. I tried your recommended solution, but it still does not crawl it. When I run the crawler's main method standalone with the URL "http://www.lequipe.fr/Football/", it works fine: outlinks, content... everything that was expected. Even if I use "http://www.lequipe.fr/Football/index.html" it doesn't.
What else could be the reason? Is there a better way to log the crawler's decisions?