Denis Pimenov wrote:

I tried the rule +^.* in crawl-urlfilter.txt, but it doesn't work: relative links are still not crawled, only absolute ones.
Hello

I am a newbie to Nutch. It seems to me that crawling does not follow relative URLs by default. How can I fix this?

For example, a relative link on my start page, <a href="/test/my.jsp">, is not crawled (although browsers open it with the proper prefix), but the absolute link <a href="http://mydomain.com:8080/test/my.jsp"> is crawled fine. Is there a configuration file or some other way to fix that? I have seen this question in the mail archive, but it wasn't answered.
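
For illustration, here is a minimal sketch of how a relative href is normally resolved against the URL of the page it appears on, which is the result a browser produces. It uses only the standard java.net.URI class and says nothing about how Nutch handles links internally. The class name RelativeLinkDemo, the helper resolve, and the start page path /index.jsp are assumptions made up for the example; only the host and the two hrefs come from the message above.

```java
import java.net.URI;

public class RelativeLinkDemo {

    // Hypothetical helper: resolve an href found on a page against that page's URL.
    static String resolve(String pageUrl, String href) {
        return URI.create(pageUrl).resolve(href).toString();
    }

    public static void main(String[] args) {
        // Assumed start page; the real path is not given in the message.
        String startPage = "http://mydomain.com:8080/index.jsp";

        // The relative link takes its scheme, host, and port from the start page.
        System.out.println(resolve(startPage, "/test/my.jsp"));

        // An absolute link is returned unchanged.
        System.out.println(resolve(startPage, "http://mydomain.com:8080/test/my.jsp"));
    }
}
```

Both calls should yield the same absolute URL, so a crawler that resolves links this way would treat the two forms of the link identically.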

Denis Pimenov

