Denis Pimenov wrote:
I tried +^.* in crawl-urlfilter.txt, but it doesn't work: Nutch still
crawls only absolute links, not relative ones.
Hello,
I am a newbie in Nutch. It seems to me that crawling does not follow
relative URLs by default. How can I fix it?
For example, a relative link on the start page, <a
href="/test/my.jsp">, is not crawled (although browsers open it with
the proper prefix), but a link like <a
href="http://mydomain.com:8080/test/my.jsp"> is crawled fine. Is
there a configuration file or something else to fix that? I have
seen this question in the mail archive, but it wasn't answered.
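
To make the expected behaviour concrete, here is a minimal Java sketch of how the relative link should resolve against the page it appears on. It is not Nutch code; it only uses the standard java.net.URI class. The class name RelativeLinkDemo, the helper resolve, and the start page path /index.jsp are assumptions made up for illustration.

```java
import java.net.URI;

public class RelativeLinkDemo {
    // Resolve an href against the URL of the page that contains it,
    // following the standard URI reference-resolution rules.
    static String resolve(String pageUrl, String href) {
        return URI.create(pageUrl).resolve(href).toString();
    }

    public static void main(String[] args) {
        // Assumed start page; only the host and port come from the example above.
        String page = "http://mydomain.com:8080/index.jsp";

        // Relative link: should end up under the same host and port.
        System.out.println(resolve(page, "/test/my.jsp"));

        // Absolute link: resolution leaves it unchanged.
        System.out.println(resolve(page, "http://mydomain.com:8080/test/my.jsp"));
    }
}
```

Both calls should yield the same absolute URL, so I would expect the crawler to treat the two links identically.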
Denis Pimenov