LinkDb (invertlinks) should inform the user when it ignores internal links
--------------------------------------------------------------------------
Key: NUTCH-1090
URL: https://issues.apache.org/jira/browse/NUTCH-1090
Project: Nutch
Issue Type: Improvement
Components: linkdb
Affects Versions: 1.3
Reporter: Marek Bachmann
Priority: Trivial
Fix For: 1.3
I used Nutch to crawl sites on a single domain. After the crawl completed, I
tried to build a LinkDb, but the resulting LinkDb was empty.
It turns out that this happens because the invertlinks command ignores internal
links (links that point to the same domain) by default.
Unfortunately, the LinkDb class does not report this anywhere, so it was hard
to find out why the LinkDb was empty.
I suggest adding a message that informs the user when the invertlinks command
is ignoring internal links.
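
To illustrate the kind of notice meant here, below is a minimal, standalone
sketch in plain Java. It is not the actual LinkDb code: the class name
InternalLinkNotice, its methods, and the message wording are all hypothetical,
and it treats a link as internal when both URLs have the same host, which is
an assumption about how "internal" is decided. As far as I know, the behaviour
is controlled by the db.ignore.internal.links property; here it is modelled as
a plain boolean argument.

```java
import java.net.URI;
import java.util.Arrays;
import java.util.List;
import java.util.Locale;

// Hypothetical helper, not part of Nutch: shows when a link would count as
// internal and what a user-facing notice could look like.
class InternalLinkNotice {

    // Returns the lower-cased host of a URL, or null if it cannot be parsed.
    static String hostOf(String url) {
        try {
            String host = URI.create(url).getHost();
            return host == null ? null : host.toLowerCase(Locale.ROOT);
        } catch (IllegalArgumentException e) {
            return null;
        }
    }

    // Assumption: a link is internal when source and target share a host.
    static boolean isInternal(String fromUrl, String toUrl) {
        String from = hostOf(fromUrl);
        String to = hostOf(toUrl);
        return from != null && from.equals(to);
    }

    // Counts the outlinks that would be dropped when internal links are ignored.
    static int countIgnored(String fromUrl, List<String> outlinks, boolean ignoreInternal) {
        if (!ignoreInternal) {
            return 0;
        }
        int ignored = 0;
        for (String to : outlinks) {
            if (isInternal(fromUrl, to)) {
                ignored++;
            }
        }
        return ignored;
    }

    // Hypothetical wording for the message the job could log at startup.
    static String notice(boolean ignoreInternal) {
        return ignoreInternal
            ? "LinkDb: internal links will be ignored."
            : "LinkDb: internal links will be kept.";
    }

    public static void main(String[] args) {
        List<String> outlinks = Arrays.asList(
            "http://example.org/about",
            "http://example.org/contact",
            "http://other.example.com/");
        boolean ignoreInternal = true; // stands in for the configured default
        System.out.println(notice(ignoreInternal));
        System.out.println("Ignored outlinks: "
            + countIgnored("http://example.org/", outlinks, ignoreInternal));
    }
}
```

With a crawl restricted to one site, every outlink falls into the ignored
group, which is why the LinkDb ends up empty. A single log line like the one
above at the start of the job would make that visible.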