[
https://issues.apache.org/jira/browse/NUTCH-2139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14959186#comment-14959186
]
ASF GitHub Bot commented on NUTCH-2139:
---------------------------------------
GitHub user jorgelbg opened a pull request:
https://github.com/apache/nutch/pull/78
Fix for NUTCH-2139 contributed by jorgelbg
Basic indexing capabilities for inlinks and outlinks.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/jorgelbg/nutch NUTCH-2139
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/nutch/pull/78.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #78
----
commit f1d16ac509146aada0817d58d40bbcbfd0bad44d
Author: Jorge Luis Betancourt <[email protected]>
Date: 2015-10-15T16:34:37Z
Fix for NUTCH-2139 contributed by jorgelbg
----
> Basic plugin to index inlinks and outlinks
> ------------------------------------------
>
> Key: NUTCH-2139
> URL: https://issues.apache.org/jira/browse/NUTCH-2139
> Project: Nutch
> Issue Type: Improvement
> Components: indexer, plugin
> Reporter: Jorge Luis Betancourt Gonzalez
> Priority: Minor
> Labels: link, plugin
> Fix For: 1.11
>
>
> Basic plugin that allows to index the inlinks and outlinks of the web pages,
> this could be very useful for analytic purposes, including neat
> visualizations using d3.js for instance.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)