[ 
https://issues.apache.org/jira/browse/NUTCH-2202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16342749#comment-16342749
 ] 

ASF GitHub Bot commented on NUTCH-2202:
---------------------------------------

HansBrende commented on a change in pull request #97: NUTCH-2202 Integration of 
Anthelion (Focused Crawling Module) into Nutch
URL: https://github.com/apache/nutch/pull/97#discussion_r164313783
 
 

 ##########
 File path: src/plugin/build.xml
 ##########
 @@ -99,13 +101,15 @@
     <ant dir="urlnormalizer-querystring" target="deploy"/>
     <ant dir="urlnormalizer-regex" target="deploy"/>
     <ant dir="urlnormalizer-slash" target="deploy"/>
+>>>>>>> master
 
 Review comment:
   Also there's this ">>>>>>> master"...

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Integration of Anthelion (Focused Crawling Module) into Nutch
> -------------------------------------------------------------
>
>                 Key: NUTCH-2202
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2202
>             Project: Nutch
>          Issue Type: Improvement
>          Components: parser, scoring
>            Reporter: Robert Meusel
>            Assignee: Lewis John McGibbney
>            Priority: Major
>              Labels: any23, online_learning
>
> We have recently released anthelion, which is a focused crawler plugin for 
> structured data which can be extracted with any23. 
> (https://github.com/yahoo/anthelion) As proposed by Lewis (Lewis John 
> McGibbney) we think the integration of the parser (any23) and the scoring 
> function based on the online learner could be a good improvement for nutch. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to