[jira] Closed: (NUTCH-704) ensure that more important pages are crawled first

2009-02-26 Thread Andrzej Bialecki (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-704?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrzej Bialecki closed NUTCH-704. --- Resolution: Invalid Please see the ScoringFilter framework, and the

[Nutch Wiki] Update of DownloadingNutch by BartoszGadzimski

2009-02-26 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on Nutch Wiki for change notification. The following page has been changed by BartoszGadzimski: http://wiki.apache.org/nutch/DownloadingNutch -- You

[Nutch Wiki] Update of SimpleMapReduceTutorial by BartoszGadzimski

2009-02-26 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on Nutch Wiki for change notification. The following page has been changed by BartoszGadzimski: http://wiki.apache.org/nutch/SimpleMapReduceTutorial The comment on the change is: It is not map reduce tutorial, it's only

[Nutch Wiki] Trivial Update of FrontPage by BartoszGadzimski

2009-02-26 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on Nutch Wiki for change notification. The following page has been changed by BartoszGadzimski: http://wiki.apache.org/nutch/FrontPage -- *

[jira] Created: (NUTCH-705) parse-rtf plugin

2009-02-26 Thread Dmitry Lihachev (JIRA)
parse-rtf plugin Key: NUTCH-705 URL: https://issues.apache.org/jira/browse/NUTCH-705 Project: Nutch Issue Type: New Feature Components: fetcher Affects Versions: 1.0.0 Reporter: Dmitry Lihachev

[jira] Commented: (NUTCH-705) parse-rtf plugin

2009-02-26 Thread Dmitry Lihachev (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12677242#action_12677242 ] Dmitry Lihachev commented on NUTCH-705: --- This parser correctly handles non ascii input

[jira] Updated: (NUTCH-705) parse-rtf plugin

2009-02-26 Thread Dmitry Lihachev (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-705?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dmitry Lihachev updated NUTCH-705: -- Attachment: NUTCH-705.patch parse-rtf plugin Key:

[jira] Commented: (NUTCH-644) RTF parser doesn't compile anymore

2009-02-26 Thread Dmitry Lihachev (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-644?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12677244#action_12677244 ] Dmitry Lihachev commented on NUTCH-644: --- this parser incorrectly handles non-ascii

[jira] Commented: (NUTCH-185) XMLParser is configurable xml parser plugin.

2009-02-26 Thread Gopikrishnan (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12677261#action_12677261 ] Gopikrishnan commented on NUTCH-185: Building XMLParser plugin with the latest (1.0-dev)

[jira] Resolved: (NUTCH-699) Add an official solr schema for solr integration

2009-02-26 Thread Sami Siren (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-699?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sami Siren resolved NUTCH-699. -- Resolution: Fixed committed Add an official solr schema for solr integration

[jira] Assigned: (NUTCH-669) Consolidate code for Fetcher and Fetcher2

2009-02-26 Thread Sami Siren (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sami Siren reassigned NUTCH-669: Assignee: Sami Siren Consolidate code for Fetcher and Fetcher2

[jira] Commented: (NUTCH-703) Upgrade to Hadoop 0.19.1

2009-02-26 Thread Sami Siren (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12677266#action_12677266 ] Sami Siren commented on NUTCH-703: -- Andrzej, are you working with this now? Upgrade to

Re: [jira] Commented: (NUTCH-703) Upgrade to Hadoop 0.19.1

2009-02-26 Thread Andrzej Bialecki
Sami Siren (JIRA) wrote: [ https://issues.apache.org/jira/browse/NUTCH-703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12677266#action_12677266 ] Sami Siren commented on NUTCH-703: -- Andrzej, are you working with