[jira] Commented: (NUTCH-693) Add configurable option for treating nofollow behaviour.

2010-03-19 Thread Andrzej Bialecki (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12847291#action_12847291 ] Andrzej Bialecki commented on NUTCH-693: - Thanks for the pointer to the article.

[jira] Updated: (NUTCH-693) Add configurable option for treating nofollow behaviour.

2010-03-19 Thread Andrzej Bialecki (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-693?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrzej Bialecki updated NUTCH-693: Assignee: (was: Otis Gospodnetic) Add configurable option for treating nofollow

[jira] Assigned: (NUTCH-797) parse-tika is not properly constructing URLs when the target begins with a ?

2010-03-19 Thread Andrzej Bialecki (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-797?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrzej Bialecki reassigned NUTCH-797: --- Assignee: Andrzej Bialecki parse-tika is not properly constructing URLs when the

[jira] Commented: (NUTCH-797) parse-tika is not properly constructing URLs when the target begins with a ?

2010-03-19 Thread Andrzej Bialecki (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-797?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12847300#action_12847300 ] Andrzej Bialecki commented on NUTCH-797: - If there are no futher comments I'm going

[jira] Updated: (NUTCH-787) Upgrade Lucene to 3.0.1.

2010-03-19 Thread Andrzej Bialecki (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-787?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrzej Bialecki updated NUTCH-787: Assignee: Andrzej Bialecki Summary: Upgrade Lucene to 3.0.1. (was: Upgrade Lucene to

[jira] Commented: (NUTCH-787) Upgrade Lucene to 3.0.0.

2010-03-19 Thread Andrzej Bialecki (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-787?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12847315#action_12847315 ] Andrzej Bialecki commented on NUTCH-787: - Using Lucene 3.0.1 artifacts I verified

[jira] Commented: (NUTCH-787) Upgrade Lucene to 3.0.1.

2010-03-19 Thread Dawid Weiss (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-787?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12847325#action_12847325 ] Dawid Weiss commented on NUTCH-787: --- Thanks Andrzej. Upgrade Lucene to 3.0.1.

[jira] Closed: (NUTCH-787) Upgrade Lucene to 3.0.1.

2010-03-19 Thread Andrzej Bialecki (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-787?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrzej Bialecki closed NUTCH-787. --- Resolution: Fixed Committed. Thanks Dawid! Upgrade Lucene to 3.0.1.

[jira] Created: (NUTCH-803) Upgrade Hadoop to 0.20.2

2010-03-19 Thread Andrzej Bialecki (JIRA)
Upgrade Hadoop to 0.20.2 Key: NUTCH-803 URL: https://issues.apache.org/jira/browse/NUTCH-803 Project: Nutch Issue Type: Improvement Affects Versions: 1.1 Reporter: Andrzej Bialecki

[jira] Closed: (NUTCH-803) Upgrade Hadoop to 0.20.2

2010-03-19 Thread Andrzej Bialecki (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-803?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrzej Bialecki closed NUTCH-803. --- Resolution: Fixed All tests pass - committed. Upgrade Hadoop to 0.20.2

[jira] Updated: (NUTCH-740) Configuration option to override default language for fetched pages.

2010-03-19 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche updated NUTCH-740: Attachment: NUTCH-740.patch Slightly modified version of the patch with modifs for protocol-http.

[DISCUSS] Nutch as a top level project (TLP)?

2010-03-19 Thread Andrzej Bialecki
Hi devs, The ASF Board indicated recently that so called umbrella projects, i.e. projects that host many significant sub-projects, should examine their structure towards simplification, such as merging or splitting out sub-projects. Lucene TLP is such a project. Recently the Lucene PMC

[jira] Commented: (NUTCH-787) Upgrade Lucene to 3.0.1.

2010-03-19 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-787?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12847709#action_12847709 ] Hudson commented on NUTCH-787: -- Integrated in Nutch-trunk #1101 (See

[jira] Commented: (NUTCH-803) Upgrade Hadoop to 0.20.2

2010-03-19 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-803?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12847710#action_12847710 ] Hudson commented on NUTCH-803: -- Integrated in Nutch-trunk #1101 (See