[jira] Commented: (NUTCH-753) Prevent new Fetcher to retrieve the robots twice

2009-11-28 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-753?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12783235#action_12783235 ] Hudson commented on NUTCH-753: -- Integrated in Nutch-trunk #995 (See

[jira] Commented: (NUTCH-773) some minor bugs in AbstractFetchSchedule.java

2009-11-28 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12783238#action_12783238 ] Hudson commented on NUTCH-773: -- Integrated in Nutch-trunk #995 (See

[jira] Commented: (NUTCH-772) Upgrade Nutch to use Lucene 2.9.1

2009-11-28 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12783236#action_12783236 ] Hudson commented on NUTCH-772: -- Integrated in Nutch-trunk #995 (See

[jira] Commented: (NUTCH-760) Allow field mapping from nutch to solr index

2009-11-28 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12783237#action_12783237 ] Hudson commented on NUTCH-760: -- Integrated in Nutch-trunk #995 (See

[jira] Commented: (NUTCH-765) Allow Crawl class to call Either Solr or Lucene Indexer

2009-11-28 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-765?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12783234#action_12783234 ] Hudson commented on NUTCH-765: -- Integrated in Nutch-trunk #995 (See

[jira] Commented: (NUTCH-761) Avoid cloningCrawlDatum in CrawlDbReducer

2009-11-28 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-761?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12783239#action_12783239 ] Hudson commented on NUTCH-761: -- Integrated in Nutch-trunk #995 (See

[jira] Updated: (NUTCH-770) Timebomb for Fetcher

2009-11-28 Thread MilleBii (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-770?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] MilleBii updated NUTCH-770: --- Attachment: log-770 Please find the logs of the patch... I did effectively try it but I could not compile

[jira] Updated: (NUTCH-769) Fetcher to skip queues for URLS getting repeated exceptions

2009-11-28 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-769?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche updated NUTCH-769: Attachment: NUTCH-769-2.patch Fetcher to skip queues for URLS getting repeated exceptions

[jira] Commented: (NUTCH-769) Fetcher to skip queues for URLS getting repeated exceptions

2009-11-28 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12783247#action_12783247 ] Julien Nioche commented on NUTCH-769: - Missed a couple of lines indeed when I was trying

[jira] Commented: (NUTCH-770) Timebomb for Fetcher

2009-11-28 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12783248#action_12783248 ] Julien Nioche commented on NUTCH-770: - The log simply shows that the patch has not been

[jira] Commented: (NUTCH-770) Timebomb for Fetcher

2009-11-28 Thread MilleBii (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12783252#action_12783252 ] MilleBii commented on NUTCH-770: That's what I did and just retried ... so I'm a bit

[jira] Commented: (NUTCH-770) Timebomb for Fetcher

2009-11-28 Thread Andrzej Bialecki (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12783283#action_12783283 ] Andrzej Bialecki commented on NUTCH-770: - I propose to change the name of this

[jira] Closed: (NUTCH-746) NutchBeanConstructor does not close NutchBean upon contextDestroyed, causing resource leak in the container.

2009-11-28 Thread Andrzej Bialecki (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-746?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrzej Bialecki closed NUTCH-746. --- Resolution: Fixed Assignee: Andrzej Bialecki NutchBeanConstructor does not close

[jira] Commented: (NUTCH-746) NutchBeanConstructor does not close NutchBean upon contextDestroyed, causing resource leak in the container.

2009-11-28 Thread Andrzej Bialecki (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12783287#action_12783287 ] Andrzej Bialecki commented on NUTCH-746: - Fixed in rev. 885148. Thanks!

[jira] Closed: (NUTCH-738) Close SegmentUpdater when FetchedSegments is closed

2009-11-28 Thread Andrzej Bialecki (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrzej Bialecki closed NUTCH-738. --- Resolution: Fixed Assignee: Andrzej Bialecki Close SegmentUpdater when

[jira] Closed: (NUTCH-739) SolrDeleteDuplications too slow when using hadoop

2009-11-28 Thread Andrzej Bialecki (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-739?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrzej Bialecki closed NUTCH-739. --- Resolution: Fixed Assignee: Andrzej Bialecki SolrDeleteDuplications too slow when

[jira] Commented: (NUTCH-739) SolrDeleteDuplications too slow when using hadoop

2009-11-28 Thread Andrzej Bialecki (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12783290#action_12783290 ] Andrzej Bialecki commented on NUTCH-739: - Fixed in rev. 885152. Thank you!

[jira] Closed: (NUTCH-755) DomainURLFilter crashes on malformed URL

2009-11-28 Thread Andrzej Bialecki (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-755?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrzej Bialecki closed NUTCH-755. --- Resolution: Cannot Reproduce Assignee: Andrzej Bialecki DomainURLFilter crashes on

[jira] Commented: (NUTCH-755) DomainURLFilter crashes on malformed URL

2009-11-28 Thread Andrzej Bialecki (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-755?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12783299#action_12783299 ] Andrzej Bialecki commented on NUTCH-755: - I could not verify that the filter indeed

[jira] Commented: (NUTCH-692) AlreadyBeingCreatedException with Hadoop 0.19

2009-11-28 Thread Andrzej Bialecki (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12783302#action_12783302 ] Andrzej Bialecki commented on NUTCH-692: - We should review this issue after the

[jira] Commented: (NUTCH-741) Job file includes multiple copies of nutch config files.

2009-11-28 Thread Andrzej Bialecki (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-741?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12783304#action_12783304 ] Andrzej Bialecki commented on NUTCH-741: - Fixed in rev. 885156. Thank you! Job

[jira] Closed: (NUTCH-741) Job file includes multiple copies of nutch config files.

2009-11-28 Thread Andrzej Bialecki (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrzej Bialecki closed NUTCH-741. --- Resolution: Fixed Fix Version/s: 1.1 Assignee: Andrzej Bialecki Job file

[jira] Closed: (NUTCH-712) ParseOutputFormat should catch java.net.MalformedURLException coming from normalizers

2009-11-28 Thread Andrzej Bialecki (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrzej Bialecki closed NUTCH-712. --- Resolution: Fixed Fix Version/s: 1.1 Assignee: Andrzej Bialecki

[jira] Commented: (NUTCH-712) ParseOutputFormat should catch java.net.MalformedURLException coming from normalizers

2009-11-28 Thread Andrzej Bialecki (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-712?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12783306#action_12783306 ] Andrzej Bialecki commented on NUTCH-712: - Fixed in rev. 885159. Thank you!

[Nutch Wiki] Trivial Update of Automating_Fetches_wi th_Python by newacct

2009-11-28 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on Nutch Wiki for change notification. The Automating_Fetches_with_Python page has been changed by newacct. http://wiki.apache.org/nutch/Automating_Fetches_with_Python?action=diffrev1=5rev2=6

[jira] Commented: (NUTCH-738) Close SegmentUpdater when FetchedSegments is closed

2009-11-28 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-738?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12783359#action_12783359 ] Hudson commented on NUTCH-738: -- Integrated in Nutch-trunk #996 (See

[jira] Commented: (NUTCH-741) Job file includes multiple copies of nutch config files.

2009-11-28 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-741?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12783357#action_12783357 ] Hudson commented on NUTCH-741: -- Integrated in Nutch-trunk #996 (See

[jira] Commented: (NUTCH-712) ParseOutputFormat should catch java.net.MalformedURLException coming from normalizers

2009-11-28 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-712?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12783360#action_12783360 ] Hudson commented on NUTCH-712: -- Integrated in Nutch-trunk #996 (See

[jira] Commented: (NUTCH-746) NutchBeanConstructor does not close NutchBean upon contextDestroyed, causing resource leak in the container.

2009-11-28 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12783356#action_12783356 ] Hudson commented on NUTCH-746: -- Integrated in Nutch-trunk #996 (See

[jira] Commented: (NUTCH-739) SolrDeleteDuplications too slow when using hadoop

2009-11-28 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12783358#action_12783358 ] Hudson commented on NUTCH-739: -- Integrated in Nutch-trunk #996 (See