[jira] [Closed] (NUTCH-1340) Increase scalability by only removing markers when they actually exist for DbUpdaterReducer

2012-04-26 Thread Ferdy Galema (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1340?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ferdy Galema closed NUTCH-1340. --- Resolution: Fixed Increase scalability by only removing markers when they actually exist for

[jira] [Updated] (NUTCH-1340) Increase scalability by only removing markers when they actually exist for DbUpdaterReducer

2012-04-26 Thread Ferdy Galema (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1340?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ferdy Galema updated NUTCH-1340: Attachment: NUTCH-1340-v2.txt v2 of patch, including javadoc. This patch increases performance,

[jira] [Commented] (NUTCH-882) Design a Host table in GORA

2012-04-26 Thread Ferdy Galema (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-882?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13262488#comment-13262488 ] Ferdy Galema commented on NUTCH-882: Committed. I realize that the current state is far

[jira] [Resolved] (NUTCH-882) Design a Host table in GORA

2012-04-26 Thread Ferdy Galema (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ferdy Galema resolved NUTCH-882. Resolution: Fixed Design a Host table in GORA ---

[jira] [Commented] (NUTCH-902) Add all necessary files and configuration so that nutch can be used with different backends out-of-the-box

2012-04-26 Thread Ferdy Galema (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-902?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13262496#comment-13262496 ] Ferdy Galema commented on NUTCH-902: I think nutch-default.xml does not correctly use

[jira] [Commented] (NUTCH-882) Design a Host table in GORA

2012-04-26 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-882?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13262500#comment-13262500 ] Julien Nioche commented on NUTCH-882: - Ferdy I'll let you close it. I don't have time

[jira] [Commented] (NUTCH-902) Add all necessary files and configuration so that nutch can be used with different backends out-of-the-box

2012-04-26 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-902?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13262503#comment-13262503 ] Lewis John McGibbney commented on NUTCH-902: Yeah +1. Is there anything else

[jira] [Commented] (NUTCH-1189) add commented out default settings to gora.properties files

2012-04-26 Thread Ferdy Galema (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1189?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13262506#comment-13262506 ] Ferdy Galema commented on NUTCH-1189: - FYI: I just committed a change to update the

[jira] [Closed] (NUTCH-882) Design a Host table in GORA

2012-04-26 Thread Ferdy Galema (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ferdy Galema closed NUTCH-882. -- Ok. Thanks to anyone who was involved. Design a Host table in GORA

[jira] [Updated] (NUTCH-1205) Upgrade gora modules to 0.2 in ivy/ivy.xml

2012-04-26 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1205: Summary: Upgrade gora modules to 0.2 in ivy/ivy.xml (was: Upgrade gora modules to

[jira] [Closed] (NUTCH-1290) crawlId not supported by all Tools

2012-04-26 Thread Ferdy Galema (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ferdy Galema closed NUTCH-1290. --- Resolution: Fixed crawlId not supported by all Tools --

[jira] [Commented] (NUTCH-902) Add all necessary files and configuration so that nutch can be used with different backends out-of-the-box

2012-04-26 Thread Ferdy Galema (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-902?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13262548#comment-13262548 ] Ferdy Galema commented on NUTCH-902: Alright I'll change and commit the

[jira] [Commented] (NUTCH-902) Add all necessary files and configuration so that nutch can be used with different backends out-of-the-box

2012-04-26 Thread Ferdy Galema (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-902?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13262556#comment-13262556 ] Ferdy Galema commented on NUTCH-902: Ok done. (Note that I did not actually check the

[jira] [Commented] (NUTCH-879) URL-s getting lost

2012-04-26 Thread Ferdy Galema (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-879?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13262558#comment-13262558 ] Ferdy Galema commented on NUTCH-879: This a pretty old issue. Nevertheless the bug

[jira] [Commented] (NUTCH-1306) Commit after finished writing to solr index

2012-04-26 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1306?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13262592#comment-13262592 ] Lewis John McGibbney commented on NUTCH-1306: - Having reviewed similar work

[jira] [Commented] (NUTCH-1293) IndexingFiltersChecker to store detected content type in crawldatum metadata

2012-04-26 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1293?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13263124#comment-13263124 ] Sebastian Nagel commented on NUTCH-1293: The content type should be added to

[jira] [Updated] (NUTCH-1205) Upgrade gora modules to 0.2 in ivy/ivy.xml

2012-04-26 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1205: Attachment: NUTCH-1205-v6.patch Hi Ferdy. If you would be so good to look at the

[jira] [Issue Comment Edited] (NUTCH-1205) Upgrade gora modules to 0.2 in ivy/ivy.xml

2012-04-26 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13263196#comment-13263196 ] Lewis John McGibbney edited comment on NUTCH-1205 at 4/26/12 10:15 PM:

[jira] [Commented] (NUTCH-1189) add commented out default settings to gora.properties files

2012-04-26 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1189?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13263362#comment-13263362 ] Hudson commented on NUTCH-1189: --- Integrated in Nutch-nutchgora #240 (See

[jira] [Commented] (NUTCH-882) Design a Host table in GORA

2012-04-26 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-882?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13263363#comment-13263363 ] Hudson commented on NUTCH-882: -- Integrated in Nutch-nutchgora #240 (See

[jira] [Commented] (NUTCH-902) Add all necessary files and configuration so that nutch can be used with different backends out-of-the-box

2012-04-26 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-902?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13263365#comment-13263365 ] Hudson commented on NUTCH-902: -- Integrated in Nutch-nutchgora #240 (See

[jira] [Commented] (NUTCH-1340) Increase scalability by only removing markers when they actually exist for DbUpdaterReducer

2012-04-26 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13263364#comment-13263364 ] Hudson commented on NUTCH-1340: --- Integrated in Nutch-nutchgora #240 (See