[
https://issues.apache.org/jira/browse/NUTCH-1306?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ferdy Galema updated NUTCH-1306:
Attachment: NUTCH-1306-trunk-v2.patch
Heh indeed that's not ready for committing yet. Weird though
[
https://issues.apache.org/jira/browse/NUTCH-1362?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ferdy Galema updated NUTCH-1362:
Attachment: NUTCH-1362.patch
Hey Lewis,
This patches fixes the problem and makes the reversing a
[
https://issues.apache.org/jira/browse/NUTCH-1362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13273129#comment-13273129
]
Ferdy Galema commented on NUTCH-1362:
-
Btw this is a duplicate of NUTCH-1077.
[
https://issues.apache.org/jira/browse/NUTCH-1077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ferdy Galema closed NUTCH-1077.
---
Resolution: Duplicate
Fix Version/s: (was: 2.1)
Will be fixed with NUTCH-1362. (Use
[
https://issues.apache.org/jira/browse/NUTCH-1086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Lewis John McGibbney updated NUTCH-1086:
Affects Version/s: 1.5
nutchgora
Fix Version/s: 2.1
[
https://issues.apache.org/jira/browse/NUTCH-1362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13273143#comment-13273143
]
Lewis John McGibbney commented on NUTCH-1362:
-
+1 to commit Ferdy. I am happy
[
https://issues.apache.org/jira/browse/NUTCH-1362?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ferdy Galema closed NUTCH-1362.
---
Resolution: Fixed
Done! Thanks.
Fix error handling of urls with empty fields
Ferdy Galema created NUTCH-1366:
---
Summary: speed up indexing by eliminating the indexreducer
Key: NUTCH-1366
URL: https://issues.apache.org/jira/browse/NUTCH-1366
Project: Nutch
Issue Type:
[
https://issues.apache.org/jira/browse/NUTCH-1366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ferdy Galema updated NUTCH-1366:
Attachment: NUTCH-1366.patch
speed up indexing by eliminating the indexreducer
[
https://issues.apache.org/jira/browse/NUTCH-1366?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13273318#comment-13273318
]
Markus Jelsma commented on NUTCH-1366:
--
Cool!
This indeed does not apply to trunk
[
https://issues.apache.org/jira/browse/NUTCH-1366?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13273335#comment-13273335
]
Ferdy Galema commented on NUTCH-1366:
-
The cool part about Nutchgora is that inlinks
[
https://issues.apache.org/jira/browse/NUTCH-1366?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13273354#comment-13273354
]
Markus Jelsma commented on NUTCH-1366:
--
Ah i see! This would mean the WebGraph can
Lewis John McGibbney created NUTCH-1367:
---
Summary: Port ParserChecker to Nutchgora
Key: NUTCH-1367
URL: https://issues.apache.org/jira/browse/NUTCH-1367
Project: Nutch
Issue Type: New
[
https://issues.apache.org/jira/browse/NUTCH-1362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13273829#comment-13273829
]
Hudson commented on NUTCH-1362:
---
Integrated in Nutch-nutchgora #250 (See
See https://builds.apache.org/job/nutch-trunk-maven/264/
--
Started by timer
Building remotely on ubuntu2 in workspace
https://builds.apache.org/job/nutch-trunk-maven/ws/
hudson.util.IOException2: remote file operation failed:
15 matches
Mail list logo