[
https://issues.apache.org/jira/browse/NUTCH-443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Doğacan Güney updated NUTCH-443:
Attachment: NUTCH_443_reopened_v3.patch
New version against latest trunk.
Tested locally, seems
[
https://issues.apache.org/jira/browse/NUTCH-443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Andrzej Bialecki updated NUTCH-443:
Attachment: patch.txt
I'm not too happy with the direction you took in the latest patch.
[
https://issues.apache.org/jira/browse/NUTCH-443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Doğacan Güney updated NUTCH-443:
Attachment: redirect_and_index_v2.patch
New version. Moves parsing code into (content != null)
[
https://issues.apache.org/jira/browse/NUTCH-443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Doğacan Güney updated NUTCH-443:
Attachment: NUTCH-443.08052007.patch
Patch updated to latest trunk.
allow parsers to return
[
https://issues.apache.org/jira/browse/NUTCH-443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Doğacan Güney updated NUTCH-443:
Attachment: NUTCH-443.02282007-v2.patch
Yet another patch.
ParseResult.filter is out and Nutch no
[
https://issues.apache.org/jira/browse/NUTCH-443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chris A. Mattmann updated NUTCH-443:
Attachment: NUTCH-443.022507.patch.txt
Hi Folks,
Attached is a candidate patch for
[
https://issues.apache.org/jira/browse/NUTCH-443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Doğacan Güney updated NUTCH-443:
Attachment: NUTCH-443-draft-v7.patch
allow parsers to return multiple Parse object, this will
[
https://issues.apache.org/jira/browse/NUTCH-443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Dogacan Güney updated NUTCH-443:
Attachment: NUTCH-443-draft-v5.patch
New version. Now indexing also works but has a catch. Many
[
https://issues.apache.org/jira/browse/NUTCH-443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Dogacan Güney updated NUTCH-443:
Attachment: NUTCH-443-draft-v6.patch
Oops... I forgot to merge Renaud Richardet's work.
This is
[
https://issues.apache.org/jira/browse/NUTCH-443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Dogacan Güney updated NUTCH-443:
Attachment: NUTCH-443-draft-v1.patch
allow parsers to return multiple Parse object, this will
[
https://issues.apache.org/jira/browse/NUTCH-443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Dogacan Güney updated NUTCH-443:
Attachment: NUTCH-443-draft-v2.patch
Small update to the patch. Now all core junit tests pass.
[
https://issues.apache.org/jira/browse/NUTCH-443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Dogacan Güney updated NUTCH-443:
Attachment: NUTCH-443-draft-v3.patch
new patch, contains a possible fix for CrawlDbReducer problem.
[
https://issues.apache.org/jira/browse/NUTCH-443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Renaud Richardet updated NUTCH-443:
Attachment: NUTCH-443-draft-v4.patch
Hi Dogacan,
Thanks for merging the patches, good
[
https://issues.apache.org/jira/browse/NUTCH-443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Dogacan Güney updated NUTCH-443:
Attachment: parse-map-core-untested.patch
allow parsers to return multiple Parse object, this will
[
https://issues.apache.org/jira/browse/NUTCH-443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Dogacan Güney updated NUTCH-443:
Attachment: parse-map-core-draft-v1.patch
allow parsers to return multiple Parse object, this will
[
https://issues.apache.org/jira/browse/NUTCH-443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Renaud Richardet updated NUTCH-443:
Attachment: parsers.diff
Great, here's my work-in-progress (not finished, not tested) for
16 matches