[
https://issues.apache.org/jira/browse/NUTCH-444?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12542819
]
Renaud Richardet commented on NUTCH-444:
hi,
i am travelling and will be offline until january 2008. thanks
[
https://issues.apache.org/jira/browse/NUTCH-540?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Renaud Richardet updated NUTCH-540:
---
Priority: Major (was: Blocker)
could you please attach log files and error messages? thanks
[
https://issues.apache.org/jira/browse/NUTCH-369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Renaud Richardet updated NUTCH-369:
---
Attachment: patch.diff
unified diff against head.
- fixes encoding, as described by King
[
https://issues.apache.org/jira/browse/NUTCH-369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Renaud Richardet updated NUTCH-369:
---
Attachment: remover.diff
just FYI, you can further filter which element neko should keep and
[
https://issues.apache.org/jira/browse/NUTCH-443?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12472733
]
Renaud Richardet commented on NUTCH-443:
hi All,
Glad to see that this patch is moving forward :-)
I have
[
https://issues.apache.org/jira/browse/NUTCH-443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Renaud Richardet updated NUTCH-443:
---
Attachment: NUTCH-443-draft-v4.patch
Hi Dogacan,
Thanks for merging the patches, good
[
https://issues.apache.org/jira/browse/NUTCH-443?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12471878
]
Renaud Richardet commented on NUTCH-443:
Nutch Newbie, Gal, Chris
It's great that you discuss alternative
[
https://issues.apache.org/jira/browse/NUTCH-443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Renaud Richardet updated NUTCH-443:
---
Attachment: parsers.diff
Great, here's my work-in-progress(not finished, not tested) for
allow parsers to return multiple Parse object, this will speed up the rss parser
Key: NUTCH-443
URL: https://issues.apache.org/jira/browse/NUTCH-443
Project: Nutch
[ http://issues.apache.org/jira/browse/NUTCH-412?page=all ]
Renaud Richardet updated NUTCH-412:
---
Attachment: plugin_parse-feedUrl2.diff
plugin to parse the feed-url (rss/atom) of a blog
-
plugin to parse the feed-url (rss/atom) of a blog
-
Key: NUTCH-412
URL: http://issues.apache.org/jira/browse/NUTCH-412
Project: Nutch
Issue Type: New Feature
Affects Versions: 0.9.0
[ http://issues.apache.org/jira/browse/NUTCH-412?page=all ]
Renaud Richardet updated NUTCH-412:
---
Attachment: plugin_parse-feedUrl.diff
unified diff against head (Rev: 481445)
plugin to parse the feed-url (rss/atom) of a blog
extraction of links will fail for whole page if one single link cannot be parsed
Key: NUTCH-359
URL: http://issues.apache.org/jira/browse/NUTCH-359
Project: Nutch
[ http://issues.apache.org/jira/browse/NUTCH-346?page=all ]
Renaud Richardet updated NUTCH-346:
---
Attachment: log4j_plugins.diff
OK, here we go. This patch should be good for 0.8 and trunk.
Improve readability of logs/hadoop.log
Improve readability of logs/hadoop.log
--
Key: NUTCH-346
URL: http://issues.apache.org/jira/browse/NUTCH-346
Project: Nutch
Issue Type: Improvement
Affects Versions: 0.9.0
Environment: ubuntu
[
http://issues.apache.org/jira/browse/NUTCH-266?page=comments#action_12426579 ]
Renaud Richardet commented on NUTCH-266:
KuroSaka, yes you can download the hadoop jar, release 0.5.0 from the project
website:
[
http://issues.apache.org/jira/browse/NUTCH-330?page=comments#action_12426629 ]
Renaud Richardet commented on NUTCH-330:
This bug is obsolte, I just found out that Nutch already allows to search from
the command line via
bin/nutch
[ http://issues.apache.org/jira/browse/NUTCH-266?page=all ]
Renaud Richardet updated NUTCH-266:
---
Attachment: patch_hadoop-0.5.0.diff
Now that Hadoop 0.5 has been released, here's the patch to use hadoop-0.5.0.jar
in Nutch-0.8.x
HTH,
Renaud
hadoop
[ http://issues.apache.org/jira/browse/NUTCH-266?page=all ]
Renaud Richardet updated NUTCH-266:
---
Attachment: patch.diff
Thank you Sami,
We had a similar problem with Win XP and were able to fix it by using
hadoop-nightly.jar. However, because of
[ http://issues.apache.org/jira/browse/NUTCH-208?page=all ]
Renaud Richardet updated NUTCH-208:
---
Attachment: proxy_exception_list-0.8.diff
I updated the patch to 0.8 and corrected small typo (if
(!.equals(input[i].trim())){ ). The proxy exception
command line tool to search a Lucene index
--
Key: NUTCH-330
URL: http://issues.apache.org/jira/browse/NUTCH-330
Project: Nutch
Issue Type: Improvement
Components: searcher
Affects
[ http://issues.apache.org/jira/browse/NUTCH-330?page=all ]
Renaud Richardet updated NUTCH-330:
---
Attachment: clSearch.diff
unified diff against head
command line tool to search a Lucene index
--
[ http://issues.apache.org/jira/browse/NUTCH-330?page=all ]
Renaud Richardet updated NUTCH-330:
---
Attachment: clSearch.diff
forgot the echo in sh...
command line tool to search a Lucene index
--
23 matches
Mail list logo