Dear Doug,
Any news about integration of OPIC in mapred? I have time to develop
OPIC on Nutch Mapred. Can you help me to start?
From the email by Carlos Alberto-Alejandro CASTILLO-Ocaranza, it seems that
the best way to integrate OPIC is in the old webdb; is this way also valid for
CrawlDb in Mapred?
[ http://issues.apache.org/jira/browse/NUTCH-116?page=comments#action_12332493 ]
Doug Cutting commented on NUTCH-116:
Paul,
This looks like good stuff.
I could commit it more easily if changes were restricted to those required by
TestNDFS. Changes
Massimo Miccoli wrote:
Any news about integration of OPIC in mapred? I have time to develop
OPIC on Nutch Mapred. Can you help me to start?
From the email by Carlos Alberto-Alejandro CASTILLO-Ocaranza, it seems that
the best way to integrate OPIC is in the old webdb; is this way also valid for
CrawlDb
[ http://issues.apache.org/jira/browse/NUTCH-88?page=comments#action_12332518 ]
Doug Cutting commented on NUTCH-88:
---
These both sound like good changes. +1
Enhance ParserFactory plugin selection policy
-
Here is a patch that implements this. I'm still testing it. If it
appears to work well, I will commit it.
Doug Cutting wrote:
Massimo Miccoli wrote:
Any news about integration of OPIC in mapred? I have time to develop
OPIC on Nutch Mapred. Can you help me to start?
By the email from
[ http://issues.apache.org/jira/browse/NUTCH-82?page=all ]
Nick Jacobsen updated NUTCH-82:
---
Attachment: nutch.pl
Perl version of the control script, meant to work on both Windows and Unix like
operating systems. Has been tested on Windows 2000/XP/2003
Massimo Miccoli wrote:
Any news about integration of OPIC in mapred? I have time to
develop OPIC on Nutch Mapred. Can you help me to start?
From the email by Carlos Alberto-Alejandro CASTILLO-Ocaranza, it seems
that the best way to integrate OPIC is in the old webdb; is this way
also valid for
CrawlDb
[ http://issues.apache.org/jira/browse/NUTCH-88?page=comments#action_12332541 ]
Doug Cutting commented on NUTCH-88:
---
If it's to happen at parse time then it should happen in the Content
constructor, so that it's only done in one place, and we don't rely
[ http://issues.apache.org/jira/browse/NUTCH-82?page=comments#action_12332543 ]
Doug Cutting commented on NUTCH-82:
---
I do not think we should have multiple versions of the command line tools,
since that complicates maintenance. A windows batch file is
[ http://issues.apache.org/jira/browse/NUTCH-116?page=all ]
Paul Baclace updated NUTCH-116:
---
Attachment: required_by_TestNDFS_v2.patch
Change Notes revised for patch required_by_TestNDFS_v2.patch which supersedes
required_by_TestNDFS.patch:
[ http://issues.apache.org/jira/browse/NUTCH-116?page=comments#action_12332546 ]
Paul Baclace commented on NUTCH-116:
Doug,
Thanks for the quick response.
1. Should BLOCKREPORT_INTERVAL and DATANODE_STARTUP_PERIOD be removed from
FSConstants