Re: [Nutch-dev] [Fwd: Fetch list priority]

2005-10-19 Thread Massimo Miccoli
Dear Doug, Any news about integration of OPIC in mapred? I have time to develop OPIC on Nutch Mapred. Can you help me to start? By the email from Carlos Alberto-Alejandro CASTILLO-Ocaranza, seams that the best way to integrate OPIC in on old webdb, is this way valid also CrawlDb in Mapred?

[jira] Commented: (NUTCH-116) TestNDFS a JUnit test specifically for NDFS

2005-10-19 Thread Doug Cutting (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-116?page=comments#action_12332493 ] Doug Cutting commented on NUTCH-116: Paul, This looks like good stuff. I could commit it more easily if changes were restricted to those required by TestNDFS. Changes

Re: [Nutch-dev] [Fwd: Fetch list priority]

2005-10-19 Thread Doug Cutting
Massimo Miccoli wrote: Any news about integration of OPIC in mapred? I have time to develop OPIC on Nutch Mapred. Can you help me to start? By the email from Carlos Alberto-Alejandro CASTILLO-Ocaranza, seams that the best way to integrate OPIC in on old webdb, is this way valid also CrawlDb

[jira] Commented: (NUTCH-88) Enhance ParserFactory plugin selection policy

2005-10-19 Thread Doug Cutting (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-88?page=comments#action_12332518 ] Doug Cutting commented on NUTCH-88: --- These both sound like good changes. +1 Enhance ParserFactory plugin selection policy -

Re: OPIC

2005-10-19 Thread Doug Cutting
Here is a patch that implements this. I'm still testing it. If it appears to work well, I will commit it. Doug Cutting wrote: Massimo Miccoli wrote: Any news about integration of OPIC in mapred? I have time to develop OPIC on Nutch Mapred. Can you help me to start? By the email from

[jira] Updated: (NUTCH-82) Nutch Commands should run on Windows without external tools

2005-10-19 Thread Nick Jacobsen (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-82?page=all ] Nick Jacobsen updated NUTCH-82: --- Attachment: nutch.pl Perl version of the control script, meant to work on both Windows and Unix like operating systems. Has been tested on Windows 2000/XP/2003

Re: [Nutch-dev] [Fwd: Fetch list priority]

2005-10-19 Thread Ken Krugler
Massimo Miccoli wrote: Any news about integration of OPIC in mapred? I have time to develop OPIC on Nutch Mapred. Can you help me to start? By the email from Carlos Alberto-Alejandro CASTILLO-Ocaranza, seams that the best way to integrate OPIC in on old webdb, is this way valid also CrawlDb

[jira] Commented: (NUTCH-88) Enhance ParserFactory plugin selection policy

2005-10-19 Thread Doug Cutting (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-88?page=comments#action_12332541 ] Doug Cutting commented on NUTCH-88: --- If it's to happen at parse time then it should happen in the Content constructor, so that it's only done in one place, and we don't rely

[jira] Commented: (NUTCH-82) Nutch Commands should run on Windows without external tools

2005-10-19 Thread Doug Cutting (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-82?page=comments#action_12332543 ] Doug Cutting commented on NUTCH-82: --- I do not think we should have multiple versions of the command line tools, since that complicates maintenance. A windows batch file is

[jira] Updated: (NUTCH-116) TestNDFS a JUnit test specifically for NDFS

2005-10-19 Thread Paul Baclace (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-116?page=all ] Paul Baclace updated NUTCH-116: --- Attachment: required_by_TestNDFS_v2.patch Change Notes revised for patch required_by_TestNDFS_v2.patch which supercedes required_by_TestNDFS.patch:

[jira] Commented: (NUTCH-116) TestNDFS a JUnit test specifically for NDFS

2005-10-19 Thread Paul Baclace (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-116?page=comments#action_12332546 ] Paul Baclace commented on NUTCH-116: Doug, Thanks for the quick response. 1. Should BLOCKREPORT_INTERVAL and DATANODE_STARTUP_PERIOD be removed from FSConstants