Re: issue in nutch-default.xml

2012-02-17 Thread Lewis John Mcgibbney
Hi Kaveh, The description is incorrect and should be changed to This value expressed in milliseconds, I'll commit this just now. Thanks for reporting. On Fri, Feb 17, 2012 at 7:15 AM, ka...@plutoz.com wrote: so I checked the source code. the value seems that should be in fact 7. current

Re: issue in nutch-default.xml

2012-02-17 Thread Lewis John Mcgibbney
Committed @ revision 1245390 in trunk. I'll get nutchgora sorted out later on as I'm doing some other stuff up there. Thanks again. Lewis On Fri, Feb 17, 2012 at 11:04 AM, Lewis John Mcgibbney lewis.mcgibb...@gmail.com wrote: Hi Kaveh, The description is incorrect and should be changed to

[jira] [Commented] (NUTCH-1210) DomainBlacklistFilter

2012-02-17 Thread Lewis John McGibbney (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13210188#comment-13210188 ] Lewis John McGibbney commented on NUTCH-1210: Hey Markus. In /conf we also

Re: issue in nutch-default.xml

2012-02-17 Thread Markus Jelsma
this is actually 7 days in milliseconds. so I checked the source code. the value seems that should be in fact 7. current default value means 1.6 thousand millenniums. <property> <name>crawl.gen.delay</name> <value>60480</value> <description> This value, expressed in days, defines how

[jira] [Commented] (NUTCH-1246) Upgrade to Hadoop 1.0.0

2012-02-17 Thread Lewis John McGibbney (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13210312#comment-13210312 ] Lewis John McGibbney commented on NUTCH-1246: How is this issue?

[jira] [Updated] (NUTCH-1086) Rewrite protocol-httpclient

2012-02-17 Thread Lewis John McGibbney (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1086: Priority: Critical (was: Major) Rewrite protocol-httpclient

[DISCUSS] Nutchgora 2.0 release

2012-02-17 Thread Lewis John Mcgibbney
Hi Guys, Here we are again :0) What are the perceptions with aiming for a 2.0 release? We have one blocking issue, the webapp, which I got no response from the community at large about. I would like to see this addressed but this is another issue. Speaking with the future in mind, we are hoping

[jira] [Commented] (NUTCH-1246) Upgrade to Hadoop 1.0.0

2012-02-17 Thread Markus Jelsma (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13210339#comment-13210339 ] Markus Jelsma commented on NUTCH-1246: hmm, the jackson dep is still there but it

Re: issue in nutch-default.xml

2012-02-17 Thread kaveh minooie
I know but the code expects to read the number of days: genDelay = job.getLong(GENERATOR_DELAY, 7L) * 3600L * 24L * 1000L; and as you can see the default value, as is mentioned in the description, is 7 and it is in days not milliseconds; On 02/17/2012 12:33 AM, Markus Jelsma wrote: this is
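The arithmetic in the quoted line can be checked in isolation. This is only a rough sketch: the class GenDelayDemo and the method daysToMillis are hypothetical names, not part of Nutch, and the 604800000 figure is an assumed value (7 days already expressed in milliseconds), not one taken from the configuration file quoted in this thread.

```java
// Hypothetical stand-alone demo; not Nutch code.
class GenDelayDemo {
    // Same factors as the quoted line: seconds per hour, hours per day, millis per second.
    static final long MILLIS_PER_DAY = 3600L * 24L * 1000L;

    // Mirrors the multiplication in the quoted line: the input is treated as days.
    static long daysToMillis(long days) {
        return days * MILLIS_PER_DAY;
    }

    public static void main(String[] args) {
        // The fallback default of 7 is read as days and becomes milliseconds.
        System.out.println("7 days = " + daysToMillis(7L) + " ms");

        // Assumption: a configured value that already holds 7 days in milliseconds.
        long configured = 604800000L;
        // Fed through the same multiplication, it is treated as that many days.
        long misread = daysToMillis(configured);
        System.out.println("misread delay = " + misread + " ms, roughly "
                + (configured / 365L) + " years");
    }
}
```

If the configured number is already in milliseconds, the multiplication inflates it by a factor of 86400000, which would be consistent with the "thousand millenniums" remark earlier in the thread.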

[jira] [Commented] (NUTCH-585) [PARSE-HTML plugin] Block certain parts of HTML code from being indexed

2012-02-17 Thread Lewis John McGibbney (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13210498#comment-13210498 ] Lewis John McGibbney commented on NUTCH-585: I like this contribution

[jira] [Commented] (NUTCH-1001) bin/nutch fetch/parse handle crawl/segments directory

2012-02-17 Thread Lewis John McGibbney (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13210506#comment-13210506 ] Lewis John McGibbney commented on NUTCH-1001: Hi Gabriele are you interested

[jira] [Commented] (NUTCH-1079) StringBuffer converted to StringBuilder

2012-02-17 Thread Lewis John McGibbney (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13210516#comment-13210516 ] Lewis John McGibbney commented on NUTCH-1079: How is this guys? It seems that

[jira] [Commented] (NUTCH-1210) DomainBlacklistFilter

2012-02-17 Thread Lewis John McGibbney (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13210529#comment-13210529 ] Lewis John McGibbney commented on NUTCH-1210: One last thing, I think your

[jira] [Commented] (NUTCH-1246) Upgrade to Hadoop 1.0.0

2012-02-17 Thread Lewis John McGibbney (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13210534#comment-13210534 ] Lewis John McGibbney commented on NUTCH-1246: Removal of jackson library in

[jira] [Commented] (NUTCH-1193) Incorrect url transform to lowercase: parameter solr

2012-02-17 Thread Lewis John McGibbney (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13210542#comment-13210542 ] Lewis John McGibbney commented on NUTCH-1193: Committed @ revision 1245753 in

[jira] [Resolved] (NUTCH-1193) Incorrect url transform to lowercase: parameter solr

2012-02-17 Thread Lewis John McGibbney (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-1193. - Resolution: Fixed Incorrect url transform to lowercase: parameter solr

[jira] [Closed] (NUTCH-1193) Incorrect url transform to lowercase: parameter solr

2012-02-17 Thread Lewis John McGibbney (Closed) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-1193. --- Incorrect url transform to lowercase: parameter solr

Build failed in Jenkins: nutch-trunk-maven #153

2012-02-17 Thread Apache Jenkins Server
See https://builds.apache.org/job/nutch-trunk-maven/153/ -- Started by an SCM change Building remotely on ubuntu2 in workspace https://builds.apache.org/job/nutch-trunk-maven/ws/ hudson.util.IOException2: remote file operation failed:

Re: svn commit: r1245753 - in /nutch/trunk: CHANGES.txt src/java/org/apache/nutch/crawl/Crawl.java

2012-02-17 Thread USC Mail
unsubscribe. Sent from my iPhone On Feb 17, 2012, at 12:48 PM, lewi...@apache.org wrote: Author: lewismc Date: Fri Feb 17 20:48:26 2012 New Revision: 1245753 URL: http://svn.apache.org/viewvc?rev=1245753&view=rev Log: commit to adress NUTCH-1193 update to CHANGES.txt Modified:

[jira] [Commented] (NUTCH-1193) Incorrect url transform to lowercase: parameter solr

2012-02-17 Thread Hudson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13210805#comment-13210805 ] Hudson commented on NUTCH-1193: Integrated in Nutch-trunk #1760 (See

Jenkins build is back to normal : nutch-trunk-maven #154

2012-02-17 Thread Apache Jenkins Server
See https://builds.apache.org/job/nutch-trunk-maven/154/