[jira] [Commented] (NUTCH-1370) Expose exact number of urls injected @runtime

2012-11-22 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13503013#comment-13503013 ] Hudson commented on NUTCH-1370: --- Integrated in Nutch-trunk #2026 (See [h

[jira] [Commented] (NUTCH-1370) Expose exact number of urls injected @runtime

2012-11-22 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13503012#comment-13503012 ] Hudson commented on NUTCH-1370: --- Integrated in Nutch-nutchgora #412 (See [h

[jira] [Commented] (NUTCH-1370) Expose exact number of urls injected @runtime

2012-11-22 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13502813#comment-13502813 ] Hudson commented on NUTCH-1370: --- Integrated in nutch-trunk-maven #503 (See [h

[jira] [Resolved] (NUTCH-1370) Expose exact number of urls injected @runtime

2012-11-22 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-1370. - Resolution: Fixed Committed @revision 1412573 in trunk Thank you everyone for

[jira] [Commented] (NUTCH-1370) Expose exact number of urls injected @runtime

2012-11-22 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13502799#comment-13502799 ] Lewis John McGibbney commented on NUTCH-1370: - Tested against medium s

[jira] [Updated] (NUTCH-1370) Expose exact number of urls injected @runtime

2012-11-13 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel updated NUTCH-1370: --- Attachment: NUTCH-1370-2.x-v3.patch Hi Lewis, yes, the 1.x patch is not easily transferred

[jira] [Updated] (NUTCH-1370) Expose exact number of urls injected @runtime

2012-11-13 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1370: Attachment: NUTCH-1370-2.x-v2.patch 2nd WIP for 2.x I'm having diffi

[jira] [Updated] (NUTCH-1370) Expose exact number of urls injected @runtime

2012-11-13 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1370: Patch Info: Patch Available > Expose exact number of urls injected @runt

[jira] [Updated] (NUTCH-1370) Expose exact number of urls injected @runtime

2012-11-12 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel updated NUTCH-1370: --- Attachment: NUTCH-1370-1.x.patch Ferdy is right: custom counters are more transparent. Patch

[jira] [Commented] (NUTCH-1370) Expose exact number of urls injected @runtime

2012-11-09 Thread Ferdy Galema (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13493885#comment-13493885 ] Ferdy Galema commented on NUTCH-1370: - Hi, I checked the patch, it seems you

[jira] [Updated] (NUTCH-1370) Expose exact number of urls injected @runtime

2012-11-06 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1370: Attachment: NUTCH-1370-2.x.patch WIP patch for 2.x. I am convinced that I'

[jira] [Commented] (NUTCH-1370) Expose exact number of urls injected @runtime

2012-10-30 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13487322#comment-13487322 ] Lewis John McGibbney commented on NUTCH-1370: - No hassle Seb, I will

[jira] [Commented] (NUTCH-1370) Expose exact number of urls injected @runtime

2012-10-30 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13487316#comment-13487316 ] Sebastian Nagel commented on NUTCH-1370: +1 Would be nice to see also the nu

Re: NUTCH-1370

2012-10-30 Thread Lewis John Mcgibbney
Hi Again, Thanks Julien, I will also make this method public in the patch for 2.x. This is actually getting quite interesting now as I've found out that using the o.a.hadoop.mapreduce.Job#Counters API can actually lead to security issues when attempting to obtain counters fro map and reduce jobs.

Re: NUTCH-1370

2012-10-30 Thread Julien Nioche
Hi, Sounds pretty harmless to have that method public IMHO Julien On 29 October 2012 16:57, Lewis John Mcgibbney wrote: > Hi Julien, > > Thanks for the comments. Any additional ones regarding the accessibility > of the getDataStoreClass? > > Thanks again > > Lewis > > > On Mon, Oct 29, 2012 at

Re: NUTCH-1370

2012-10-29 Thread Lewis John Mcgibbney
Hi Julien, Thanks for the comments. Any additional ones regarding the accessibility of the getDataStoreClass? Thanks again Lewis On Mon, Oct 29, 2012 at 4:52 PM, Julien Nioche < lists.digitalpeb...@gmail.com> wrote: > Hi Lewis > > see comments below > >> >> So I thought I'd take this one on to

Re: NUTCH-1370

2012-10-29 Thread Julien Nioche
Hi Lewis see comments below > > So I thought I'd take this one on tonight and see if I can resolve. > Basically, my high level question is as follows... > Is each line of a text file (seed file) which we attempt to inject > into the webdb considered as an individual map task? > no - each file in

Re: NUTCH-1370

2012-10-29 Thread Lewis John Mcgibbney
In addition to this. Can someone please explain why [0] StorageUtils#getDataStoreClass is a private method in this class. The reason I ask is that it would be nice to be able to log which Gora class is being used to persist the Injected URLs. Are there any security risks associated with making thi

[jira] [Assigned] (NUTCH-1370) Expose exact number of urls injected @runtime

2012-10-29 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned NUTCH-1370: --- Assignee: Lewis John McGibbney > Expose exact number of urls injec

[jira] [Updated] (NUTCH-1370) Expose exact number of urls injected @runtime

2012-09-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1370: Fix Version/s: (was: 2.1) 2.2 > Expose exact number

[jira] [Updated] (NUTCH-1370) Expose exact number of urls injected @runtime

2012-06-07 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche updated NUTCH-1370: - Affects Version/s: (was: 1.4) 1.5 Fix Version/s: (was: 1.5

[jira] [Updated] (NUTCH-1370) Expose exact number of urls injected @runtime

2012-05-22 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche updated NUTCH-1370: - Priority: Minor (was: Major) Running in pseudo-distributed mode gives you more information if

[jira] [Created] (NUTCH-1370) Expose exact number of urls injected @runtime

2012-05-22 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created NUTCH-1370: --- Summary: Expose exact number of urls injected @runtime Key: NUTCH-1370 URL: https://issues.apache.org/jira/browse/NUTCH-1370 Project: Nutch