In addition to this. Can someone please explain why [0] StorageUtils#getDataStoreClass is a private method in this class. The reason I ask is that it would be nice to be able to log which Gora class is being used to persist the Injected URLs.
Are there any security risks associated with making this class public and accessible? Thanks Lewis [0] https://svn.apache.org/repos/asf/nutch/branches/2.x/src/java/org/apache/nutch/storage/StorageUtils.java On Mon, Oct 29, 2012 at 4:22 PM, Lewis John Mcgibbney <[email protected]> wrote: > Hi, > > So I thought I'd take this one on tonight and see if I can resolve. > Basically, my high level question is as follows... > Is each line of a text file (seed file) which we attempt to inject > into the webdb considered as an individual map task? > The idea is to establish a counter for the successfully injected URLS > (and possibly a counter for unsuccessful ones as well) so determining > how many URLs are (or should be) present within the webdb can be > determined after bootstrapping Nutch via the inject command. > > Thanks all > > Lewis > > -- > Lewis -- Lewis

