Regarding the nutch log, i think i missed out something while running the job. In eclipse I have given the following as VM arguments - -Dhadoop.log.dir=logs -Dhadoop.log.file=nutch.log (local mode) How should i do the same while running the job file.
On Thu, May 10, 2012 at 11:43 PM, Lewis John Mcgibbney < [email protected]> wrote: > Hi, > > Re your first couple of points please see this very informative > conversation > > http://www.mail-archive.com/[email protected]/msg00538.html > > On Thu, May 10, 2012 at 5:22 PM, Vijith <[email protected]> wrote: > > > - Also i am not finding the nutch log file > > As you would expect, you are running Nutch jobs as part of Hadoop, > therefore your logs will be in your Hadoop setup (JT). Please see this > issue if you wish to submit a patch to change this > https://issues.apache.org/jira/browse/NUTCH-342 > > > - Will it in anyway affect the distributed run if we are using some > > external libraries like Jena > > This is a near impossible question for us to answer given the real > brief nature of the description > > > > > - right now my filters are reading the ontology files and creating jena > > objects every time the filters are executed > > - how can i make this faster; is there any caching mechanism available > > For what? > -- *Thanks & Regards* * * *Vijith V*

