Re: build with ant

2015-01-07 Thread Hasan Diwan
I've not built this particular package myself, but generally, one doesn't need to build it -- you can most likely do what you need to using the configuration xml. If not, I suggest you respond with what you need to do so others may answer. -- H On 7 January 2015 at 19:09, Shadi Saleh

Re: Problems with DomainStatistics

2015-01-07 Thread Lewis John Mcgibbney
Hi Markus, On Wed, Jan 7, 2015 at 7:42 PM, user-digest-h...@nutch.apache.org wrote: Hi - it is a strange piece indeed. You cannot just tell it where the crawldb is, you need to tell it where the directory is, so specifying current is ok, but not part-* Thanks very much. I'll cook a patch up

build with ant

2015-01-07 Thread Shadi Saleh
Dear all, I am trying to build nutch ver 2.2.1 with ant, but I got this error: [ivy:resolve] :: [ivy:resolve] :: UNRESOLVED DEPENDENCIES :: [ivy:resolve] :: [ivy:resolve] ::

Re: build with ant

2015-01-07 Thread Hasan Diwan
On 7 January 2015 at 18:56, Shadi Saleh propat...@gmail.com wrote: [ivy:resolve] :: org.restlet.jse#org.restlet;2.0.5: not found dependency org=org.restlet.jee name=org.restlet rev=2.0.12 / [ivy:resolve] :: org.restlet.jse#org.restlet.ext.jackson;2.0.5: not found dependency

Re: build with ant

2015-01-07 Thread Shadi Saleh
Thank you so much for quick reply, I still have the problem, here is the full output: uildfile: /root/svn_nutch/nutch/build.xml Trying to override old definition of task javac [taskdef] Could not load definitions from resource org/sonar/ant/antlib.xml. It could not be found.

Re: build with ant

2015-01-07 Thread Shadi Saleh
I just removed the previous two lines and it works, I do not see them now on nutch git repository that's why I removed them, hope it will not affect on the future. best On Thu, Jan 8, 2015 at 4:28 AM, Hasan Diwan hasan.di...@gmail.com wrote: I've not built this particular package myself, but

Re: build with ant

2015-01-07 Thread Hasan Diwan
On 7 January 2015 at 19:40, Shadi Saleh propat...@gmail.com wrote: I just removed the previous two lines and it works, I do not see them now on nutch git repository that's why I removed them, hope it will not affect on the future. Good to hear it was a simple fix. -- H -- OpenPGP:

RE: Problems with DomainStatistics

2015-01-07 Thread Markus Jelsma
Hi - it is a strange piece indeed. You cannot just tell it where the crawldb is, you need to tell it where the directory is, so specifying current is ok, but not part-* M -Original message- From:Lewis John Mcgibbney lewis.mcgibb...@gmail.com Sent: Wednesday 7th January 2015 19:48

Re: Potential Bug in 2.X HostDbUpdateReducer

2015-01-07 Thread Lewis John Mcgibbney
Hi Renato, On Thu, Dec 11, 2014 at 5:52 AM, user-digest-h...@nutch.apache.org wrote: From quickly checking out the code (Host.java + HostDB + HostDBUpdateReducer) it would seems like there is a bug exactly where you pointed. LOGGED!!! https://issues.apache.org/jira/browse/NUTCH-1907 WOW

Re: Help regarding headings plugin

2015-01-07 Thread Lewis John Mcgibbney
Hi Krishna, On Thu, Dec 11, 2014 at 5:52 AM, user-digest-h...@nutch.apache.org wrote: When I dump data from segments, I am getting entire html data. Shouldnot it be just headings read from crawling. Why am I getting entire data? Please help me. Thanks in advance. No this is

Problems with DomainStatistics

2015-01-07 Thread Lewis John Mcgibbney
Hi Folks, Does anyone else have problems with the DomainStaticstics [0] tool? I use it as follows ./bin/nutch domainstats /usr/local/.../crawldb/old/part-0/ output tld Although it is generated, nothing is written to the output directory ./bin/nutch domainstats

custom plugin with nutch job jar

2015-01-07 Thread Adil Ishaque Abbasi
Hi, I have a custom plugin for nutch 1.9, which runs well when I run the crawl script in local mode. But, when I run the crawl script in deploy mode, the custom plugin dint work. I believe this is due to the fact that my custom plugin is not incorporated in nutch job jar. Can anyone please