I've not built this particular package myself, but generally, one doesn't
need to build it -- you can most likely do what you need to using the
configuration xml. If not, I suggest you respond with what you need to do
so others may answer. -- H
On 7 January 2015 at 19:09, Shadi Saleh
Hi Markus,
On Wed, Jan 7, 2015 at 7:42 PM, user-digest-h...@nutch.apache.org wrote:
Hi - it is a strange piece indeed. You cannot just tell it where the
crawldb is, you need to tell it where the directory is, so specifying
current is ok, but not part-*
Thanks very much. I'll cook a patch up
Dear all,
I am trying to build nutch ver 2.2.1 with ant, but I got this error:
[ivy:resolve] ::
[ivy:resolve] :: UNRESOLVED DEPENDENCIES ::
[ivy:resolve] ::
[ivy:resolve] ::
On 7 January 2015 at 18:56, Shadi Saleh propat...@gmail.com wrote:
[ivy:resolve] :: org.restlet.jse#org.restlet;2.0.5: not found
dependency org=org.restlet.jee name=org.restlet rev=2.0.12 /
[ivy:resolve] :: org.restlet.jse#org.restlet.ext.jackson;2.0.5: not found
dependency
Thank you so much for quick reply, I still have the problem, here is the
full output:
uildfile: /root/svn_nutch/nutch/build.xml
Trying to override old definition of task javac
[taskdef] Could not load definitions from resource
org/sonar/ant/antlib.xml. It could not be found.
I just removed the previous two lines and it works, I do not see them now
on nutch git repository that's why I removed them, hope it will not affect
on the future.
best
On Thu, Jan 8, 2015 at 4:28 AM, Hasan Diwan hasan.di...@gmail.com wrote:
I've not built this particular package myself, but
On 7 January 2015 at 19:40, Shadi Saleh propat...@gmail.com wrote:
I just removed the previous two lines and it works, I do not see them now
on nutch git repository that's why I removed them, hope it will not affect
on the future.
Good to hear it was a simple fix. -- H
--
OpenPGP:
Hi - it is a strange piece indeed. You cannot just tell it where the crawldb
is, you need to tell it where the directory is, so specifying current is ok,
but not part-*
M
-Original message-
From:Lewis John Mcgibbney lewis.mcgibb...@gmail.com
Sent: Wednesday 7th January 2015 19:48
Hi Renato,
On Thu, Dec 11, 2014 at 5:52 AM, user-digest-h...@nutch.apache.org wrote:
From quickly checking out the code (Host.java + HostDB +
HostDBUpdateReducer) it would seems like there is a bug exactly where you
pointed.
LOGGED!!!
https://issues.apache.org/jira/browse/NUTCH-1907
WOW
Hi Krishna,
On Thu, Dec 11, 2014 at 5:52 AM, user-digest-h...@nutch.apache.org wrote:
When I dump data from segments, I am getting entire html data. Shouldnot it
be just headings read from crawling. Why am I getting entire data?
Please help me. Thanks in advance.
No this is
Hi Folks,
Does anyone else have problems with the DomainStaticstics [0] tool?
I use it as follows
./bin/nutch domainstats /usr/local/.../crawldb/old/part-0/ output tld
Although it is generated, nothing is written to the output directory
./bin/nutch domainstats
Hi,
I have a custom plugin for nutch 1.9, which runs well when I run the crawl
script in local mode.
But, when I run the crawl script in deploy mode, the custom plugin dint
work. I believe this is due to the fact that my custom plugin is not
incorporated in nutch job jar.
Can anyone please
12 matches
Mail list logo