Aha! I was wrong when I said I was using all default settings. I forgot I had
followed a tutorial that told mem to put |scoring-depth| instead of
|scoring-opic| into the plugin.includes property. Now I get a variety of scores.
Anyway, what is the general advice on which scoring method to use? Is
Thanks lewis.
Nutch crawl script has an automatic option to detect if it is distributed or
local mode.
as you said i have copied nutch into a cluster and also compile as a job with
its configuration, and is done.
That is a complex task because ambari has a lot of component that are
intersting.
Hi Eyeris,
Replies inline
On Fri, Oct 28, 2016 at 8:51 PM, wrote:
> From: Eyeris Rodriguez Rueda
> To: user@nutch.apache.org
> Cc:
> Date: Fri, 28 Oct 2016 09:43:59 -0400 (CDT)
> Subject: how to insert nutch into ambari ecosystem ?
> Hi all.
>
Thank you Lewis!
About second question
db
{
"batchId": "batch-id"
}
I replaced batch-id with value from batchId from database.
It doesn't work.
Regards,
Vladimir.
-Original Message-
From: lewis john mcgibbney [mailto:lewi...@apache.org]
Sent: November-15-16 11:53 AM
To:
Hi Eyeris,
I've just tried Nutch master branch to parse outlinks from a number of RSS
Feeds, an example being 'http://www.jpl.nasa.gov/blog/feed/'. This works
perfectly with both the feed and parse-tika plugins. Outlinks are extracted
accordingly.
Can you provide an example of the RSS Feeds you
Hi Vladimir,
Responses inline
On Thu, Nov 10, 2016 at 1:05 AM, wrote:
> From: Vladimir Loubenski
> To: "user@nutch.apache.org"
> Cc:
> Date: Tue, 8 Nov 2016 17:53:59 +
> Subject: Nutch 2.3.1 REST calls to DB
Hi Michael,
Replies inline
On Sat, Nov 12, 2016 at 7:10 PM, wrote:
> From: Michael Coffey
> To: "user@nutch.apache.org"
> Cc:
> Date: Sun, 13 Nov 2016 03:07:16 + (UTC)
> Subject: How can I Score?
> When
7 matches
Mail list logo