Hi Yongyao,
In addition to Seb's response, please also check out the
'scoring.filter.order' property in nutch-site.xml
https://github.com/apache/nutch/blob/master/conf/nutch-default.xml#L1429-L1437
This will determine the order and provide you with more control over
complex scoring logic.
Lewis
Hi,
the score calculated so far (by the filter(s) in front of the chain)
is passed to the method of the next scoring filter
- either directly as float argument
- or as field, e.g., of a CrawlDatum object
It's on the ScoringFilter implementation whether to ignore this value
(overwriting it), to
+1 from me:
SIGS + CHECKSUMS check out.
LMC-053601:nutch-release mattmann$ $HOME/bin/stage_apache_rc apache-nutch
1.13-src https://dist.apache.org/repos/dist/dev/nutch/1.13/
% Total% Received % Xferd Average Speed TimeTime Time Current
Dload
+1
+ signatures verified
+ compile + test
+ small test crawl and index into Solr 4.10.4
- clean (CleaningJob) on Solr 4 fails but this is already known (NUTCH-2269)
- indexing into Solr 5.5.0 fails without a properly configured managed-schema
We should deliver a proper managed-schema to
+1
On Wed, Mar 29, 2017 at 4:06 PM, Julien Nioche <
lists.digitalpeb...@gmail.com> wrote:
> Hi Lewis
>
> +1 compiled from source and ran a small crawl in local mode. All good!
>
> Thanks
>
> Julien
>
> On 29 March 2017 at 05:20, lewis john mcgibbney
> wrote:
>
>> Hi Folks,
Hi Lewis
+1 compiled from source and ran a small crawl in local mode. All good!
Thanks
Julien
On 29 March 2017 at 05:20, lewis john mcgibbney wrote:
> Hi Folks,
>
> A first candidate for the Nutch 1.13 release is available at:
>
>
6 matches
Mail list logo