Is it possible to set different depths for different urls in seed.txt. For
example. there are two url 'A' and 'B' in seed.txt, is it possible that
crawler crawls for depth 3 for url 'A' and depth 2 for url 'B'.
Yes, use the scoring-depth plugin and set _maxdepth_=X in the seeds file
HTH
Julien
On 18 January 2017 at 10:40, Manav Bagai wrote:
> Is it possible to set different depths for different urls in seed.txt. For
> example. there are two url 'A' and 'B' in seed.txt, is
Can you please elaborate the solution provided?
On Wed, Jan 18, 2017 at 4:23 PM, Julien Nioche <
lists.digitalpeb...@gmail.com> wrote:
> Yes, use the scoring-depth plugin and set _maxdepth_=X in the seeds file
>
> HTH
>
> Julien
>
> On 18 January 2017 at 10:40, Manav Bagai
Hi,
I am trying to list all books about Nutch -- here are the ones I have found:
- Big data made easy : a working guide to the complete Hadoop toolset
(Chapter 3) http://www.apress.com/us/book/9781484200957
- Hadoop: The Definitive Guide, 2nd Edition (Chapter 16)
Hello, fellow Apache enthusiast. Thanks for your participation, and
interest in, the projects of the Apache Software Foundation.
I wanted to remind you that the Call For Papers (CFP) for ApacheCon
North America, and Apache: Big Data North America, closes in less than a
month. If you've been
5 matches
Mail list logo