Setting different depths for different urls in seed.txt

2017-01-18 Thread Manav Bagai
Is it possible to set different depths for different urls in seed.txt. For example. there are two url 'A' and 'B' in seed.txt, is it possible that crawler crawls for depth 3 for url 'A' and depth 2 for url 'B'.

Re: Setting different depths for different urls in seed.txt

2017-01-18 Thread Julien Nioche
Yes, use the scoring-depth plugin and set _maxdepth_=X in the seeds file HTH Julien On 18 January 2017 at 10:40, Manav Bagai wrote: > Is it possible to set different depths for different urls in seed.txt. For > example. there are two url 'A' and 'B' in seed.txt, is

Re: Setting different depths for different urls in seed.txt

2017-01-18 Thread Manav Bagai
Can you please elaborate the solution provided? On Wed, Jan 18, 2017 at 4:23 PM, Julien Nioche < lists.digitalpeb...@gmail.com> wrote: > Yes, use the scoring-depth plugin and set _maxdepth_=X in the seeds file > > HTH > > Julien > > On 18 January 2017 at 10:40, Manav Bagai

Books about Nutch

2017-01-18 Thread Fengtan
Hi, I am trying to list all books about Nutch -- here are the ones I have found: - Big data made easy : a working guide to the complete Hadoop toolset (Chapter 3) http://www.apress.com/us/book/9781484200957 - Hadoop: The Definitive Guide, 2nd Edition (Chapter 16)

ApacheCon CFP closing soon (11 February)

2017-01-18 Thread Rich Bowen
Hello, fellow Apache enthusiast. Thanks for your participation, and interest in, the projects of the Apache Software Foundation. I wanted to remind you that the Call For Papers (CFP) for ApacheCon North America, and Apache: Big Data North America, closes in less than a month. If you've been