[nutch] branch master updated: NUTCH-2758 Add plugin READMEs to binary release packages

2020-05-05 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new 90502bd NUTCH-2758 Add plugin READMEs to binary

[nutch] branch master updated: NUTCH-2753 Add -listen option to command-line help of CrawlDbReader and LinkDbReader

2020-05-05 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new c573c70 NUTCH-2753 Add -listen option to

[nutch] branch master updated: NUTCH-2002 parse and index checkers to check robots.txt - applied Julien's patch to recent code base - also check redirects whether they are allowed - add command-line p

2020-05-05 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new 46db3ed NUTCH-2002 parse and index checkers to

[nutch] branch master updated: NUTCH-2785 FreeGenerator: command-line option to define number of generated fetch lists - add command-line option `-numFetchers` to FreeGenerator - in local mode: genera

2020-05-05 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new 72f3ff2 NUTCH-2785 FreeGenerator: command-line

[nutch] branch master updated: NUTCH-1194 Generator: CrawlDB lock should be released earlier - release CrawlDb lock after select step, in case, generated items are not marked in CrawlDb (generate.upda

2020-05-05 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new 11eea5a NUTCH-1194 Generator: CrawlDB lock

[nutch] branch master updated: NUTCH-2434 Add methods to reset parameters HTMLMetaTags (apply patch contributed by Markus)

2020-05-05 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git The following commit(s) were added to refs/heads/master by this push: new a0ed0b4 NUTCH-2434 Add methods to reset