[nutch] branch master updated: NUTCH-2700 Indexchecker: improve command-line help - add options `-doIndex` to pass "checked" document to index writers (the property `doIndex` is kept to ensure back-wa

2019-04-11 snagel
This is an automated email from the ASF dual-hosted git repository.
snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git
The following commit(s) were added to refs/heads/master by this push:
new 76c8cff NUTCH-2700 Indexchecker: improve

[nutch] branch master updated: NUTCH-2703 parse-tika: Boilerpipe should not run for non-(X)HTML pages

2019-04-11 markus
This is an automated email from the ASF dual-hosted git repository.
markus pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git
The following commit(s) were added to refs/heads/master by this push:
new 7e6eabb NUTCH-2703 parse-tika: Boilerpipe