[jira] [Commented] (NUTCH-2555) URL normalization problem: path not starting with a '/'

2018-06-12 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16509865#comment-16509865 ] Hudson commented on NUTCH-2555: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3534 (See

[jira] [Commented] (NUTCH-2556) protocol-http makes invalid HTTP/1.0 requests

2018-06-12 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16509866#comment-16509866 ] Hudson commented on NUTCH-2556: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3534 (See

[jira] [Commented] (NUTCH-2484) Extend indexer-elastic-rest to support languages

2018-06-01 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16498409#comment-16498409 ] Hudson commented on NUTCH-2484: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3527 (See

[jira] [Commented] (NUTCH-2380) indexer-elastic version upgrade to 5.3.0

2018-06-01 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16498410#comment-16498410 ] Hudson commented on NUTCH-2380: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3527 (See

[jira] [Commented] (NUTCH-2590) SegmentReader -get fails

2018-06-02 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2590?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16499047#comment-16499047 ] Hudson commented on NUTCH-2590: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3529 (See

[jira] [Commented] (NUTCH-2562) protocol-http fails to read large chunked HTTP responses

2018-06-02 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16499046#comment-16499046 ] Hudson commented on NUTCH-2562: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3529 (See

[jira] [Commented] (NUTCH-2580) Improvements for Rabbitmq support

2018-06-02 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2580?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16499026#comment-16499026 ] Hudson commented on NUTCH-2580: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3528 (See

[jira] [Commented] (NUTCH-2583) Upgrading Nutch's dependencies

2018-06-02 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2583?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16499024#comment-16499024 ] Hudson commented on NUTCH-2583: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3528 (See

[jira] [Commented] (NUTCH-1480) SolrIndexer to write to multiple servers.

2018-06-02 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16499028#comment-16499028 ] Hudson commented on NUTCH-1480: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3528 (See

[jira] [Commented] (NUTCH-2584) Upgrade parse-tika to use Tika 1.18

2018-06-02 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2584?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16499025#comment-16499025 ] Hudson commented on NUTCH-2584: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3528 (See

[jira] [Commented] (NUTCH-2589) HTML redirections are not followed when using parse-tika

2018-06-02 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16499027#comment-16499027 ] Hudson commented on NUTCH-2589: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3528 (See

[jira] [Commented] (NUTCH-2592) Fetcher to log reason of failed fetches

2018-06-06 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2592?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16503019#comment-16503019 ] Hudson commented on NUTCH-2592: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3530 (See

[jira] [Commented] (NUTCH-2593) Single mode doesn't work in RabbitMQ indexer

2018-06-06 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16503020#comment-16503020 ] Hudson commented on NUTCH-2593: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3530 (See

[jira] [Commented] (NUTCH-2510) Crawl script modification. HostDb : generate, optional usage and description

2018-07-02 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2510?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16530036#comment-16530036 ] Hudson commented on NUTCH-2510: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3541 (See

[jira] [Commented] (NUTCH-2432) Protocol httpclient to disable cookies if http.enable.cookie.header is false

2018-07-02 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2432?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16530103#comment-16530103 ] Hudson commented on NUTCH-2432: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3542 (See

[jira] [Commented] (NUTCH-2569) ClassNotFoundException when running in (pseudo-)distributed mode

2018-04-26 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2569?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16453905#comment-16453905 ] Hudson commented on NUTCH-2569: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3522 (See

[jira] [Commented] (NUTCH-2571) SegmentReader -list fails to read segment

2018-04-26 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2571?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16453908#comment-16453908 ] Hudson commented on NUTCH-2571: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3522 (See

[jira] [Commented] (NUTCH-2517) mergesegs corrupts segment data

2018-04-26 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16453904#comment-16453904 ] Hudson commented on NUTCH-2517: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3522 (See

[jira] [Commented] (NUTCH-2570) Deduplication job fails to install deduplicated CrawlDb

2018-04-26 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2570?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16453906#comment-16453906 ] Hudson commented on NUTCH-2570: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3522 (See

[jira] [Commented] (NUTCH-1228) Change mapred.task.timeout to mapreduce.task.timeout in fetcher

2018-04-26 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16453822#comment-16453822 ] Hudson commented on NUTCH-1228: --- SUCCESS: Integrated in Jenkins build Nutch-nutchgora #1606 (See

[jira] [Commented] (NUTCH-2527) URL filter: provide rules to exclude localhost and private address spaces

2018-04-26 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2527?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16453835#comment-16453835 ] Hudson commented on NUTCH-2527: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3521 (See

[jira] [Commented] (NUTCH-1763) Improving comments on the Injector Class

2017-10-19 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1763?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16211836#comment-16211836 ] Hudson commented on NUTCH-1763: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3458 (See

[jira] [Commented] (NUTCH-2445) Fetcher following outlinks to keep track of already fetched items

2017-10-23 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16215248#comment-16215248 ] Hudson commented on NUTCH-2445: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3461 (See

[jira] [Commented] (NUTCH-2493) Add configuration parameter for sitemap processing to crawler script

2018-01-10 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2493?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16320632#comment-16320632 ] Hudson commented on NUTCH-2493: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3494 (See

[jira] [Commented] (NUTCH-2492) Add more configuration parameters to crawl script

2018-01-08 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2492?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16316252#comment-16316252 ] Hudson commented on NUTCH-2492: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3492 (See

[jira] [Commented] (NUTCH-2497) Elastic REST Indexer: Allow multiple hosts

2018-01-18 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2497?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16330996#comment-16330996 ] Hudson commented on NUTCH-2497: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3497 (See

[jira] [Commented] (NUTCH-2441) ARG_SEGMENT usage

2018-01-18 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16330995#comment-16330995 ] Hudson commented on NUTCH-2441: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3497 (See

[jira] [Commented] (NUTCH-2461) Generate passes the data to when maxCount == 0

2018-01-15 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16326474#comment-16326474 ] Hudson commented on NUTCH-2461: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3496 (See

[jira] [Commented] (NUTCH-2321) Indexing filter checker leaks threads

2018-01-15 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16326475#comment-16326475 ] Hudson commented on NUTCH-2321: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3496 (See

[jira] [Commented] (NUTCH-1129) Any23 Nutch plugin

2018-01-11 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16323068#comment-16323068 ] Hudson commented on NUTCH-1129: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3495 (See

[jira] [Commented] (NUTCH-2466) Sitemap processor to follow redirects

2018-01-31 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16346947#comment-16346947 ] Hudson commented on NUTCH-2466: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3500 (See

[jira] [Commented] (NUTCH-2494) Fetcher: java.lang.IllegalArgumentException: Wrong FS: s3

2018-01-29 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2494?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16343555#comment-16343555 ] Hudson commented on NUTCH-2494: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3499 (See

[jira] [Commented] (NUTCH-2508) Misleading documentation about http.proxy.exception.list

2018-01-31 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2508?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16347837#comment-16347837 ] Hudson commented on NUTCH-2508: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3501 (See

[jira] [Commented] (NUTCH-2489) Dependency collision with lucene-analyzers-common in scoring-similarity plugin

2018-02-07 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16356520#comment-16356520 ] Hudson commented on NUTCH-2489: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3502 (See

[jira] [Commented] (NUTCH-2454) REST API fix for usage of hostdb in generator

2018-01-03 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2454?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16309978#comment-16309978 ] Hudson commented on NUTCH-2454: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3491 (See

[jira] [Commented] (NUTCH-2490) Sitemap processing: Sitemap index files not working

2018-01-03 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2490?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16309979#comment-16309979 ] Hudson commented on NUTCH-2490: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3491 (See

[jira] [Commented] (NUTCH-2491) Integrate sitemap processing and HostDB into crawl script

2018-01-03 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2491?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16309980#comment-16309980 ] Hudson commented on NUTCH-2491: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3491 (See

[jira] [Commented] (NUTCH-2597) NPE in updatehostdb

2018-06-21 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2597?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16519501#comment-16519501 ] Hudson commented on NUTCH-2597: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3537 (See

[jira] [Commented] (NUTCH-2600) Refactoring indexer-solr

2018-06-21 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16519503#comment-16519503 ] Hudson commented on NUTCH-2600: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3537 (See

[jira] [Commented] (NUTCH-2601) Elasticsearch Rest and Amazon CloudSearch have the same implementation class in indexer-writers.xml

2018-06-21 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2601?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16519502#comment-16519502 ] Hudson commented on NUTCH-2601: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3537 (See

[jira] [Commented] (NUTCH-2565) MergeDB incorrectly handles unfetched CrawlDatums

2018-06-21 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16519500#comment-16519500 ] Hudson commented on NUTCH-2565: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3537 (See

[jira] [Commented] (NUTCH-2222) re-fetch deletes all metadata except _csh_ and _rs_

2018-08-01 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16565805#comment-16565805 ] Hudson commented on NUTCH-: --- SUCCESS: Integrated in Jenkins build Nutch-nutchgora #1613 (See

[jira] [Commented] (NUTCH-2624) protocol-okhttp resource leak

2018-07-25 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2624?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16555714#comment-16555714 ] Hudson commented on NUTCH-2624: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3548 (See

[jira] [Commented] (NUTCH-2622) Unbundle LGPL-licensed jars from binary release

2018-07-25 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2622?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16555713#comment-16555713 ] Hudson commented on NUTCH-2622: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3548 (See

[jira] [Commented] (NUTCH-2633) Fix deprecation warnings when building Nutch master branch under JDK 10.0.2+13

2018-08-10 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16576994#comment-16576994 ] Hudson commented on NUTCH-2633: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3550 (See

[jira] [Commented] (NUTCH-2633) Fix deprecation warnings when building Nutch master branch under JDK 10.0.2+13

2018-08-17 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16583774#comment-16583774 ] Hudson commented on NUTCH-2633: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3551 (See

[jira] [Commented] (NUTCH-2632) protocol-okhttp doesn't accept proxy authentication

2018-08-17 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16583773#comment-16583773 ] Hudson commented on NUTCH-2632: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3551 (See

[jira] [Commented] (NUTCH-2621) Generate report of third-party licenses

2018-08-17 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16583772#comment-16583772 ] Hudson commented on NUTCH-2621: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3551 (See

[jira] [Commented] (NUTCH-2071) A parser failure on a single document may fail crawling job if parser.timeout=-1

2018-07-17 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2071?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16546500#comment-16546500 ] Hudson commented on NUTCH-2071: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3546 (See

[jira] [Commented] (NUTCH-1106) Options to skip url's based on length

2018-07-17 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16546499#comment-16546499 ] Hudson commented on NUTCH-1106: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3546 (See

[jira] [Commented] (NUTCH-2619) protocol-okhttp: allow to keep partially fetched docs as truncated

2018-07-19 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16549309#comment-16549309 ] Hudson commented on NUTCH-2619: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3547 (See

[jira] [Commented] (NUTCH-1993) Nutch does not use backup parsers

2018-07-19 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1993?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16549311#comment-16549311 ] Hudson commented on NUTCH-1993: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3547 (See

[jira] [Commented] (NUTCH-2152) CommonCrawl dump via Service endpoint

2018-07-19 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2152?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16549307#comment-16549307 ] Hudson commented on NUTCH-2152: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3547 (See

[jira] [Commented] (NUTCH-2616) Review routing of deletions by Exchange component

2018-07-19 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2616?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16549310#comment-16549310 ] Hudson commented on NUTCH-2616: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3547 (See

[jira] [Commented] (NUTCH-2618) protocol-okhttp not to use http.timeout for max duration to fetch document

2018-07-19 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16549308#comment-16549308 ] Hudson commented on NUTCH-2618: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3547 (See

[jira] [Commented] (NUTCH-2639) bin/nutch fails to set native library path on Cygwin causing jobs to fail with UnsatisfiedLinkError

2018-09-11 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2639?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16610305#comment-16610305 ] Hudson commented on NUTCH-2639: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3552 (See

[jira] [Commented] (NUTCH-2639) bin/nutch fails to set native library path on Cygwin causing jobs to fail with UnsatisfiedLinkError

2018-09-11 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2639?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16610292#comment-16610292 ] Hudson commented on NUTCH-2639: --- SUCCESS: Integrated in Jenkins build Nutch-nutchgora #1614 (See

[jira] [Commented] (NUTCH-2640) Typo: DbUpdaterJob: updatinging all

2018-09-11 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16610293#comment-16610293 ] Hudson commented on NUTCH-2640: --- SUCCESS: Integrated in Jenkins build Nutch-nutchgora #1614 (See

[jira] [Commented] (NUTCH-1480) SolrIndexer to write to multiple servers.

2018-07-10 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16538658#comment-16538658 ] Hudson commented on NUTCH-1480: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3543 (See

[jira] [Commented] (NUTCH-1514) Phase out the deprecated configuration properties (if possible)

2018-07-10 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16538656#comment-16538656 ] Hudson commented on NUTCH-1514: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3543 (See

[jira] [Commented] (NUTCH-1541) Indexer plugin to write CSV

2018-07-10 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1541?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16538655#comment-16538655 ] Hudson commented on NUTCH-1541: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3543 (See

[jira] [Commented] (NUTCH-2503) Add option to run tests for a single plugin

2018-01-23 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16336242#comment-16336242 ] Hudson commented on NUTCH-2503: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3498 (See

[jira] [Commented] (NUTCH-2499) Elastic REST Indexer: Duplicate values

2018-01-23 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2499?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16336240#comment-16336240 ] Hudson commented on NUTCH-2499: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3498 (See

[jira] [Commented] (NUTCH-2502) Any23 Plugin: Add Content-Type filtering

2018-01-23 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16336241#comment-16336241 ] Hudson commented on NUTCH-2502: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3498 (See

[jira] [Commented] (NUTCH-2411) Index-metadata to support indexing multiple values for a field

2018-03-08 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2411?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16391258#comment-16391258 ] Hudson commented on NUTCH-2411: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3506 (See

[jira] [Commented] (NUTCH-2535) CrawlDbReader -stats: ClassCastException

2018-03-16 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16401694#comment-16401694 ] Hudson commented on NUTCH-2535: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3509 (See

[jira] [Commented] (NUTCH-2517) mergesegs corrupts segment data

2018-03-14 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16398901#comment-16398901 ] Hudson commented on NUTCH-2517: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3508 (See

[jira] [Commented] (NUTCH-2518) Must check return value of job.waitForCompletion()

2018-04-04 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16425377#comment-16425377 ] Hudson commented on NUTCH-2518: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3515 (See

[jira] [Commented] (NUTCH-2566) Fix exception log messages

2018-04-11 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16433870#comment-16433870 ] Hudson commented on NUTCH-2566: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3517 (See

[jira] [Commented] (NUTCH-2012) Merge parsechecker and indexchecker

2018-04-11 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16433869#comment-16433869 ] Hudson commented on NUTCH-2012: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3517 (See

[jira] [Commented] (NUTCH-2533) Injector: NullPointerException if seed URL dir contains non-file entries

2018-04-12 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2533?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16435175#comment-16435175 ] Hudson commented on NUTCH-2533: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3518 (See

[jira] [Commented] (NUTCH-2533) Injector: NullPointerException if seed URL dir contains non-file entries

2018-04-12 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2533?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16435162#comment-16435162 ] Hudson commented on NUTCH-2533: --- SUCCESS: Integrated in Jenkins build Nutch-nutchgora #1605 (See

[jira] [Commented] (NUTCH-2551) NullPointerException in generator

2018-04-12 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2551?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16435660#comment-16435660 ] Hudson commented on NUTCH-2551: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3519 (See

[jira] [Commented] (NUTCH-2548) Compressed content skipped. Content of size 78 was truncated to 74

2018-04-09 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2548?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16430359#comment-16430359 ] Hudson commented on NUTCH-2548: --- SUCCESS: Integrated in Jenkins build Nutch-nutchgora #1604 (See

[jira] [Commented] (NUTCH-2539) Not correct naming of db.url.filters and db.url.normalizers in nutch-default.xml

2018-04-10 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2539?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16433201#comment-16433201 ] Hudson commented on NUTCH-2539: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3516 (See

[jira] [Commented] (NUTCH-2550) Fetcher fails to follow redirects

2018-04-10 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16433200#comment-16433200 ] Hudson commented on NUTCH-2550: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3516 (See

[jira] [Commented] (NUTCH-2552) CrawlDbReader -topN fails

2018-04-21 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2552?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16446884#comment-16446884 ] Hudson commented on NUTCH-2552: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3520 (See

[jira] [Commented] (NUTCH-2568) Caught exception is immediately rethrown

2018-04-21 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16446886#comment-16446886 ] Hudson commented on NUTCH-2568: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3520 (See

[jira] [Commented] (NUTCH-2553) Fetcher not to modify URLs to be fetched

2018-04-21 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2553?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16446885#comment-16446885 ] Hudson commented on NUTCH-2553: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3520 (See

[jira] [Commented] (NUTCH-2516) Hadoop imports use wildcards

2018-03-27 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16415762#comment-16415762 ] Hudson commented on NUTCH-2516: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3511 (See

[jira] [Commented] (NUTCH-2543) readdb & readlinkdb to implement AbstractChecker

2018-03-27 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2543?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16415763#comment-16415763 ] Hudson commented on NUTCH-2543: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3511 (See

[jira] [Commented] (NUTCH-2534) CrawlDbReader -stats: make score quantiles configurable

2018-03-27 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2534?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16415829#comment-16415829 ] Hudson commented on NUTCH-2534: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3512 (See

[jira] [Commented] (NUTCH-2447) Work-around SSLProtocolException: handshake alert: unrecognized_name

2018-03-27 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16415918#comment-16415918 ] Hudson commented on NUTCH-2447: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3513 (See

[jira] [Commented] (NUTCH-2545) Upgrade to Any23 2.2

2018-04-02 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2545?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16422757#comment-16422757 ] Hudson commented on NUTCH-2545: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3514 (See

[jira] [Commented] (NUTCH-2536) GeneratorReducer.count is a static variable

2018-03-27 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16416145#comment-16416145 ] Hudson commented on NUTCH-2536: --- SUCCESS: Integrated in Jenkins build Nutch-nutchgora #1603 (See

[jira] [Commented] (NUTCH-2523) UpdateHostDB blocks usage of plugins unintentionally

2018-03-19 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16404934#comment-16404934 ] Hudson commented on NUTCH-2523: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3510 (See

[jira] [Commented] (NUTCH-2375) Upgrade the code base from org.apache.hadoop.mapred to org.apache.hadoop.mapreduce

2018-02-27 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16379427#comment-16379427 ] Hudson commented on NUTCH-2375: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3503 (See

[jira] [Commented] (NUTCH-2519) Log mapreduce job counters in local mode

2018-03-06 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16387472#comment-16387472 ] Hudson commented on NUTCH-2519: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3504 (See

[jira] [Commented] (NUTCH-2520) Wrong Accept-Charset sent when http.accept.charset is not defined

2018-03-06 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2520?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16387473#comment-16387473 ] Hudson commented on NUTCH-2520: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3504 (See

[jira] [Commented] (NUTCH-2519) Log mapreduce job counters in local mode

2018-03-06 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16387463#comment-16387463 ] Hudson commented on NUTCH-2519: --- SUCCESS: Integrated in Jenkins build Nutch-nutchgora #1602 (See

[jira] [Commented] (NUTCH-2520) Wrong Accept-Charset sent when http.accept.charset is not defined

2018-03-06 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2520?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16387464#comment-16387464 ] Hudson commented on NUTCH-2520: --- SUCCESS: Integrated in Jenkins build Nutch-nutchgora #1602 (See

[jira] [Commented] (NUTCH-2521) SitemapProcessor to use property sitemap.redir.max

2018-03-06 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16387541#comment-16387541 ] Hudson commented on NUTCH-2521: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3505 (See

[jira] [Commented] (NUTCH-2527) URL filter: provide rules to exclude localhost and private address spaces

2018-04-26 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2527?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16453899#comment-16453899 ] Hudson commented on NUTCH-2527: --- SUCCESS: Integrated in Jenkins build Nutch-nutchgora #1607 (See

[jira] [Commented] (NUTCH-2572) HostDb: updatehostdb does not set values

2018-04-26 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2572?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16453909#comment-16453909 ] Hudson commented on NUTCH-2572: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3522 (See

[jira] [Commented] (NUTCH-2526) NPE in scoring-opic when indexing document without CrawlDb datum

2018-04-26 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16453907#comment-16453907 ] Hudson commented on NUTCH-2526: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3522 (See

[jira] [Commented] (NUTCH-2544) Nutch 1.15 no longer compatible with AWS EMR and S3

2018-04-26 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16453903#comment-16453903 ] Hudson commented on NUTCH-2544: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3522 (See

[jira] [Commented] (NUTCH-2412) Exchange component for indexing job

2018-06-28 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2412?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16526092#comment-16526092 ] Hudson commented on NUTCH-2412: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3538 (See

[jira] [Commented] (NUTCH-2602) Configuration values in the description of index writers

2018-09-27 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16630967#comment-16630967 ] Hudson commented on NUTCH-2602: --- FAILURE: Integrated in Jenkins build Nutch-trunk #3553 (See

[jira] [Commented] (NUTCH-1678) Remove dependency on org.apache.oro

2018-10-13 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1678?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16648877#comment-16648877 ] Hudson commented on NUTCH-1678: --- SUCCESS: Integrated in Jenkins build Nutch-nutchgora #1618 (See

[jira] [Commented] (NUTCH-2651) Upgrade to Tika 1.19.1 (from 1.18)

2018-10-21 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2651?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16658408#comment-16658408 ] Hudson commented on NUTCH-2651: --- FAILURE: Integrated in Jenkins build Nutch-trunk #3574 (See

<    5   6   7   8   9   10   11   12   13   14   >