[
https://issues.apache.org/jira/browse/NUTCH-2812?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2812.
Resolution: Fixed
> Methods returning array may expose internal representation
> --
[
https://issues.apache.org/jira/browse/NUTCH-1942?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-1942.
Resolution: Done
> Remove TopLevelDomain
> --
>
> Key:
[
https://issues.apache.org/jira/browse/NUTCH-1806?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-1806.
Resolution: Implemented
Thanks, everybody!
> Delegate processing of URL domains to crawler
[
https://issues.apache.org/jira/browse/NUTCH-3058?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-3058.
Resolution: Implemented
> Fetcher: counter for hung threads
> -
[
https://issues.apache.org/jira/browse/NUTCH-3059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17881792#comment-17881792
]
Sebastian Nagel commented on NUTCH-3059:
Note: the above test was run in pseudo-d
[
https://issues.apache.org/jira/browse/NUTCH-3059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17881791#comment-17881791
]
Sebastian Nagel commented on NUTCH-3059:
Ok, found the reason: it's because of
[
[
https://issues.apache.org/jira/browse/NUTCH-3061?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-3061.
Resolution: Implemented
> URL filters to log name of the rule file rules are read from
> --
[
https://issues.apache.org/jira/browse/NUTCH-3062?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-3062.
Resolution: Implemented
> protocol-okhttp: optionally record HTTP and SSL/TLS versions
> --
[
https://issues.apache.org/jira/browse/NUTCH-3065?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-3065.
Resolution: Implemented
> Format changelog as Markdown
>
>
>
[
https://issues.apache.org/jira/browse/NUTCH-3066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-3066.
Resolution: Fixed
> Protocol plugin unit tests fail randomly
>
[
https://issues.apache.org/jira/browse/NUTCH-1806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17880958#comment-17880958
]
Sebastian Nagel commented on NUTCH-1806:
> it seems odd to return an empty String
Sebastian Nagel created NUTCH-3067:
--
Summary: Improve performance of FetchItemQueues if error state is
preserved
Key: NUTCH-3067
URL: https://issues.apache.org/jira/browse/NUTCH-3067
Project: Nutch
[
https://issues.apache.org/jira/browse/NUTCH-1806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17880036#comment-17880036
]
Sebastian Nagel commented on NUTCH-1806:
Any comments on this? It's an important
[
https://issues.apache.org/jira/browse/NUTCH-3063?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-3063.
Resolution: Implemented
Committed in
[ac03cf1|https://github.com/apache/nutch/commit/ac03c
[
https://issues.apache.org/jira/browse/NUTCH-3063?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17879964#comment-17879964
]
Sebastian Nagel commented on NUTCH-3063:
+1 looks good. And definitely makes sens
[
https://issues.apache.org/jira/browse/NUTCH-3065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17879666#comment-17879666
]
Sebastian Nagel commented on NUTCH-3065:
PR in progress: the [reformatted
change
[
https://issues.apache.org/jira/browse/NUTCH-3065?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel reassigned NUTCH-3065:
--
Assignee: Sebastian Nagel
> Format changelog as Markdown
> ---
Sebastian Nagel created NUTCH-3065:
--
Summary: Format changelog as Markdown
Key: NUTCH-3065
URL: https://issues.apache.org/jira/browse/NUTCH-3065
Project: Nutch
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/NUTCH-3060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-3060:
---
Description: The link to the 1.20 Javadocs on
[https://nutch.apache.org/documentation/javadoc
[
https://issues.apache.org/jira/browse/NUTCH-3060?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17870291#comment-17870291
]
Sebastian Nagel commented on NUTCH-3060:
The missing Javadocs are now placed on s
Sebastian Nagel created NUTCH-3062:
--
Summary: protocol-okhttp: optionally record HTTP and SSL/TLS
versions
Key: NUTCH-3062
URL: https://issues.apache.org/jira/browse/NUTCH-3062
Project: Nutch
Sebastian Nagel created NUTCH-3061:
--
Summary: URL filters to log name of the rule file rules are read
from
Key: NUTCH-3061
URL: https://issues.apache.org/jira/browse/NUTCH-3061
Project: Nutch
Sebastian Nagel created NUTCH-3060:
--
Summary: Javadoc link broken on website
Key: NUTCH-3060
URL: https://issues.apache.org/jira/browse/NUTCH-3060
Project: Nutch
Issue Type: Bug
Co
[
https://issues.apache.org/jira/browse/NUTCH-3060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-3060:
---
Fix Version/s: 1.21
(was: 1.20)
> Javadoc link broken on website
> ---
Sebastian Nagel created NUTCH-3059:
--
Summary: Generator: selector job does not count reduce output
records
Key: NUTCH-3059
URL: https://issues.apache.org/jira/browse/NUTCH-3059
Project: Nutch
Sebastian Nagel created NUTCH-3058:
--
Summary: Fetcher: counter for hung threads
Key: NUTCH-3058
URL: https://issues.apache.org/jira/browse/NUTCH-3058
Project: Nutch
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/NUTCH-3055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-3055.
Resolution: Fixed
> README: fix Github "hub" commands
> -
>
[
https://issues.apache.org/jira/browse/NUTCH-3044?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-3044.
Resolution: Fixed
> Generator: NPE when extracting the host part of a URL fails
> -
[
https://issues.apache.org/jira/browse/NUTCH-3043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-3043.
Resolution: Implemented
> Generator: count URLs rejected by URL filters
> -
[
https://issues.apache.org/jira/browse/NUTCH-3039?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-3039.
Resolution: Fixed
> Failure to handle ftp:// URLs
> -
>
>
Sebastian Nagel created NUTCH-3055:
--
Summary: README: fix Github "hub" commands
Key: NUTCH-3055
URL: https://issues.apache.org/jira/browse/NUTCH-3055
Project: Nutch
Issue Type: Bug
[
https://issues.apache.org/jira/browse/NUTCH-3028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17842291#comment-17842291
]
Sebastian Nagel commented on NUTCH-3028:
+1 lgtm.
One question: if there is no p
[
https://issues.apache.org/jira/browse/NUTCH-3045?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17842284#comment-17842284
]
Sebastian Nagel commented on NUTCH-3045:
See also NUTCH-2987. Until HADOOP-17177
Sebastian Nagel created NUTCH-3044:
--
Summary: Generator: NPE when extracting the host part of a URL
fails
Key: NUTCH-3044
URL: https://issues.apache.org/jira/browse/NUTCH-3044
Project: Nutch
Sebastian Nagel created NUTCH-3043:
--
Summary: Generator: count URLs rejected by URL filters
Key: NUTCH-3043
URL: https://issues.apache.org/jira/browse/NUTCH-3043
Project: Nutch
Issue Type: I
Sebastian Nagel created NUTCH-3040:
--
Summary: Upgrade to Hadoop 3.4.0
Key: NUTCH-3040
URL: https://issues.apache.org/jira/browse/NUTCH-3040
Project: Nutch
Issue Type: Improvement
C
[
https://issues.apache.org/jira/browse/NUTCH-3039?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel reassigned NUTCH-3039:
--
Assignee: Sebastian Nagel
> Failure to handle ftp:// URLs
> --
Sebastian Nagel created NUTCH-3039:
--
Summary: Failure to handle ftp:// URLs
Key: NUTCH-3039
URL: https://issues.apache.org/jira/browse/NUTCH-3039
Project: Nutch
Issue Type: Bug
Com
[
https://issues.apache.org/jira/browse/NUTCH-2937?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2937.
Resolution: Fixed
Fixed NUTCH-2959 by using the shaded Tika package. Thanks, [~tallison]!
[
https://issues.apache.org/jira/browse/NUTCH-2937?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel reassigned NUTCH-2937:
--
Assignee: Tim Allison
> parse-tika: review dependency exclusions and avoid dependency
[
https://issues.apache.org/jira/browse/NUTCH-2937?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2937:
---
Fix Version/s: 1.20
(was: 1.21)
> parse-tika: review dependency exclus
[
https://issues.apache.org/jira/browse/NUTCH-3005?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-3005.
Resolution: Implemented
Done by [~lewismc] as part of NUTCH-3036, commit
[1563396|https://
[
https://issues.apache.org/jira/browse/NUTCH-3016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-3016.
Resolution: Duplicate
> Upgrade Apache Ivy to 2.5.2
> ---
>
>
[
https://issues.apache.org/jira/browse/NUTCH-3016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-3016:
---
Fix Version/s: 1.20
(was: 1.21)
> Upgrade Apache Ivy to 2.5.2
> --
[
https://issues.apache.org/jira/browse/NUTCH-3005?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-3005:
---
Affects Version/s: 1.19
> Upgrade selenium as needed
> --
>
>
[
https://issues.apache.org/jira/browse/NUTCH-3005?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-3005:
---
Fix Version/s: 1.20
> Upgrade selenium as needed
> --
>
>
[
https://issues.apache.org/jira/browse/NUTCH-3028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-3028:
---
Affects Version/s: 1.19
> WARCExported to support filtering by JEXL
> ---
[
https://issues.apache.org/jira/browse/NUTCH-3028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-3028:
---
Fix Version/s: 1.21
> WARCExported to support filtering by JEXL
> ---
[
https://issues.apache.org/jira/browse/NUTCH-2960?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2960.
Resolution: Won't Fix
The license issue is addressed by NUTCH-3008.
> indexer-elastic: rem
[
https://issues.apache.org/jira/browse/NUTCH-2960?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel closed NUTCH-2960.
--
> indexer-elastic: remove plugin from binary package to address licensing issues
>
[
https://issues.apache.org/jira/browse/NUTCH-2960?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2960:
---
Fix Version/s: (was: 1.20)
> indexer-elastic: remove plugin from binary package to addres
[
https://issues.apache.org/jira/browse/NUTCH-3008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-3008.
Resolution: Fixed
> indexer-elastic: downgrade to ES 7.10.2 to address licensing issues
> -
[
https://issues.apache.org/jira/browse/NUTCH-3029?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-3029.
Resolution: Implemented
> Host specific max. and min. intervals in adaptive scheduler
> ---
[
https://issues.apache.org/jira/browse/NUTCH-3029?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel closed NUTCH-3029.
--
> Host specific max. and min. intervals in adaptive scheduler
> ---
[
https://issues.apache.org/jira/browse/NUTCH-3029?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel reopened NUTCH-3029:
Assignee: Sebastian Nagel (was: Markus Jelsma)
Reopen to update "Fix version(s)" - add 1
[
https://issues.apache.org/jira/browse/NUTCH-3029?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-3029:
---
Fix Version/s: 1.20
> Host specific max. and min. intervals in adaptive scheduler
> -
Sebastian Nagel created NUTCH-3035:
--
Summary: Update license and notice file for release of 1.20
Key: NUTCH-3035
URL: https://issues.apache.org/jira/browse/NUTCH-3035
Project: Nutch
Issue T
[
https://issues.apache.org/jira/browse/NUTCH-3025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-3025.
Resolution: Implemented
> urlfilter-fast to filter based on the length of the URL
> ---
[
https://issues.apache.org/jira/browse/NUTCH-3025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-3025:
---
Component/s: plugin
urlfilter
> urlfilter-fast to filter based on the length
[
https://issues.apache.org/jira/browse/NUTCH-3017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17784030#comment-17784030
]
Sebastian Nagel commented on NUTCH-3017:
Thanks, [~jnioche]
> Allow fast-urlfilt
[
https://issues.apache.org/jira/browse/NUTCH-3017?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-3017.
Resolution: Implemented
> Allow fast-urlfilter to load from HDFS/S3 and support gzipped inp
[
https://issues.apache.org/jira/browse/NUTCH-3017?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-3017:
---
Component/s: plugin
urlfilter
> Allow fast-urlfilter to load from HDFS/S3 an
[
https://issues.apache.org/jira/browse/NUTCH-3017?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-3017:
---
Fix Version/s: 1.20
> Allow fast-urlfilter to load from HDFS/S3 and support gzipped input
> -
[
https://issues.apache.org/jira/browse/NUTCH-3012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-3012.
Resolution: Fixed
> SegmentReader when dumping with option -recode: NPE on unparsed documen
[
https://issues.apache.org/jira/browse/NUTCH-3011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-3011.
Resolution: Implemented
> HttpRobotRulesParser: handle HTTP 429 Too Many Requests same as s
[
https://issues.apache.org/jira/browse/NUTCH-2990?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2990.
Resolution: Implemented
Thanks, everybody!
> HttpRobotRulesParser to follow 5 redirects as
[
https://issues.apache.org/jira/browse/NUTCH-3009?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel reassigned NUTCH-3009:
--
Assignee: Sebastian Nagel
> Upgrade to Hadoop 3.3.6
> ---
>
>
[
https://issues.apache.org/jira/browse/NUTCH-3009?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-3009.
Resolution: Implemented
> Upgrade to Hadoop 3.3.6
> ---
>
>
[
https://issues.apache.org/jira/browse/NUTCH-3006?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-3006.
Fix Version/s: (was: 1.20)
Resolution: Abandoned
> Downgrade Tika dependency to
[
https://issues.apache.org/jira/browse/NUTCH-3002?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel reassigned NUTCH-3002:
--
Assignee: Sebastian Nagel
> Protocol-okhttp HttpResponse: HTTP header metadata lookup
[
https://issues.apache.org/jira/browse/NUTCH-3002?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-3002.
Resolution: Fixed
> Protocol-okhttp HttpResponse: HTTP header metadata lookup should be
>
[
https://issues.apache.org/jira/browse/NUTCH-3014?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17778103#comment-17778103
]
Sebastian Nagel commented on NUTCH-3014:
If there is a single data name/directory
[
https://issues.apache.org/jira/browse/NUTCH-3012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-3012:
---
Description:
SegmentReader when called with the flag {{-recode}} fails with a NPE when
tryin
[
https://issues.apache.org/jira/browse/NUTCH-3012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-3012:
---
Summary: SegmentReader when dumping with option -recode: NPE on unparsed
documents (was: Seg
Sebastian Nagel created NUTCH-3012:
--
Summary: SegmentReader when dumping with option -recode: NPE on
documents without charset defined
Key: NUTCH-3012
URL: https://issues.apache.org/jira/browse/NUTCH-3012
[
https://issues.apache.org/jira/browse/NUTCH-2959?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17771445#comment-17771445
]
Sebastian Nagel commented on NUTCH-2959:
Hi [~tallison], it's your decision wheth
[
https://issues.apache.org/jira/browse/NUTCH-1130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-1130.
Resolution: Won't Do
Closing - the any23 project has retired and the any23 plugin was remov
[
https://issues.apache.org/jira/browse/NUTCH-1130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel closed NUTCH-1130.
--
> JUnit test for Any23 RDF plugin
> ---
>
> Key: NUTCH-
[
https://issues.apache.org/jira/browse/NUTCH-2938?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2938.
Resolution: Won't Do
Closing - the any23 project has retired and the any23 plugin was remov
[
https://issues.apache.org/jira/browse/NUTCH-2938?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel closed NUTCH-2938.
--
> Use Any23's RepositoryWriter to write structured data to Rdf4j repository
> -
[
https://issues.apache.org/jira/browse/NUTCH-2938?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2938:
---
Fix Version/s: (was: 1.20)
> Use Any23's RepositoryWriter to write structured data to Rdf
[
https://issues.apache.org/jira/browse/NUTCH-2853?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2853.
Resolution: Fixed
> bin/nutch: remove deprecated commands solrindex, solrdedup, solrclean
>
[
https://issues.apache.org/jira/browse/NUTCH-2897?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2897.
Resolution: Fixed
> Do not supress deprecated API warnings
> --
[
https://issues.apache.org/jira/browse/NUTCH-3010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-3010.
Resolution: Fixed
> Injector: count unique number of injected URLs
> --
Sebastian Nagel created NUTCH-3011:
--
Summary: HttpRobotRulesParser: handle HTTP 429 Too Many Requests
same as server errors (HTTP 5xx)
Key: NUTCH-3011
URL: https://issues.apache.org/jira/browse/NUTCH-3011
[
https://issues.apache.org/jira/browse/NUTCH-1373?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel closed NUTCH-1373.
--
> Implement consistent execution of normalising and filtering in Generator
> --
[
https://issues.apache.org/jira/browse/NUTCH-1373?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-1373.
Resolution: Abandoned
Closing as Nutch 2.x (aka. nutchgora) isn't maintained anymore.
> Im
[
https://issues.apache.org/jira/browse/NUTCH-1374?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17770833#comment-17770833
]
Sebastian Nagel commented on NUTCH-1374:
The package.html files were replaced by
[
https://issues.apache.org/jira/browse/NUTCH-1635?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17770831#comment-17770831
]
Sebastian Nagel commented on NUTCH-1635:
Hi [~markus17], did this continue to hap
[
https://issues.apache.org/jira/browse/NUTCH-1947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-1947.
Resolution: Abandoned
Closing because OutlinkExtractor has seen many updates since then: up
[
https://issues.apache.org/jira/browse/NUTCH-1947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel closed NUTCH-1947.
--
> Overhaul o.a.n.parse.OutlinkExtractor.java
> ---
>
>
[
https://issues.apache.org/jira/browse/NUTCH-2053?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2053.
Resolution: Abandoned
Closing this old issue (8 years), assuming that dependencies have bee
[
https://issues.apache.org/jira/browse/NUTCH-2053?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel closed NUTCH-2053.
--
> Uncessary dependencies included in ivy.xml (post NUTCH-2038)
> --
[
https://issues.apache.org/jira/browse/NUTCH-2423?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2423.
Fix Version/s: (was: 1.20)
Resolution: Fixed
The wiki pages were updated in 2020
[
https://issues.apache.org/jira/browse/NUTCH-2820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2820.
Resolution: Resolved
Resolved with the removal of the any23 plugin (NUTCH-2998).
> Review
[
https://issues.apache.org/jira/browse/NUTCH-2888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2888.
Resolution: Duplicate
Thanks, [~mmkivist]! This issue was resolved by NUTCH-2980 and will b
[
https://issues.apache.org/jira/browse/NUTCH-2888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2888:
---
Affects Version/s: 1.18
> Selenium Protocol: Support for Selenium 4
> ---
[
https://issues.apache.org/jira/browse/NUTCH-2888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2888:
---
Fix Version/s: 1.20
> Selenium Protocol: Support for Selenium 4
> ---
[
https://issues.apache.org/jira/browse/NUTCH-3007?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-3007.
Resolution: Fixed
Thanks for the review, [~markus17]!
> Fix impossible casts
> ---
[
https://issues.apache.org/jira/browse/NUTCH-2852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2852.
Resolution: Fixed
> Method invokes System.exit(...) 9 bugs
> --
1 - 100 of 3450 matches
Mail list logo