Jenkins build is back to normal : Nutch » Nutch-trunk #109

2023-08-30 Thread Apache Jenkins Server
See 




[jira] [Commented] (NUTCH-2999) Update Lucene version to latest 8.x

2023-08-30 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/NUTCH-2999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17760606#comment-17760606
 ] 

ASF GitHub Bot commented on NUTCH-2999:
---

tballison commented on PR #771:
URL: https://github.com/apache/nutch/pull/771#issuecomment-1699690920

   Apologies for the noise!




> Update Lucene version to latest 8.x
> ---
>
> Key: NUTCH-2999
> URL: https://issues.apache.org/jira/browse/NUTCH-2999
> Project: Nutch
>  Issue Type: Task
>Reporter: Tim Allison
>Priority: Minor
> Fix For: 1.20
>
>
> It may be the way that I'm loading the project, but, for me, Intellij really 
> does not like the Lucene version conflict between {{scoring-similarity}} and 
> the OpenSearch/Elasticsearch modules.
> Can we bump Lucene to the latest 8.11.2 throughout?
> PR for review incoming.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[GitHub] [nutch] tballison commented on pull request #771: NUTCH-2999 fix for initial PR

2023-08-30 Thread via GitHub


tballison commented on PR #771:
URL: https://github.com/apache/nutch/pull/771#issuecomment-1699690920

   Apologies for the noise!


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@nutch.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[jira] [Commented] (NUTCH-2999) Update Lucene version to latest 8.x

2023-08-30 Thread Hudson (Jira)


[ 
https://issues.apache.org/jira/browse/NUTCH-2999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17760595#comment-17760595
 ] 

Hudson commented on NUTCH-2999:
---

FAILURE: Integrated in Jenkins build Nutch » Nutch-trunk #108 (See 
[https://ci-builds.apache.org/job/Nutch/job/Nutch-trunk/108/])
NUTCH-2999 -- upgrade lucene to latest 8.x throughout (tallison: 
[https://github.com/apache/nutch/commit/8d9c77fd1b044f7c8fc51b70e34321cb9260cfbb])
* (add) src/plugin/indexer-opensearch-1x/howto_upgrade_opensearch.txt
* (edit) src/plugin/indexer-opensearch-1x/plugin.xml
* (edit) src/plugin/indexer-elastic/plugin.xml
* (edit) src/plugin/indexer-opensearch-1x/ivy.xml
* (edit) src/plugin/indexer-elastic/ivy.xml


> Update Lucene version to latest 8.x
> ---
>
> Key: NUTCH-2999
> URL: https://issues.apache.org/jira/browse/NUTCH-2999
> Project: Nutch
>  Issue Type: Task
>Reporter: Tim Allison
>Priority: Minor
> Fix For: 1.20
>
>
> It may be the way that I'm loading the project, but, for me, Intellij really 
> does not like the Lucene version conflict between {{scoring-similarity}} and 
> the OpenSearch/Elasticsearch modules.
> Can we bump Lucene to the latest 8.11.2 throughout?
> PR for review incoming.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


Build failed in Jenkins: Nutch » Nutch-trunk #108

2023-08-30 Thread Apache Jenkins Server
See 


Changes:

[Tim Allison] NUTCH-2999 -- upgrade lucene to latest 8.x throughout


--
[...truncated 810.27 KB...]
deps-test:

deploy:

copy-generated-lib:

test:
 [echo] Testing plugin: urlnormalizer-ajax

jar:

deps-test:

deploy:

copy-generated-lib:

test:
 [echo] Testing plugin: urlnormalizer-basic
[junit] Running 
org.apache.nutch.net.urlnormalizer.basic.TestBasicURLNormalizer
[junit] Running 
org.apache.nutch.net.urlnormalizer.ajax.TestAjaxURLNormalizer
[junit] Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 
0.13 sec

init:

init-plugin:

deps-jar:

clean-lib:

resolve-default:
[ivy:resolve] :: loading settings :: file = 


compile:
 [echo] Compiling plugin: urlnormalizer-host

deps-test-compile:

compile-test:
[javac] Compiling 1 source file to 

[junit] Tests run: 6, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 
0.474 sec

init:

init-plugin:

deps-jar:

clean-lib:

resolve-default:
[ivy:resolve] :: loading settings :: file = 


compile:
 [echo] Compiling plugin: urlnormalizer-pass

deps-test-compile:

compile-test:
[javac] Compiling 1 source file to 


jar:

deps-test:

deploy:

copy-generated-lib:

test:
 [echo] Testing plugin: urlnormalizer-host

jar:

deps-test:

deploy:

copy-generated-lib:

test:
 [echo] Testing plugin: urlnormalizer-pass
[junit] Running 
org.apache.nutch.net.urlnormalizer.host.TestHostURLNormalizer
[junit] Running 
org.apache.nutch.net.urlnormalizer.pass.TestPassURLNormalizer
[junit] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 
0.312 sec

init:

init-plugin:

deps-jar:

clean-lib:

resolve-default:
[ivy:resolve] :: loading settings :: file = 


compile:
 [echo] Compiling plugin: urlnormalizer-protocol

deps-test-compile:

compile-test:
[javac] Compiling 1 source file to 

[junit] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 
0.548 sec

init:

init-plugin:

deps-jar:

clean-lib:

resolve-default:
[ivy:resolve] :: loading settings :: file = 


compile:
 [echo] Compiling plugin: urlnormalizer-querystring

deps-test-compile:

compile-test:
[javac] Compiling 1 source file to 


jar:

deps-test:

deploy:

copy-generated-lib:

test:
 [echo] Testing plugin: urlnormalizer-protocol

jar:

deps-test:

deploy:

copy-generated-lib:

test:
 [echo] Testing plugin: urlnormalizer-querystring
[junit] Running 
org.apache.nutch.net.urlnormalizer.querystring.TestQuerystringURLNormalizer
[junit] Running 
org.apache.nutch.net.urlnormalizer.protocol.TestProtocolURLNormalizer
[junit] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 
0.28 sec

init:

init-plugin:

deps-jar:

clean-lib:

resolve-default:
[ivy:resolve] :: loading settings :: file = 


compile:
 [echo] Compiling plugin: urlnormalizer-regex

deps-test-compile:

compile-test:
[javac] Compiling 1 source file to 


jar:

deps-test:

init:

init-plugin:

clean-lib:

resolve-default:
[ivy:resolve] :: loading settings :: file = 


compile:

jar:

deps-test:

deploy:

copy-generated-lib:

deploy:

copy-generated-lib:

test:
 [echo] Testing plugin: urlnormalizer-regex
[junit] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 
1.079 sec

init:

init-plugin:

deps-jar:

clean-lib:

resolve-default:
[ivy:resolve] :: loading settings :: file = 


compile:
 [echo] Compiling plugin: urlnormalizer-slash

deps-test-compile:

compile-test:
[javac] Compiling 1 source file to 


jar:

deps-test:

deploy:

copy-generated-lib:

test:
 [echo] Testing plugin: urlnormalizer-slash
[junit] Running 
org.apache.nutch.net.urlnormalizer.regex.TestRegexURLNormalizer
[junit] Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time 

[jira] [Resolved] (NUTCH-2999) Update Lucene version to latest 8.x

2023-08-30 Thread Tim Allison (Jira)


 [ 
https://issues.apache.org/jira/browse/NUTCH-2999?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Allison resolved NUTCH-2999.

Resolution: Fixed

Updated PR should have fixed that issue.  Would be nice to add testcontainers 
containerized ES and OpenSearch for unit tests.  One day...

> Update Lucene version to latest 8.x
> ---
>
> Key: NUTCH-2999
> URL: https://issues.apache.org/jira/browse/NUTCH-2999
> Project: Nutch
>  Issue Type: Task
>Reporter: Tim Allison
>Priority: Minor
> Fix For: 1.20
>
>
> It may be the way that I'm loading the project, but, for me, Intellij really 
> does not like the Lucene version conflict between {{scoring-similarity}} and 
> the OpenSearch/Elasticsearch modules.
> Can we bump Lucene to the latest 8.11.2 throughout?
> PR for review incoming.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Commented] (NUTCH-2999) Update Lucene version to latest 8.x

2023-08-30 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/NUTCH-2999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17760585#comment-17760585
 ] 

ASF GitHub Bot commented on NUTCH-2999:
---

tballison merged PR #771:
URL: https://github.com/apache/nutch/pull/771




> Update Lucene version to latest 8.x
> ---
>
> Key: NUTCH-2999
> URL: https://issues.apache.org/jira/browse/NUTCH-2999
> Project: Nutch
>  Issue Type: Task
>Reporter: Tim Allison
>Priority: Minor
> Fix For: 1.20
>
>
> It may be the way that I'm loading the project, but, for me, Intellij really 
> does not like the Lucene version conflict between {{scoring-similarity}} and 
> the OpenSearch/Elasticsearch modules.
> Can we bump Lucene to the latest 8.11.2 throughout?
> PR for review incoming.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[GitHub] [nutch] tballison merged pull request #771: NUTCH-2999 fix for initial PR

2023-08-30 Thread via GitHub


tballison merged PR #771:
URL: https://github.com/apache/nutch/pull/771


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@nutch.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[jira] [Commented] (NUTCH-2999) Update Lucene version to latest 8.x

2023-08-30 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/NUTCH-2999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17760579#comment-17760579
 ] 

ASF GitHub Bot commented on NUTCH-2999:
---

tballison opened a new pull request, #771:
URL: https://github.com/apache/nutch/pull/771

   Thanks for your contribution to [Apache Nutch](https://nutch.apache.org/)! 
Your help is appreciated!
   
   Before opening the pull request, please verify that
   * there is an open issue on the [Nutch issue 
tracker](https://issues.apache.org/jira/projects/NUTCH) which describes the 
problem or the improvement. We cannot accept pull requests without an issue 
because the change wouldn't be listed in the release notes.
   * the issue ID (`NUTCH-`)
 - is referenced in the title of the pull request
 - and placed in front of your commit messages surrounded by square 
brackets (`[NUTCH-] Issue or pull request title`)
   * commits are squashed into a single one (or few commits for larger changes)
   * Java source code follows [Nutch Eclipse Code Formatting 
rules](https://github.com/apache/nutch/blob/master/eclipse-codeformat.xml)
   * Nutch is successfully built and unit tests pass by running `ant clean 
runtime test`
   * there should be no conflicts when merging the pull request branch into the 
*recent* master branch. If there are conflicts, please try to rebase the pull 
request branch on top of a freshly pulled master branch.
   * if new dependencies are added,
 - are these dependencies licensed in a way that is compatible for 
inclusion under [ASF 
2.0](https://www.apache.org/legal/resolved.html#category-a)?
 - are `LICENSE-binary` and `NOTICE-binary` updated accordingly?
   
   We will be able to faster integrate your pull request if these conditions 
are met. If you have any questions how to fix your problem or about using Nutch 
in general, please sign up for the [Nutch mailing 
list](https://nutch.apache.org/mailing_lists.html). Thanks!
   




> Update Lucene version to latest 8.x
> ---
>
> Key: NUTCH-2999
> URL: https://issues.apache.org/jira/browse/NUTCH-2999
> Project: Nutch
>  Issue Type: Task
>Reporter: Tim Allison
>Priority: Minor
> Fix For: 1.20
>
>
> It may be the way that I'm loading the project, but, for me, Intellij really 
> does not like the Lucene version conflict between {{scoring-similarity}} and 
> the OpenSearch/Elasticsearch modules.
> Can we bump Lucene to the latest 8.11.2 throughout?
> PR for review incoming.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[GitHub] [nutch] tballison opened a new pull request, #771: NUTCH-2999 fix for initial PR

2023-08-30 Thread via GitHub


tballison opened a new pull request, #771:
URL: https://github.com/apache/nutch/pull/771

   Thanks for your contribution to [Apache Nutch](https://nutch.apache.org/)! 
Your help is appreciated!
   
   Before opening the pull request, please verify that
   * there is an open issue on the [Nutch issue 
tracker](https://issues.apache.org/jira/projects/NUTCH) which describes the 
problem or the improvement. We cannot accept pull requests without an issue 
because the change wouldn't be listed in the release notes.
   * the issue ID (`NUTCH-`)
 - is referenced in the title of the pull request
 - and placed in front of your commit messages surrounded by square 
brackets (`[NUTCH-] Issue or pull request title`)
   * commits are squashed into a single one (or few commits for larger changes)
   * Java source code follows [Nutch Eclipse Code Formatting 
rules](https://github.com/apache/nutch/blob/master/eclipse-codeformat.xml)
   * Nutch is successfully built and unit tests pass by running `ant clean 
runtime test`
   * there should be no conflicts when merging the pull request branch into the 
*recent* master branch. If there are conflicts, please try to rebase the pull 
request branch on top of a freshly pulled master branch.
   * if new dependencies are added,
 - are these dependencies licensed in a way that is compatible for 
inclusion under [ASF 
2.0](https://www.apache.org/legal/resolved.html#category-a)?
 - are `LICENSE-binary` and `NOTICE-binary` updated accordingly?
   
   We will be able to faster integrate your pull request if these conditions 
are met. If you have any questions how to fix your problem or about using Nutch 
in general, please sign up for the [Nutch mailing 
list](https://nutch.apache.org/mailing_lists.html). Thanks!
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@nutch.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[jira] [Commented] (NUTCH-2999) Update Lucene version to latest 8.x

2023-08-30 Thread Hudson (Jira)


[ 
https://issues.apache.org/jira/browse/NUTCH-2999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17760574#comment-17760574
 ] 

Hudson commented on NUTCH-2999:
---

SUCCESS: Integrated in Jenkins build Nutch » Nutch-trunk #107 (See 
[https://ci-builds.apache.org/job/Nutch/job/Nutch-trunk/107/])
NUTCH-2999 -- upgrade Lucene to latest 8.x throughout (tallison: 
[https://github.com/apache/nutch/commit/3bb8b0eeb90f7ba1304ef807cf87f28d0a6341f5])
* (edit) src/plugin/indexer-opensearch-1x/plugin.xml
* (edit) 
src/plugin/scoring-similarity/src/java/org/apache/nutch/scoring/similarity/util/LuceneAnalyzerUtil.java
* (edit) src/plugin/parsefilter-naivebayes/ivy.xml
* (edit) src/plugin/parsefilter-naivebayes/plugin.xml
* (edit) src/plugin/scoring-similarity/ivy.xml
* (edit) src/plugin/indexer-elastic/plugin.xml
* (edit) 
src/plugin/scoring-similarity/src/java/org/apache/nutch/scoring/similarity/util/LuceneTokenizer.java
* (edit) src/plugin/scoring-similarity/plugin.xml


> Update Lucene version to latest 8.x
> ---
>
> Key: NUTCH-2999
> URL: https://issues.apache.org/jira/browse/NUTCH-2999
> Project: Nutch
>  Issue Type: Task
>Reporter: Tim Allison
>Priority: Minor
> Fix For: 1.20
>
>
> It may be the way that I'm loading the project, but, for me, Intellij really 
> does not like the Lucene version conflict between {{scoring-similarity}} and 
> the OpenSearch/Elasticsearch modules.
> Can we bump Lucene to the latest 8.11.2 throughout?
> PR for review incoming.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Reopened] (NUTCH-2999) Update Lucene version to latest 8.x

2023-08-30 Thread Tim Allison (Jira)


 [ 
https://issues.apache.org/jira/browse/NUTCH-2999?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Allison reopened NUTCH-2999:


The applied PR breaks the lucene-based indexers.

> Update Lucene version to latest 8.x
> ---
>
> Key: NUTCH-2999
> URL: https://issues.apache.org/jira/browse/NUTCH-2999
> Project: Nutch
>  Issue Type: Task
>Reporter: Tim Allison
>Priority: Minor
> Fix For: 1.20
>
>
> It may be the way that I'm loading the project, but, for me, Intellij really 
> does not like the Lucene version conflict between {{scoring-similarity}} and 
> the OpenSearch/Elasticsearch modules.
> Can we bump Lucene to the latest 8.11.2 throughout?
> PR for review incoming.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Commented] (NUTCH-2978) Move to slf4j2 and remove log4j1 and reload4j

2023-08-30 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/NUTCH-2978?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17760550#comment-17760550
 ] 

ASF GitHub Bot commented on NUTCH-2978:
---

tballison closed pull request #769: NUTCH-2978 

> Move to slf4j2 and remove log4j1 and reload4j
> -
>
> Key: NUTCH-2978
> URL: https://issues.apache.org/jira/browse/NUTCH-2978
> Project: Nutch
>  Issue Type: Task
>Reporter: Markus Jelsma
>Priority: Major
> Attachments: NUTCH-2978-1.patch, NUTCH-2978-2.patch, 
> NUTCH-2978-3.patch, NUTCH-2978-any23.patch, NUTCH-2978.patch
>
>
> I got in trouble upgrading some dependencies and got a lot of LinkageErrors 
> today, or with a Tika upgrade, disappearing logs. This patch fixes that by 
> moving to slf4j2, using the corrent log4j2-slfj4-impl2 and getting rid of old 
> log4j -> reload4j.
>  
> This patch fixes it.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[GitHub] [nutch] tballison closed pull request #769: NUTCH-2978 -- move to log4j2 logging throughout

2023-08-30 Thread via GitHub


tballison closed pull request #769: NUTCH-2978 -- move to log4j2 logging 
throughout 
URL: https://github.com/apache/nutch/pull/769


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@nutch.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[jira] [Resolved] (NUTCH-2961) Upgrade dependencies of parsefilter-naivebayes

2023-08-30 Thread Tim Allison (Jira)


 [ 
https://issues.apache.org/jira/browse/NUTCH-2961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Allison resolved NUTCH-2961.

Resolution: Fixed

I confirmed we can simply remove those dependencies.  I fixed this as part of 
NUTCH-2999

> Upgrade dependencies of parsefilter-naivebayes
> --
>
> Key: NUTCH-2961
> URL: https://issues.apache.org/jira/browse/NUTCH-2961
> Project: Nutch
>  Issue Type: Improvement
>Affects Versions: 1.18
>Reporter: Sebastian Nagel
>Priority: Major
> Fix For: 1.20
>
>
> The dependencies (Mahout 0.9, Lucene 5.5.0) of parsefilter-naivebayes date 
> back to 2016/2017 and may need an upgrade.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Resolved] (NUTCH-2999) Update Lucene version to latest 8.x

2023-08-30 Thread Tim Allison (Jira)


 [ 
https://issues.apache.org/jira/browse/NUTCH-2999?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Allison resolved NUTCH-2999.

Fix Version/s: 1.20
   Resolution: Fixed

Thank you [~markus17] for the review!

> Update Lucene version to latest 8.x
> ---
>
> Key: NUTCH-2999
> URL: https://issues.apache.org/jira/browse/NUTCH-2999
> Project: Nutch
>  Issue Type: Task
>Reporter: Tim Allison
>Priority: Minor
> Fix For: 1.20
>
>
> It may be the way that I'm loading the project, but, for me, Intellij really 
> does not like the Lucene version conflict between {{scoring-similarity}} and 
> the OpenSearch/Elasticsearch modules.
> Can we bump Lucene to the latest 8.11.2 throughout?
> PR for review incoming.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Commented] (NUTCH-2999) Update Lucene version to latest 8.x

2023-08-30 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/NUTCH-2999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17760537#comment-17760537
 ] 

ASF GitHub Bot commented on NUTCH-2999:
---

tballison merged PR #770:
URL: https://github.com/apache/nutch/pull/770




> Update Lucene version to latest 8.x
> ---
>
> Key: NUTCH-2999
> URL: https://issues.apache.org/jira/browse/NUTCH-2999
> Project: Nutch
>  Issue Type: Task
>Reporter: Tim Allison
>Priority: Minor
>
> It may be the way that I'm loading the project, but, for me, Intellij really 
> does not like the Lucene version conflict between {{scoring-similarity}} and 
> the OpenSearch/Elasticsearch modules.
> Can we bump Lucene to the latest 8.11.2 throughout?
> PR for review incoming.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[GitHub] [nutch] tballison merged pull request #770: NUTCH-2999 Upgrade Lucene to latest 8.x version throughout

2023-08-30 Thread via GitHub


tballison merged PR #770:
URL: https://github.com/apache/nutch/pull/770


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@nutch.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[jira] [Commented] (NUTCH-2999) Update Lucene version to latest 8.x

2023-08-30 Thread Markus Jelsma (Jira)


[ 
https://issues.apache.org/jira/browse/NUTCH-2999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17760522#comment-17760522
 ] 

Markus Jelsma commented on NUTCH-2999:
--

Seems fine +1

> Update Lucene version to latest 8.x
> ---
>
> Key: NUTCH-2999
> URL: https://issues.apache.org/jira/browse/NUTCH-2999
> Project: Nutch
>  Issue Type: Task
>Reporter: Tim Allison
>Priority: Minor
>
> It may be the way that I'm loading the project, but, for me, Intellij really 
> does not like the Lucene version conflict between {{scoring-similarity}} and 
> the OpenSearch/Elasticsearch modules.
> Can we bump Lucene to the latest 8.11.2 throughout?
> PR for review incoming.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Commented] (NUTCH-2999) Update Lucene version to latest 8.x

2023-08-30 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/NUTCH-2999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17760514#comment-17760514
 ] 

ASF GitHub Bot commented on NUTCH-2999:
---

tballison opened a new pull request, #770:
URL: https://github.com/apache/nutch/pull/770

   Thanks for your contribution to [Apache Nutch](https://nutch.apache.org/)! 
Your help is appreciated!
   
   Before opening the pull request, please verify that
   * there is an open issue on the [Nutch issue 
tracker](https://issues.apache.org/jira/projects/NUTCH) which describes the 
problem or the improvement. We cannot accept pull requests without an issue 
because the change wouldn't be listed in the release notes.
   * the issue ID (`NUTCH-`)
 - is referenced in the title of the pull request
 - and placed in front of your commit messages surrounded by square 
brackets (`[NUTCH-] Issue or pull request title`)
   * commits are squashed into a single one (or few commits for larger changes)
   * Java source code follows [Nutch Eclipse Code Formatting 
rules](https://github.com/apache/nutch/blob/master/eclipse-codeformat.xml)
   * Nutch is successfully built and unit tests pass by running `ant clean 
runtime test`
   * there should be no conflicts when merging the pull request branch into the 
*recent* master branch. If there are conflicts, please try to rebase the pull 
request branch on top of a freshly pulled master branch.
   * if new dependencies are added,
 - are these dependencies licensed in a way that is compatible for 
inclusion under [ASF 
2.0](https://www.apache.org/legal/resolved.html#category-a)?
 - are `LICENSE-binary` and `NOTICE-binary` updated accordingly?
   
   We will be able to faster integrate your pull request if these conditions 
are met. If you have any questions how to fix your problem or about using Nutch 
in general, please sign up for the [Nutch mailing 
list](https://nutch.apache.org/mailing_lists.html). Thanks!
   




> Update Lucene version to latest 8.x
> ---
>
> Key: NUTCH-2999
> URL: https://issues.apache.org/jira/browse/NUTCH-2999
> Project: Nutch
>  Issue Type: Task
>Reporter: Tim Allison
>Priority: Minor
>
> It may be the way that I'm loading the project, but, for me, Intellij really 
> does not like the Lucene version conflict between {{scoring-similarity}} and 
> the OpenSearch/Elasticsearch modules.
> Can we bump Lucene to the latest 8.11.2 throughout?
> PR for review incoming.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[GitHub] [nutch] tballison opened a new pull request, #770: NUTCH-2999 Upgrade Lucene to latest 8.x version throughout

2023-08-30 Thread via GitHub


tballison opened a new pull request, #770:
URL: https://github.com/apache/nutch/pull/770

   Thanks for your contribution to [Apache Nutch](https://nutch.apache.org/)! 
Your help is appreciated!
   
   Before opening the pull request, please verify that
   * there is an open issue on the [Nutch issue 
tracker](https://issues.apache.org/jira/projects/NUTCH) which describes the 
problem or the improvement. We cannot accept pull requests without an issue 
because the change wouldn't be listed in the release notes.
   * the issue ID (`NUTCH-`)
 - is referenced in the title of the pull request
 - and placed in front of your commit messages surrounded by square 
brackets (`[NUTCH-] Issue or pull request title`)
   * commits are squashed into a single one (or few commits for larger changes)
   * Java source code follows [Nutch Eclipse Code Formatting 
rules](https://github.com/apache/nutch/blob/master/eclipse-codeformat.xml)
   * Nutch is successfully built and unit tests pass by running `ant clean 
runtime test`
   * there should be no conflicts when merging the pull request branch into the 
*recent* master branch. If there are conflicts, please try to rebase the pull 
request branch on top of a freshly pulled master branch.
   * if new dependencies are added,
 - are these dependencies licensed in a way that is compatible for 
inclusion under [ASF 
2.0](https://www.apache.org/legal/resolved.html#category-a)?
 - are `LICENSE-binary` and `NOTICE-binary` updated accordingly?
   
   We will be able to faster integrate your pull request if these conditions 
are met. If you have any questions how to fix your problem or about using Nutch 
in general, please sign up for the [Nutch mailing 
list](https://nutch.apache.org/mailing_lists.html). Thanks!
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@nutch.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[jira] [Commented] (NUTCH-2999) Update Lucene version to latest 8.x

2023-08-30 Thread Tim Allison (Jira)


[ 
https://issues.apache.org/jira/browse/NUTCH-2999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17760512#comment-17760512
 ] 

Tim Allison commented on NUTCH-2999:


This PR also takes care of NUTCH-2961

> Update Lucene version to latest 8.x
> ---
>
> Key: NUTCH-2999
> URL: https://issues.apache.org/jira/browse/NUTCH-2999
> Project: Nutch
>  Issue Type: Task
>Reporter: Tim Allison
>Priority: Minor
>
> It may be the way that I'm loading the project, but, for me, Intellij really 
> does not like the Lucene version conflict between {{scoring-similarity}} and 
> the OpenSearch/Elasticsearch modules.
> Can we bump Lucene to the latest 8.11.2 throughout?
> PR for review incoming.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Commented] (NUTCH-2999) Update Lucene version to latest 8.x

2023-08-30 Thread Tim Allison (Jira)


[ 
https://issues.apache.org/jira/browse/NUTCH-2999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17760511#comment-17760511
 ] 

Tim Allison commented on NUTCH-2999:


https://github.com/apache/nutch/pull/770

> Update Lucene version to latest 8.x
> ---
>
> Key: NUTCH-2999
> URL: https://issues.apache.org/jira/browse/NUTCH-2999
> Project: Nutch
>  Issue Type: Task
>Reporter: Tim Allison
>Priority: Minor
>
> It may be the way that I'm loading the project, but, for me, Intellij really 
> does not like the Lucene version conflict between {{scoring-similarity}} and 
> the OpenSearch/Elasticsearch modules.
> Can we bump Lucene to the latest 8.11.2 throughout?
> PR for review incoming.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (NUTCH-2999) Update Lucene version to latest 8.x

2023-08-30 Thread Tim Allison (Jira)
Tim Allison created NUTCH-2999:
--

 Summary: Update Lucene version to latest 8.x
 Key: NUTCH-2999
 URL: https://issues.apache.org/jira/browse/NUTCH-2999
 Project: Nutch
  Issue Type: Task
Reporter: Tim Allison


It may be the way that I'm loading the project, but, for me, Intellij really 
does not like the Lucene version conflict between {{scoring-similarity}} and 
the OpenSearch/Elasticsearch modules.

Can we bump Lucene to the latest 8.11.2 throughout?

PR for review incoming.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Commented] (NUTCH-2961) Upgrade dependencies of parsefilter-naivebayes

2023-08-30 Thread Tim Allison (Jira)


[ 
https://issues.apache.org/jira/browse/NUTCH-2961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17760508#comment-17760508
 ] 

Tim Allison commented on NUTCH-2961:


It looks like neither mahout nor lucene are actually used any more.  I may be 
misreading the code...

Can we just get rid of them?

> Upgrade dependencies of parsefilter-naivebayes
> --
>
> Key: NUTCH-2961
> URL: https://issues.apache.org/jira/browse/NUTCH-2961
> Project: Nutch
>  Issue Type: Improvement
>Affects Versions: 1.18
>Reporter: Sebastian Nagel
>Priority: Major
> Fix For: 1.20
>
>
> The dependencies (Mahout 0.9, Lucene 5.5.0) of parsefilter-naivebayes date 
> back to 2016/2017 and may need an upgrade.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)