[jira] [Commented] (NUTCH-2959) Upgrade to Apache Tika 2.9.0

2023-09-28 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2959?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17770236#comment-17770236 ] ASF GitHub Bot commented on NUTCH-2959: --- lewismc commented on PR #776: URL:

[GitHub] [nutch] lewismc commented on pull request #776: NUTCH-2959 -- upgrade Tika to 2.9.0

2023-09-28 Thread via GitHub
lewismc commented on PR #776: URL: https://github.com/apache/nutch/pull/776#issuecomment-1740040368 Try the full path Also make sure the directory exists on HDFS On Thu, Sep 28, 2023 at 14:03 Tim Allison ***@***.***> wrote: > Paging the nutch-test-single-node-cluster

[jira] [Commented] (NUTCH-2959) Upgrade to Apache Tika 2.9.0

2023-09-28 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2959?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17770221#comment-17770221 ] ASF GitHub Bot commented on NUTCH-2959: --- tballison commented on PR #776: URL:

[GitHub] [nutch] tballison commented on pull request #776: NUTCH-2959 -- upgrade Tika to 2.9.0

2023-09-28 Thread via GitHub
tballison commented on PR #776: URL: https://github.com/apache/nutch/pull/776#issuecomment-173655 Paging the `nutch-test-single-node-cluster` helpdesk what do I use for the tika seeds file? Are you using[ our github repo, or the tika-parsers-common package

Re: Establishing a Nutch development roadmap

2023-09-28 Thread Tim Allison
Sorry for two emails... Migrating javax->jakarta has been quite a chore on Tika because of dependencies. Given back-compat issues with hadoop, is this even on the horizon for Nutch? On Thu, Sep 28, 2023 at 9:29 AM Tim Allison wrote: > Y, I'd like to get a working Tika version in a release

Re: Establishing a Nutch development roadmap

2023-09-28 Thread Tim Allison
Y, I'd like to get a working Tika version in a release fairly soon. Not sure how much effort a release is? On Thu, Sep 28, 2023 at 8:29 AM Sebastian Nagel wrote: > Hi Lewis, > > thanks! > > I'd put on top of the list > > * release 1.20 > > Since the release of 1.19 more than one year has

[jira] [Commented] (NUTCH-3006) Downgrade Tika dependency to 2.2.1 (core and parse-tika)

2023-09-28 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3006?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17770059#comment-17770059 ] Tim Allison commented on NUTCH-3006: An alternative approach would be for Tika to revert

[jira] [Commented] (NUTCH-3009) Upgrade to Hadoop 3.3.6

2023-09-28 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17770048#comment-17770048 ] ASF GitHub Bot commented on NUTCH-3009: --- lewismc commented on PR #782: URL:

[GitHub] [nutch] lewismc commented on pull request #782: NUTCH-3009 Upgrade to Hadoop 3.3.6

2023-09-28 Thread via GitHub
lewismc commented on PR #782: URL: https://github.com/apache/nutch/pull/782#issuecomment-1739126057 I’ll check Javac output for any deprecation warnings. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above

[jira] [Commented] (NUTCH-3009) Upgrade to Hadoop 3.3.6

2023-09-28 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17770045#comment-17770045 ] ASF GitHub Bot commented on NUTCH-3009: --- sebastian-nagel opened a new pull request, #782: URL:

[GitHub] [nutch] sebastian-nagel opened a new pull request, #782: NUTCH-3009 Upgrade to Hadoop 3.3.6

2023-09-28 Thread via GitHub
sebastian-nagel opened a new pull request, #782: URL: https://github.com/apache/nutch/pull/782 (no comment) -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe,

[jira] [Commented] (NUTCH-2979) Upgrade Commons Text to 1.10.0

2023-09-28 Thread Sebastian Nagel (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17770041#comment-17770041 ] Sebastian Nagel commented on NUTCH-2979: Note: upgrading to Hadoop 3.3.6 (NUTCH-3009) will update

[jira] [Created] (NUTCH-3009) Upgrade to Hadoop 3.3.6

2023-09-28 Thread Sebastian Nagel (Jira)
Sebastian Nagel created NUTCH-3009: -- Summary: Upgrade to Hadoop 3.3.6 Key: NUTCH-3009 URL: https://issues.apache.org/jira/browse/NUTCH-3009 Project: Nutch Issue Type: Improvement

[jira] [Resolved] (NUTCH-2979) Upgrade Commons Text to 1.10.0

2023-09-28 Thread Sebastian Nagel (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel resolved NUTCH-2979. Resolution: Fixed Resolved, so far, without any direct action: - Nutch core still depends

Re: Establishing a Nutch development roadmap

2023-09-28 Thread Sebastian Nagel
Hi Lewis, thanks! I'd put on top of the list * release 1.20 Since the release of 1.19 more than one year has elapsed. Otherwise I agree with all points on the road map, even in this order / priority. Best, Sebastian On 9/26/23 18:37, lewis john mcgibbney wrote: Hi dev@, I've been at

[jira] [Created] (NUTCH-3008) indexer-elastic: downgrade to ES 7.10.2 to address licensing issues

2023-09-28 Thread Sebastian Nagel (Jira)
Sebastian Nagel created NUTCH-3008: -- Summary: indexer-elastic: downgrade to ES 7.10.2 to address licensing issues Key: NUTCH-3008 URL: https://issues.apache.org/jira/browse/NUTCH-3008 Project: Nutch

[jira] [Commented] (NUTCH-3007) Fix impossible casts

2023-09-28 Thread Markus Jelsma (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17769989#comment-17769989 ] Markus Jelsma commented on NUTCH-3007: -- +1 yes! > Fix impossible casts > > >

[jira] [Commented] (NUTCH-2852) Method invokes System.exit(...) 9 bugs

2023-09-28 Thread Markus Jelsma (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2852?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17769988#comment-17769988 ] Markus Jelsma commented on NUTCH-2852: -- Seems just fine for these files +1 > Method invokes

[GitHub] [nutch] sebastian-nagel opened a new pull request, #781: NUTCH-3007 Fix impossible casts

2023-09-28 Thread via GitHub
sebastian-nagel opened a new pull request, #781: URL: https://github.com/apache/nutch/pull/781 - remove code blocks (else clauses) unneeded and containing impossible casts -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and

[jira] [Commented] (NUTCH-3007) Fix impossible casts

2023-09-28 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17769982#comment-17769982 ] ASF GitHub Bot commented on NUTCH-3007: --- sebastian-nagel opened a new pull request, #781: URL:

[jira] [Created] (NUTCH-3007) Fix impossible casts

2023-09-28 Thread Sebastian Nagel (Jira)
Sebastian Nagel created NUTCH-3007: -- Summary: Fix impossible casts Key: NUTCH-3007 URL: https://issues.apache.org/jira/browse/NUTCH-3007 Project: Nutch Issue Type: Sub-task Affects

[jira] [Commented] (NUTCH-2852) Method invokes System.exit(...) 9 bugs

2023-09-28 Thread Sebastian Nagel (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2852?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17769977#comment-17769977 ] Sebastian Nagel commented on NUTCH-2852: The PR addresses all corresponding issues in the checker

[jira] [Commented] (NUTCH-2852) Method invokes System.exit(...) 9 bugs

2023-09-28 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2852?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17769975#comment-17769975 ] ASF GitHub Bot commented on NUTCH-2852: --- sebastian-nagel opened a new pull request, #780: URL:

[GitHub] [nutch] sebastian-nagel opened a new pull request, #780: NUTCH-2852 SpotBugs: Method invokes System.exit(...)

2023-09-28 Thread via GitHub
sebastian-nagel opened a new pull request, #780: URL: https://github.com/apache/nutch/pull/780 Remove all calls of System.exit(...) in methods except main(args) of various "checker" tools and replace by return values passed to main(). -- This is an automated message from the Apache Git