[jira] [Commented] (NUTCH-3020) ParseSegment should check for protocol's flags for truncation

2023-11-06 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17783365#comment-17783365 ] Hudson commented on NUTCH-3020: --- SUCCESS: Integrated in Jenkins build Nutch » Nutch-trunk #140 (See

Jenkins build is back to normal : Nutch » Nutch-trunk #140

2023-11-06 Thread Apache Jenkins Server
See

[jira] [Resolved] (NUTCH-3020) ParseSegment should check for protocol's flags for truncation

2023-11-06 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved NUTCH-3020. Fix Version/s: 1.20 Resolution: Fixed > ParseSegment should check for protocol's flags for

[jira] [Commented] (NUTCH-3020) ParseSegment should check for protocol's flags for truncation

2023-11-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17783360#comment-17783360 ] ASF GitHub Bot commented on NUTCH-3020: --- tballison merged PR #794: URL:

Re: [PR] NUTCH-3020 -- ParseSegment should check for okhttp's truncation flag [nutch]

2023-11-06 Thread via GitHub
tballison merged PR #794: URL: https://github.com/apache/nutch/pull/794 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[jira] [Commented] (NUTCH-3019) Upgrade to Apache Tika 2.9.1

2023-11-06 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17783352#comment-17783352 ] Tim Allison commented on NUTCH-3019: {noformat} [junit] Tests run: 7, Failures: 4, Errors: 0,

[jira] [Commented] (NUTCH-3019) Upgrade to Apache Tika 2.9.1

2023-11-06 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17783327#comment-17783327 ] Hudson commented on NUTCH-3019: --- FAILURE: Integrated in Jenkins build Nutch » Nutch-trunk #139 (See

Build failed in Jenkins: Nutch » Nutch-trunk #139

2023-11-06 Thread Apache Jenkins Server
See Changes: [github] NUTCH-3019 -- update Tika (#797) -- [...truncated 760.61 KB...] resolve-default: [ivy:resolve] :: loading settings :: file =

[jira] [Resolved] (NUTCH-3019) Upgrade to Apache Tika 2.9.1

2023-11-06 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved NUTCH-3019. Fix Version/s: 1.20 Resolution: Fixed > Upgrade to Apache Tika 2.9.1 >

[jira] [Commented] (NUTCH-3019) Upgrade to Apache Tika 2.9.1

2023-11-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17783290#comment-17783290 ] ASF GitHub Bot commented on NUTCH-3019: --- tballison merged PR #797: URL:

Re: [PR] NUTCH-3019 -- update Tika to 2.9.1 [nutch]

2023-11-06 Thread via GitHub
tballison merged PR #797: URL: https://github.com/apache/nutch/pull/797 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[jira] [Comment Edited] (NUTCH-3019) Upgrade to Apache Tika 2.9.1

2023-11-06 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17783254#comment-17783254 ] Tim Allison edited comment on NUTCH-3019 at 11/6/23 3:46 PM: - tballison

[jira] [Commented] (NUTCH-3019) Upgrade to Apache Tika 2.9.1

2023-11-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17783254#comment-17783254 ] ASF GitHub Bot commented on NUTCH-3019: --- tballison commented on PR #797: URL:

Re: [PR] NUTCH-3019 -- update Tika to 2.9.1 [nutch]

2023-11-06 Thread via GitHub
tballison commented on PR #797: URL: https://github.com/apache/nutch/pull/797#issuecomment-1795161171 ```2023-11-06T15:02:47.9408964Z [junit] Tests run: 14, Failures: 2, Errors: 0, Skipped: 4, Time elapsed: 4.342 sec 2023-11-06T15:02:48.2192793Z [junit] Test

[jira] [Comment Edited] (NUTCH-3019) Upgrade to Apache Tika 2.9.1

2023-11-06 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17783252#comment-17783252 ] Tim Allison edited comment on NUTCH-3019 at 11/6/23 3:32 PM: - I just got

[jira] [Commented] (NUTCH-3019) Upgrade to Apache Tika 2.9.1

2023-11-06 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17783252#comment-17783252 ] Tim Allison commented on NUTCH-3019: ParserStatus         failed=84         success=625 > Upgrade to

[jira] [Commented] (NUTCH-3019) Upgrade to Apache Tika 2.9.1

2023-11-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17783228#comment-17783228 ] ASF GitHub Bot commented on NUTCH-3019: --- tballison commented on PR #797: URL:

Re: [PR] NUTCH-3019 -- update Tika to 2.9.1 [nutch]

2023-11-06 Thread via GitHub
tballison commented on PR #797: URL: https://github.com/apache/nutch/pull/797#issuecomment-1794934171 Need to keep as draft until the 2.9.1.0 shim actually lands in maven central. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to

[jira] [Commented] (NUTCH-3019) Upgrade to Apache Tika 2.9.1

2023-11-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17783227#comment-17783227 ] ASF GitHub Bot commented on NUTCH-3019: --- tballison opened a new pull request, #797: URL:

[PR] NUTCH-3019 -- update Tika to 2.9.1 [nutch]

2023-11-06 Thread via GitHub
tballison opened a new pull request, #797: URL: https://github.com/apache/nutch/pull/797 Thanks for your contribution to [Apache Nutch](https://nutch.apache.org/)! Your help is appreciated! Before opening the pull request, please verify that * there is an open issue on the [Nutch

[jira] [Commented] (NUTCH-3025) urlfilter-fast to filter based on the length of the URL

2023-11-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17783226#comment-17783226 ] ASF GitHub Bot commented on NUTCH-3025: --- jnioche opened a new pull request, #796: URL:

[jira] [Created] (NUTCH-3025) urlfilter-fast to filter based on the length of the URL

2023-11-06 Thread Julien Nioche (Jira)
Julien Nioche created NUTCH-3025: Summary: urlfilter-fast to filter based on the length of the URL Key: NUTCH-3025 URL: https://issues.apache.org/jira/browse/NUTCH-3025 Project: Nutch Issue