[
https://issues.apache.org/jira/browse/NUTCH-2290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17579082#comment-17579082
]
ASF GitHub Bot commented on NUTCH-2290:
---------------------------------------
sebastian-nagel opened a new pull request, #743:
URL: https://github.com/apache/nutch/pull/743
This PR addresses 3 license-related issues:
- NUTCH-2290 Update licenses of bundled libraries
- NUTCH-2822 Split the LICENSE.txt file into two files for source resp.
binary releases
- NUTCH-2821 Deduplicate licenses in LICENSE.txt file
and also includes (kind of a precondition)
- NUTCH-2861 Remove parse-swf / #742
The changes have been partially created automatically running a Python
Jupyter notebook (see the attachment in NUTCH-2290 on Jira).
> Update licenses of bundled libraries
> ------------------------------------
>
> Key: NUTCH-2290
> URL: https://issues.apache.org/jira/browse/NUTCH-2290
> Project: Nutch
> Issue Type: Bug
> Components: deployment
> Affects Versions: 2.3.1, 1.12
> Reporter: Sebastian Nagel
> Assignee: Sebastian Nagel
> Priority: Major
> Labels: help-wanted
> Fix For: 1.19
>
> Attachments: 3rd-party-license-report.sh,
> 3rd-party-licenses-nutch-1.15.txt, apache_nutch_1.17_3rd_party_licenses.txt
>
>
> The files LICENSE.txt and NOTICE.txt were last edited 5 years ago and should
> be updated to include all licenses of dependencies (and their dependencies)
> in accordance to [Assembling LICENSE and NOTICE
> HOWTO|http://www.apache.org/dev/licensing-howto.html]:
> # check for missing or obsolete licenses due to added or removed dependencies
> # update year in NOTICE.txt -- should be a range according to the licensing
> HOWTO
> # bundled libraries are referenced with path and version number, e.g
> {{lib/icu4j-4_0_1.jar}}. This would require to update the LICENSE.txt with
> every dependency upgrade. A more generic reference ("ICU4J") would be easier
> to maintain but the HOWTO requires to "specify the version of the dependency
> as licenses are sometimes changed".
> # try to reduce the size of LICENSE.txt (currently 5800 lines). Mainly,
> according to the HOWTO there is no need to repeat the Apache license again
> and again.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)