[ https://issues.apache.org/jira/browse/NUTCH-2290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17579082#comment-17579082 ]
ASF GitHub Bot commented on NUTCH-2290: --------------------------------------- sebastian-nagel opened a new pull request, #743: URL: https://github.com/apache/nutch/pull/743 This PR addresses 3 license-related issues: - NUTCH-2290 Update licenses of bundled libraries - NUTCH-2822 Split the LICENSE.txt file into two files for source resp. binary releases - NUTCH-2821 Deduplicate licenses in LICENSE.txt file and also includes (kind of a precondition) - NUTCH-2861 Remove parse-swf / #742 The changes have been partially created automatically running a Python Jupyter notebook (see the attachment in NUTCH-2290 on Jira). > Update licenses of bundled libraries > ------------------------------------ > > Key: NUTCH-2290 > URL: https://issues.apache.org/jira/browse/NUTCH-2290 > Project: Nutch > Issue Type: Bug > Components: deployment > Affects Versions: 2.3.1, 1.12 > Reporter: Sebastian Nagel > Assignee: Sebastian Nagel > Priority: Major > Labels: help-wanted > Fix For: 1.19 > > Attachments: 3rd-party-license-report.sh, > 3rd-party-licenses-nutch-1.15.txt, apache_nutch_1.17_3rd_party_licenses.txt > > > The files LICENSE.txt and NOTICE.txt were last edited 5 years ago and should > be updated to include all licenses of dependencies (and their dependencies) > in accordance to [Assembling LICENSE and NOTICE > HOWTO|http://www.apache.org/dev/licensing-howto.html]: > # check for missing or obsolete licenses due to added or removed dependencies > # update year in NOTICE.txt -- should be a range according to the licensing > HOWTO > # bundled libraries are referenced with path and version number, e.g > {{lib/icu4j-4_0_1.jar}}. This would require to update the LICENSE.txt with > every dependency upgrade. A more generic reference ("ICU4J") would be easier > to maintain but the HOWTO requires to "specify the version of the dependency > as licenses are sometimes changed". > # try to reduce the size of LICENSE.txt (currently 5800 lines). Mainly, > according to the HOWTO there is no need to repeat the Apache license again > and again. -- This message was sent by Atlassian Jira (v8.20.10#820010)