Hi all,

I am working on a Google Summer of Code project that grew out of this discussion/issue <https://github.com/spdx/LicenseListPublisher/issues/60#issuecomment-570511697> concerning the validation of license cross references. Here is a link to my GSoC proposal <https://docs.google.com/document/d/10RlmmsnJ7suDudjgugHMZkOOa-1IsY2Bv_Ew_tgzpv4/edit>.

The focus is on improving the LicenseListPublisher <https://github.com/spdx/LicenseListPublisher> repository so that the generated license data <https://github.com/spdx/license-list-data> is updated with fields describing the validity of each crossref, among others. In order to do this, the structure of the crossref will change in some output formats (e.g. JSON), and in others there will be additional tags. In general, the following fields will be added to the crossrefs:

"isValid": true/false,
Indicates whether or not the crossref URL is a valid URL (e.g. not a local file link).

"isWayBackLink": true/false,
Indicates whether or not the URL points to an archived (Wayback Machine) version of the site where the license is located.

"extraText": true/false,
Indicates whether or not the license at the URL has extra text in its description when compared to the license description in the current file.

"isMatch": true/false,
Indicates whether or not the license at the URL matches (perfectly) the license description in the current file.

"url": "http://landley.net/toybox/license.html",
The URL of the license text/description.

"isDead": true/false
Indicates whether or not the URL is a dead link, i.e. a link whose response status is anything other than HTTP 200 (for example 400 Bad Request, 403 Forbidden, or 404 Not Found).

Please consider this a proposal; any suggestions and/or modifications will be very much appreciated.

Thanks,
Smith
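
P.S. To make the proposed fields more concrete, here is a rough Python sketch of how one crossref entry could be assembled. It is only an illustration, not the LicenseListPublisher implementation. The function name build_crossref, the whitespace/case normalization, the "license text is contained in the page" rule for extraText, and the web.archive.org host check are all assumptions of mine; real license matching would likely follow the SPDX matching guidelines instead. The caller supplies the HTTP status and page text, so the sketch itself does no network access.

```python
import json
from urllib.parse import urlparse


def _normalize(text):
    """Collapse whitespace runs and lowercase, so layout differences are ignored."""
    return " ".join(text.split()).lower()


def build_crossref(url, status_code, fetched_text, license_text):
    """Build one crossref entry with the proposed fields (hypothetical helper).

    status_code and fetched_text are whatever the caller got back when
    requesting the URL (None if no request was possible).
    """
    parsed = urlparse(url)
    # Assumption: "valid" means an http(s) URL with a host, not a local file link.
    is_valid = parsed.scheme in ("http", "https") and bool(parsed.netloc)
    # Assumption: Wayback Machine snapshots are served from web.archive.org.
    is_wayback = (parsed.hostname or "") == "web.archive.org"
    # Per the proposal, any status other than HTTP 200 counts as dead.
    is_dead = status_code != 200

    fetched = _normalize(fetched_text or "")
    expected = _normalize(license_text)
    reachable = is_valid and not is_dead
    # Assumption: a "perfect" match is equality after normalization.
    is_match = reachable and fetched == expected
    # Assumption: extra text means the page contains the license plus more.
    extra_text = reachable and fetched != expected and expected in fetched

    return {
        "url": url,
        "isValid": is_valid,
        "isDead": is_dead,
        "isWayBackLink": is_wayback,
        "isMatch": is_match,
        "extraText": extra_text,
    }


if __name__ == "__main__":
    # Placeholder texts, not the real license wording.
    entry = build_crossref(
        "http://landley.net/toybox/license.html",
        200,
        "Page header. Sample license text.",
        "Sample license text.",
    )
    print(json.dumps(entry, indent=2))
```

Note that in this sketch a URL that cannot be requested at all (status None) is reported as both invalid and dead; whether those two flags should be independent is one of the points I would like feedback on.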
