On Nov 20, 2015 Eric S. Raymond <[email protected]> wrote: > I'm thinking about writing a codewalker that would scan a source tree > for license inclusions and replace them with SPDX tags. > > The hard part of this wouldn't be the code, it would be scraping > copies of all the canonical license texts and SPDX names. > > For this, and other related reasons, I request that you make the > license list available in a machine-parseable form. What I'd like to > be able to do is write a code generator that massages that form into > Python structures that then drive the source transformation.
Eric: I maintain such a list in scancode [1] for SPDX and many other licenses. The format is one yaml file for each licenses metadata and a .LICENSE file for the license text and a .SPDX file for the SPDX reference license text. There were request to have that in a separate repo,if that can help I can pull it out. There is also Python code to read it [2] [1] https://github.com/nexB/scancode-toolkit/tree/develop/src/licensedcode/data/licenses [2] https://github.com/nexB/scancode-toolkit/blob/c1e70994abe8ceb18bc99aa6f81ebc40d832fb7f/src/licensedcode/models.py#L62 -- Cordially Philippe Ombredanne _______________________________________________ Spdx-tech mailing list [email protected] https://lists.spdx.org/mailman/listinfo/spdx-tech
