On Nov 20, 2015 Eric S. Raymond <[email protected]> wrote:
> I'm thinking about writing a codewalker that would scan a source tree
> for license inclusions and replace them with SPDX tags.
>
> The hard part of this wouldn't be the code, it would be scraping
> copies of all the canonical license texts and SPDX names.
>
> For this, and other related reasons, I request that you make the
> license list available in a machine-parseable form. What I'd like to
> be able to do is write a code generator that massages that form into
> Python structures that then drive the source transformation.

Eric:
I maintain such a list in scancode [1] for SPDX and many other licenses.
The format is one YAML file for each license's metadata, a .LICENSE
file for the license text, and a .SPDX file for the SPDX reference
license text.
There were requests to have that in a separate repo; if that would help, I
can pull it out.
There is also Python code to read it [2].
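
To illustrate the layout described above, here is a minimal sketch that
groups the files for each license by their shared base name. It is not
the scancode code referenced in [2]. The ".yml" extension for the
metadata file, the role names, and the function name group_license_files
are assumptions made for illustration; only the .LICENSE and .SPDX
suffixes come from the description above.

```python
import os
from collections import defaultdict

# Map file extensions to the role each file plays for a license.
# ".yml" is an assumed extension for the metadata file; ".LICENSE" and
# ".SPDX" are the suffixes named in the description.
ROLES = {".yml": "metadata", ".LICENSE": "text", ".SPDX": "spdx_text"}


def group_license_files(directory):
    """Return {license_key: {role: path}} for files sharing a base name."""
    grouped = defaultdict(dict)
    for name in sorted(os.listdir(directory)):
        # splitext splits at the last dot, so "gpl-2.0.LICENSE" yields
        # the key "gpl-2.0" and the extension ".LICENSE".
        key, ext = os.path.splitext(name)
        role = ROLES.get(ext)
        if role is not None:
            grouped[key][role] = os.path.join(directory, name)
    return dict(grouped)
```

A code generator could then iterate over the returned mapping, load each
metadata file with a YAML parser of its choice, and emit Python
structures keyed by license.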

[1] https://github.com/nexB/scancode-toolkit/tree/develop/src/licensedcode/data/licenses
[2] https://github.com/nexB/scancode-toolkit/blob/c1e70994abe8ceb18bc99aa6f81ebc40d832fb7f/src/licensedcode/models.py#L62
-- 
Cordially
Philippe Ombredanne
_______________________________________________
Spdx-tech mailing list
[email protected]
https://lists.spdx.org/mailman/listinfo/spdx-tech
