v01d opened a new pull request #1834: URL: https://github.com/apache/incubator-nuttx/pull/1834
## Summary This is a work in progress to create a series of scripts to obtain the history of a file in the repository and extract all known sources of authorship so that we can determine if it is possible to change the header to Apache depending on each case. A first script (`log2json.sh`) receives a filename (path within the repo) and uses `git log` to obtain various metadata from each commit of the file's history, as well as the blob hash in each case, in order to later retrieve the file contents at each commit. The script uses `jq` command as part of this processing (available in Ubuntu). The second is a python script (`check.py`) which receives the json file as input (or from STDIN) and tries to extract any attributions in the commit messages as well as author information from license headers. Currently this is just a proof of concept and this would require tuning the regular expressions to improve detection of this information. The script also currently simply prints the information detected in the console. The idea would be that this script could, given a list of authors known to have ICLAs, check if a given file is suitable for safe change of its header into Apache. The script could be actually used also for making this change automatically and create a commit with all the reasons on why this is safe to do. ## Impact None, just scripts for generating reports. ## Testing You can run the following example, from within `tools/licensing`: <pre> ./log2json.sh ../configure.c > out.json ./check.py out.json </pre> ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org