Hi, So far in Discordia, to build the license database, a script browse the central Maven repository and extracts license information from project POMs. So we can at least do a best effort to initialize the database with hopefully accurate data (btw if somebody now of other formal sources that we can use to know the license of a given project/artifact, let me kow). The license information for each artifact (read jar, war, mar, gem, whatever) is then serialized in a file, on file per artifact (with a well chosen name that facilitates indexing).
So far the format used for this license file is as simple as it gets: in org.springframework-spring-jms-2.0.1.xml <artifact> <name>spring-jms</name> <project>org.springframework</project> <version>2.0.1</version> <licenses> <license> <name>The Apache Software License, Version 2.0</name> <url>http://www.apache.org/licenses/LICENSE-2.0.txt</url> <distribution>repo</distribution> </license> </licenses> </artifact> I think there are more pertinent RDF-based formats around. Specifically, I think that Robert came up with something like this for a similar application. Maybe we could reuse the same thing? Thanks! Matthieu