Hi,

So far in Discordia, to build the license database, a script browse the
central Maven repository and extracts license information from project POMs.
So we can at least do a best effort to initialize the database with
hopefully accurate data (btw if somebody now of other formal sources that we
can use to know the license of a given project/artifact, let me kow). The
license information for each artifact (read jar, war, mar, gem, whatever) is
then serialized in a file, on file per artifact (with a well chosen name
that facilitates indexing).

So far the format used for this license file is as simple as it gets:

in org.springframework-spring-jms-2.0.1.xml
<artifact>
  <name>spring-jms</name>
  <project>org.springframework</project>
  <version>2.0.1</version>
  <licenses>
    <license>
      <name>The Apache Software License, Version 2.0</name>
      <url>http://www.apache.org/licenses/LICENSE-2.0.txt</url>
      <distribution>repo</distribution>
    </license>
  </licenses>
</artifact>

I think there are more pertinent RDF-based formats around. Specifically, I
think that Robert came up with something like this for a similar
application. Maybe we could reuse the same thing?

Thanks!
Matthieu

Reply via email to