Possibly relevant/of interest to those following this thread, from a student of Daniel's:
http://turingmachine.org/~dmg/temp/trevor_BSD_report_2012_12_23.pdf I have not read it all yet, but it looks useful for informing our discussion of how to clean up/further standardize the BSD/MIT/X11 variants, based on an analysis of 70K versions of the statements, as found in Debian. On Tue, Nov 27, 2012 at 11:28 AM, D M German <[email protected]> wrote: > Gervase Markham twisted the bytes to say: > > > > Hi Gervase, > > Why don't you try the tool we developed. It is a bit hacky, but it will > help you do what you are doing automatically. > > http://github.com/dmgerman/ninka > > If you run the tool, make sure it is a copy of the code. It will > create, for each file you specify in its command line (I recommend you > use xargs to run it) several files. The ones you are looking for are: > > *license, *.senttok and *.goodsent > > they will abstract the information you are looking for. > > As I mentioned in the previous message, what is the code you are looking > at? I can run the tool myself and give you the resulting data. > > -daniel > > > Gervase> On 26/11/12 23:44, Luis Villa wrote: > >> I wonder if there is an easy way to visualize the various changes you > >> have in your data set, to see where people agreed/disagreed/edited, > >> outside the obvious changes. Daniel German, cc'd, may have already > >> tackled this, or have other ideas along these lines. > > Gervase> I don't have an automated way. I gave myself 10 minutes to do it by > Gervase> hand, and the results are as follows: > > Gervase> <ORGANIZATION>: > > Gervase> * "the author" > Gervase> * "the above-listed copyright holder(s)" > Gervase> * "Yahoo! Inc.", followed by "nor the names of YUI's contributors" > Gervase> * "the copyright holder" > Gervase> * "Google" > Gervase> * "the Eclipse Foundation, Inc." > Gervase> * "the University" > Gervase> * "Google Inc." > Gervase> * "the Xiph.org Foundation nor Pinknoise Productions Ltd" > Gervase> * "TransGaming Inc., Google Inc., 3DLabs Inc. Ltd.," > Gervase> * "the David Beazley or Dabeaz LLC" (!) > Gervase> * "the Jython Developers" > Gervase> * "KTH" > Gervase> * "The Android Open Source Project" > Gervase> * Rewording: "The names of the authors may not be used to > endorse..." > Gervase> * Rewording: "The names of the author may not be used to endorse..." > Gervase> * "David Young" > Gervase> * "the project" > Gervase> * "Cisco Systems, Inc." > Gervase> * "the libjpeg-turbo Project" > Gervase> * "the Motorola, Inc." (!) > Gervase> * "Adobe Systems, Network Resonance" > Gervase> * "Parakey Inc" > Gervase> * "Apple Computer, Inc. ("Apple")" > Gervase> * "the copyright holders" > Gervase> * "Network Resonance, Inc." > Gervase> * "the company" > Gervase> * "Redis" > Gervase> * "Apple Computer, Inc. ("Apple") or The Mozilla Foundation > ("Mozilla")" > Gervase> * "The NetBSD Foundation" > Gervase> * "the psutil authors" > Gervase> * "the Institute" > Gervase> * "the Eclipse Foundation, Inc." > Gervase> * "the Cisco Systems, Inc." (!) > Gervase> * "the author(s)" > Gervase> * the Xiph.org Foundation" > Gervase> ... and several more. > > Gervase> Disclaimer section: > > Gervase> Much less variation here, the first two being by far the most > common: > > Gervase> * "THE COPYRIGHT HOLDERS AND CONTRIBUTORS" > Gervase> * "THE AUTHOR" > Gervase> * "THE REGENTS AND CONTRIBUTORS" > Gervase> * "Google Inc." > Gervase> * "KTH AND ITS CONTRIBUTORS" > Gervase> * "The Android Open Source Project" > Gervase> * "THE AUTHOR AND CONTRIBUTORS" > Gervase> * "DAVID YOUNG" > Gervase> * "THE PROJECT AND CONTRIBUTORS" > Gervase> * "APPLE AND ITS CONTRIBUTORS" > Gervase> * "SUN MICROSYSTEMS, INC." > Gervase> * "APPLE, MOZILLA AND THEIR CONTRIBUTORS" > Gervase> * "THE NETBSD FOUNDATION, INC" > Gervase> * "THE INSTITUTE AND CONTRIBUTORS" > > Gervase> As far as I can tell, other than the substitution of names on > Gervase> occasion, the disclaimer is otherwise identical. And there is very > Gervase> little variation in the other text too. > > Gervase> Bullets: > > Gervase> * None > Gervase> * "1." > Gervase> * "a)" > Gervase> * "-" > Gervase> * "*" > Gervase> * In one case, 1), 2) and nothing! > Gervase> * In another, 1), 2) and "-"! > Gervase> * In another, nothing, nothing and "-"! > Gervase> * In another, all the paras are run together > > Gervase> Numbers seem to be the most common. > > Gervase> Gerv > > > -- > Daniel M. German "An intellectual is someone whose > Albert Camus -> mind watches itself. " > http://turingmachine.org/ > http://silvernegative.com/ > dmg (at) uvic (dot) ca > replace (at) with @ and (dot) with . > > _______________________________________________ License-discuss mailing list [email protected] http://projects.opensource.org/cgi-bin/mailman/listinfo/license-discuss

