Sebastian Pipping wrote: >> I'd like to determine the subset of URLs that appear >> exactly once in both gentoo and debian source packages. > > Mappable homepages in Debian: 6222 > Mappable homepages in Gentoo: 9582 > Shared (without normalization): 1183
With normalization for SourceForge, Google Code, Alioth, Savannah, Berlios, RobyForge, Gna, Pypi the number of directly mappable packages increases by about 500: Mappable homepages in Debian: 6222 Mappable homepages in Gentoo: 9582 Shared (w/o normalization): 1183 Shared (w/ normalization): 1670 Sebastian
