>>>>> On Sun, 5 Apr 2009 00:07:03 -0400, David Golden <[email protected]> said:
> * Historical CPAN Testers reports don't actually contain that > information -- all they generally have is DISTNAME-VERSION > That's easy enough to fix going forward, but it makes importing > history difficult -- and it even makes testing the Metabase difficult > as I have to shave yaks in CPAN::Reporter and Test::Reporter to pass > the full author/tarball path > My thought: get a full list of all tarballs on backpan create a > mapping table -- hopefully, there are few cases of duplicate > distname-version. > Q1: does that exist or could it be produced easily? It could be produced from the PAUSE database. The table distmtimes contains all files that ever existed: mysql> select count(*) from distmtimes; +----------+ | count(*) | +----------+ | 63632 | +----------+ 1 row in set (0.00 sec) mysql> select * from distmtimes order by distmtime+0 limit 1; +----------------------------+-----------+---------------------+-------------+------------+ | dist | distmtime | distmdatetime | indexing_at | indexed_at | +----------------------------+-----------+---------------------+-------------+------------+ | T/TO/TOMC/scripts/boxit.gz | 682812000 | 1991-08-22 00:00:00 | NULL | NULL | +----------------------------+-----------+---------------------+-------------+------------+ 1 row in set (0.10 sec) > Q2: any thoughts on how that could be either kept up to date or > web-queryable for ongoing mapping of "version 1" reports as they are > produced? Ricardo asked me during a breakfast for a new index for URI::cpan: 4 columns (author,distbase,version,path) and it became a Todo item for me. But I haven't yet done anything about it:-( -- andreas
