CPAN-river: can graph calculation be modified?

James E Keenan Fri, 02 Feb 2018 06:48:33 -0800

Overall Question: How can we implement different ways of constructingthe CPAN river?


Background:

Since about this time last year I've had occasion to use the concept ofCPAN-river to derive lists of distributions to be tested againstwhatever Perl 5 blead is of the moment. In particular, for the lastthree months I've been creating assessments of the impact of monthlyPerl 5 development releases on the "top 1000" of the CPAN river. (See,e.g.,http://thenceforward.net/perl/misc/cpan-river-1000-perl-5.27-master.psv.gz)

To calculate the CPAN river, I've been using the programs developed byDavid Golden found here:


https://github.com/dagolden/zzz-index-cpan-meta

... with one modification: a local branch for the second of the threeprograms cited there. I use a local branch because I'm using Linux andcannot install Ramdisk.


Problem:

As I've stared at this data over the past year I've become aware thatthe order in which distros appear in the river is not necessarily themost useful for assessing the real-world impact of changes in blead.Put less charitably, the CPAN river can be "gamed." It is possible fora person to release a large number of distributions which havedependencies on other distributions by the same author. That can boostsome of those distributions high up into the CPAN river -- into, say,the "top 1000" that I use in my monthly program.

But if that author's distributions are not depended upon by *other*authors' distributions then they are arguably less important than thosesuch as Module-Build and DateTime which are depended upon by vastnumbers of distros written by people other than those distros' maintainers.

Since "testing against blead" programs take hours to run, I would liketo have that time spent focusing on what I consider to be more relevantdistros.

For the 5.29.* development cycle starting in May of this year, I wouldlike to be able to use a ranking of CPAN distros which goes beyond asking:


* "How many other distributions depend on this one?"

... to asking:

* "How many distributions by other authors/maintainers depend on this one?"

Would that be feasible?  Has anyone attempted this already?

Thank you very much.
Jim Keenan

CPAN-river: can graph calculation be modified?

Reply via email to