Re: Measuring water quality of the CPAN river

2015-05-11 Thread Neil Bowers
 On 11 May 2015, at 01:47, Kent Fredric kentfred...@gmail.com wrote:
 So the quality of a dist could be measured indirectly by the failure rate of 
 its dependents.

This was kind of the basis of the “River Smoker” idea that Tux and and I 
discussed late on the last day of the QAH:

http://neilb.org/2015/04/24/cpan-downstream-testing.html

 Or as an analogy, we have 2 sampling points in the river 100m apart.
 
 If we sample at point 1, we can't sense the fecal matter because it enters 
 the river downstream of the sampling point.
 
 But the fecal matter is sampled at point 2, so by conjecture, either point 2 
 created it, or it entered between points 1 and 2.
 
 Sampling across the river at point 2 helps infer where the probable fecal 
 matter is.

Sort of a cpan bisest, or rather a cpan testers bisect: look at 2 or more CPAN 
Testers fails where the only difference is an upriver version number.

Neil



Re: Measuring water quality of the CPAN river

2015-05-11 Thread Kent Fredric
On 11 May 2015 at 19:20, Neil Bowers neil.bow...@cogendo.com wrote:

 look at 2 or more CPAN Testers fails where the only difference is an
 upriver version number.


my point didn't pertain to upriver versions changing, but the observation
that upriver modules can have buggy code that is only broken on certain
architectures, and have no tests to expose that fact.

Thus, any thing consuming that module, regardless the version its at, will
have failure rates on CPAN for that architecture that the author of the
downstream module didn't anticipate.

But the problem is the upriver module, and its not a symptom exposed by
upriver *changes*, but fundamental issues in that upriver was *alway*
broken.

Most common case of this I've seen is when upriver is only coded so it
works on a specific NV size, and on different NV sizes , behaviour is
unpredictable.

Authors of downstream modules who have the magical NV size don't notice the
problem and ship.

And soon after, they get failures pertaining to their NV size *IF* they had
tests.

This can go on and on and you can get things 3+ deps away exposing an error
that is truely in an upstream module simply due to the intermediate modules
not exposing it due to their absence of tests.

Obviously the accuracy of any such metric gets weaker the further it gets
from the real point of origin. And even at D=2, its already rather vague.



-- 
Kent

*KENTNL* - https://metacpan.org/author/KENTNL


Re: Measuring water quality of the CPAN river

2015-05-10 Thread Kent Fredric
On 11 May 2015 at 12:37, Neil Bowers neil.bow...@cogendo.com wrote:

 These are pretty simple, and have some problems, so I’m hoping someone
 might come up with something more useful.


Random thought while reading: backflow could be a thing to check for.

Some dists don't have tests, and still have deps, causing a false-pass.

So the quality of a dist could be measured indirectly by the failure rate
of its depedents.

Or as an analogy, we have 2 sampling points in the river 100m apart.

If we sample at point 1, we can't sense the fecal matter because it enters
the river downstream of the sampling point.

But the fecal matter is sampled at point 2, so by conjecture, either point
2 created it, or it entered between points 1 and 2.

Sampling across the river at point 2 helps infer where the probable fecal
matter is.


-- 
Kent

*KENTNL* - https://metacpan.org/author/KENTNL


Measuring water quality of the CPAN river

2015-05-10 Thread Neil Bowers
One of the goals of the CPAN River model is to get us to focus on cleaning up 
the river, starting at the headwaters first. To that end, I’ve been thinking 
about how we might measure “river quality”, and have written up two ideas so 
far:

http://neilb.org/2015/05/11/measuring-cpan-quality.html 
http://neilb.org/2015/05/11/measuring-cpan-quality.html

These are pretty simple, and have some problems, so I’m hoping someone might 
come up with something more useful.

Here are the 6 dists in the most upstream category that have = 2% failures on 
CPAN Testers 

podlators
IO
File-Path
IPC-Cmd
XSLoader
Locale-Maketext-Simple

I think I saw that File::Path was worked on at the NY hackathon last weekend, 
so maybe that will drop off this list soon ...

Neil



Re: Measuring water quality of the CPAN river

2015-05-10 Thread David Golden
On Sun, May 10, 2015 at 8:37 PM, Neil Bowers neil.bow...@cogendo.com
wrote:


 These are pretty simple, and have some problems, so I’m hoping someone
 might come up with something more useful.


It's really hard to examine specific cases with CT down at the moment, but
I think we could be considering a number of dimensions.  Brainstorming:

* fails tests now – i.e. blocking automated dep installations – this is
your existing metric
* possibly an OS-specific variation of the above
* stability over time – possibly your existing metric, but taking last N
stable releases or all stable releases in last N years
* something looking at bug queues – possibly open tickets proportional to
number of downriver dependents; possibly looking at open vs close ratios
* process metrics: e.g. trial release before stable
* Kwalitee (might not tell us much, but easy to examine)
* Proportional diff size between stable releases

David




-- 
David Golden x...@xdg.me Twitter/IRC: @xdg