Re: [python-committers] Codecov and PR

Terry Reedy Tue, 25 Apr 2017 17:01:16 -0700

On 4/25/2017 11:00 AM, Barry Warsaw wrote:

On Apr 24, 2017, at 09:32 PM, Ethan Furman wrote:

On 04/21/2017 03:29 PM, Victor Stinner wrote:


(In the context of having a patch blocked by the blind Codecov robot ...)

I dislike code coverage because there is a temptation to write artificial
tests whereas the code is tested indirectly or the code is not important
enough to *require* tests.

While I use code coverage to improve automated unittesting, I am opposedto turning a usable but limited and sometime faulty tool into a blindrobotic master that blocks improvements. The prospect of this beingdone has discouraged me from learning the new system. (More on 'faultytool' later.)

The temptation to write artificial tests to satisfy an artificial goalis real. Doing so can eat valuable time better used for something else.For instance:


    def meth(self, arg):
        mod.inst.meth(arg, True, ob=self, kw='cut')

Mocking mod.class.meth, calling meth, and checking that the mock iscalled will satisfy the robot, but does not contribute much to the goalof providing a language that people can use to solve problems.


Victor, can you explain 'tested indirectly' and perhaps give an example?

As used here,'whereas' is incorrect English and a bit confusing. Ibelieve Victor meant something more like 'even when'.

For the last clause, I believe he meant "the code change is notcomplicated enough to *require* automated unit test coverage for thechanged line(s)". If I change a comment or add missing spaces, I don'tthink I should be forced to write a missing test to make the improvement.

A less trivial example: on IDLE's menu, Options => Configure IDLE opensa dialog with a font size widget that when clicked opens a list of fontsizes. I recently added 4 larger sizes to the tuple inidlelib.configdialog.ConfigDialog.LoadFontCfg, as requested, I think, byat least 2 people. I tested manually by clicking until the list wasdisplayed. As I remember, I did not immediately worry about automatedtesting, let alone line coverage, and I do not think I should have hadto to get the change into 3.6.0.

That line may or may not by covered by the current minimal test thatsimply creates a ConfigDialog instance. But this gets back to what Ithink is Viktor's point. This minimal test 'covers' 46% of the file,but it only tests that 46% of the lines run without raising. This isuseful, but does not test that the lines are really correct. (For GUIdisplay code, human eyeballing is required.) This would remain trueeven if all the other code were moved to a new module, making thecoverage of configdialog the magical ***100%***.

If it's not important enough to require tests >> it's not important enough to 
be in Python.  ;)


Modules in the test package are mostly not tested. ;)

If 'test' means 'line coverage test for new or changed lines', then as apractical matter, I disagree, as explained above. So, in effect, didthe people who committed untested lines.

In the wider sense of 'test', there is no real argument. Each statementwritten should be mentally tested both when written and when reviewed.Code should be manually tested, preferably by someone in addition to theauthor. Automated testing is more than nice, but not everything. Dittofor unit testing.


Some practical issues with coverage and CodeCov:

1. A Python module is comprised of statements but coverage module countsphysical lines. This is good for development, but not for gating. Thenumber of physical lines comprising a statement can change withoutchanging or with only trivially changing the compiled run code. So ifcoverage is not 100%, it can vary without a real change in statementcoverage.

2. Some statements are only intended to run on certain systems, making100% coverage impossible unless one carefully puts all system-specificcode in "if system == 'xyz'" statements and uses system-specific.coveragerc files to exclude code for 'other' systems.

3. Some tests required extended resources. Statements that are onlycovered by such tests will be seen as uncovered when coverage is run ona system lacking the resources. As far as I know, all non-Windowsbuildbots and CodeCov are run on systems lacking the 'gui' resource. Sopatches to gui code will be seen as uncovered.

4. As I explained in a post on the core-workflow list, IDLE needs thefollowing added to the 'exclude_lines' item of .coveragerc.

    .*# htest #
    if not _utest:

The mechanism behind these would also be useful for testing any othermodules, scripts, or demos that use tkinter GUIs.


There seems to be other issues too.

"Untested code is broken code" :)

Most of CPython, including IDLE, has been pretty thoroughly tested. Andwe have heaps of bug reports to show for it. What's more important isthat even code that is tested, by whatever means, may still bugs. HenceHowever, obscure bugs are still found. And even correct code can becorrupted (repressed) by attempt fix and improve.


--
Terry Jan Reedy
_______________________________________________
python-committers mailing list
python-committers@python.org
https://mail.python.org/mailman/listinfo/python-committers
Code of Conduct: https://www.python.org/psf/codeofconduct/

Re: [python-committers] Codecov and PR

Reply via email to