Re: Committing to Subversion (Re: Some questions about CI)

Emmanuel Lecharny Tue, 18 Mar 2008 13:54:23 -0700

<snip/>

<seriously>
BTW this was just for laughs on an otherwise boring topic.
</seriously>

Sure ! This is the way I understood it :)

<snip/>
Before going into your points below I want to state the personal rulesI try to operate by on commits.(1) Always try not to break the build and tests (unit and integration)on commits to public branches and trunks - if I cannot do this Icreate a temporary private branch so my tinkering does not cost otherspain and grief.
(2) Try to commit coherent chunks of work in a single transactionwhich are easy to track. This makes it so I can add a one linersummarizing the general gist of the change. For example it may besomething like "Adding support for varargs in Entry API ...". Then Ican have some bullet points describing the specifics of the changes ifthey are noteworthy.
(3) Avoid mixing bogus formatting/refactoring changes with bug fixesor feature work. When refactoring changes like class renaming orformatting I make sure this is in a separate commit so it does notobscure the diff of these other kinds of changes.
(4) Don't pick at unrelated issues. Focus on the objective. Make thechanges and commit with a coherent focus message and diff so thechange is easy to understand. I can relax later and pick at thingslike changing the code to support generics or something I saw that wasnot right but unrelated. Use JIRA to note things or a notepad and getback to them after you have reached your objective and committed the code.
The overall goal with these rules is to have a searchable, easy tounderstand version history without noise to obscure the picture.Apache projects in general promote this with unofficial rules likedon't use tabs but use spaces - this is so whitespace variance to doesnot produce bogus chunks in diffs. This is just one example. Noiseor chatter should be reduced because there's a lot of information tobe filtered.
</seriously>.

Let me give you some enlightments about the way I work, which is somehowdifferent, but not that much.


(1) Totally agree. I'm not creating enough branch, and this is a bad habit.

(2) I usually prefer spliting the commits in smaller chunks, becauseit's sometime difficult to check what has been changed when you arecommitting more than a couple of files. If you are using eclipse, assoon as you have opened the 'commit' popup, you can't go back to thecode and look at the differences. So it's more confortable to do acomparison, and then a simple commit. Let's say it's a IDE driventechnic, and if the popup was not modal, I may gather more files in onecommit. But I also try to avoid single files commits when I have somerelated files to be committed.

(3) That is much more questionable, IMHO. For some reason, I found thatwhen you walk the code, and you find a bad javadoc, a bug, a typo, thenyou should fix it, otherwise you may never come back and fix it. Infact, this is always the case : you never come back and clean the code...

(4) Pretty raisonnable. Every time I tried to chase two targets, Irarely caught one of them, not to mention I caught both :)

Basically, I don't think we are far from working in a different way. Ican deal with your (2). It's just a matter of opening a text editor,which is something I'm doing more and more now, trying to follow yourbest practices. (just because they are, well, better than mine :)


That being said, let's continue the CI discussion.



    On Tue, Mar 18, 2008 at 12:41 PM, Emmanuel Lecharny
    <[EMAIL PROTECTED] <mailto:[EMAIL PROTECTED]>> wrote:

        Alex Karasulu wrote:
        >
        >
        >     >  - - How often shall a build be done (compile/test,
        sitegeneration)
        >
        >     We have many possible options. We tried something like
        kicking CI
        >     after each commit, but it leads to issues (usually, we
        don't commit
        >     code in one big shot,
        >
        >
        > Yes this does happen but it's bad practice on our part.
        I don't think so. First because when you commit, you usually have
        already checked locally that the server is ok (_usually_ =>
        sometime,

this is not the case ;).

We can't trust that. This is the reason in the first place forsetting up a CI server.

I won't say that. I would rather say that we trust it, but we know thatwe will have some bad commits from time to time, and the CI will catchthose bad commits.

<WARNING>I'm *not* saying that you don't understand what is a CI systemin the following paragraph</WARNING>For me, there is a big misconception, and also a really badunderstanding of what a CI system is, and I saw that on many projects Iworked on, even before working on ADS. When a CI is installed, after afew weeks, dev just become lazzy, commits the code when leaving theoffice/going to bed, thinking : "F*ck the potential breakage, I willlook at it tomorrow morning, anyway the CI will inform me ...". A CIshould just be the latest safety net you have.


<personal vision>

I think that CI is useless. You don't have to share this vision, but Inever saw an environment where a CI was enforcing the code quality andthe commit quality. Except when working on a branch, it's just a wasteof time and energy. I even saw a CI being installed as a way to protectthe integration team against junk code being pushed by a dev team whichdidn't took enough time to run integration tests, just becaus, eh, theinteg team is just here for that purpose, nah ?

So what a CI is good for ? Well, when working in a branch, it can beusefull, as you can commit blindly, and check the day after if you brokesomething. It saves time, as you don't have to run integ tests.

What else ? When you have big and costly integration tests, then, yes, aCI is mandatory.

</personal vision>

Although I will entertain you by actually addressing your points, Ireally don't have to since you've repeatedly broken the damn buildmore than all of us combined.

That hurts ;) But this is so damn true ... I hope the situation isbetter now, as I'm trying to discipline myslef, and also thanks to thegood work you did with the 'reverter' ! A fast integration test isprobably the most important factor in better commits. Now that it onlytakes 8 minutes to run them, instead of 25', I'm more serious aboutrunning them before every commits...


May be my commit rate is also a factor ;) (but certainly not an excuse !!!)

Problem is, you write some good code so we let you commit anyway.Secretly we all talk behind your back. "Oh man Emmanuel broke thedamn build again. When is he going to start committing thingsproperly. I guess we just have to live with it since he does so muchfor the project, makes the server fast and buys everyone beers atApacheCon."

Ahhh !!! Yes, I heard the 'whispers' while I'm quietly sleeping whileyour are trying to fix what I have broke :)

<seriously>
Another reason why I take the time is because you live with my faultstoo which are much worse and I am thankful for your patience. I oweyou that as this is a meritocracy as well as many other things. Thisis the great thing about this project.
</seriously>

<seriously too>

You are right, breaking the server is bad. Everytime I did it, I feltguilty.

</seriously too>

<not seriously>

And now the problem is that I barely can blame Maven, it's so muchbetter than it was ! So it's all my fault ...

<not seriously/>

        Second, because doing more than one commit
        allows you to comment more precisely what kind of modification
        you have
        done.
Emmanuel your comments clutter the repository as they do the mailinglists with sometimes unnecessary information in a volume that isoverwhelming. Besides sounding like someone with peanut butter stuckto the roof of your mouth, you're constantly pointing out littledetails that don't matter that much. Either that or you're off topicin la la land.

I won't worry to much about the volume :) Last year, I committed around1200 times, and we are now at revision 566 000... Ok, I may commit lesssmall modifications... I'm trying :)

Ok ok let me be a bit more serious ... I'll stop trying to make youlaugh for a minute.

np !

We don't want log messages cluttered in the svn commit log with thingslike,


r1 "Adding generics to Comparator arguments"
r2 "Correct spelling"
r3 "Added Javadocs to methods"
r4 "Renamed some variables for clarity"
r5 "Added some defensive checks for null arguments"

Instead you can have this in one commit like so:

r1 "Cleaning up various jdbm-store module ...

o Adding generics to private method arguments

   o Correct spelling because Alex can't spell
   o Added Javadocs to methods
   o Renamed some private variables for clarity
   o Added some defensive checks for null arguments "

Well, I must say I agree. I'm trying to gather the comment more and morenow. Just because it's almost useless to people, and also because whenyou have to track a commit in the middle of a forest of useless commits ...

Now is this mandatory? Nope not according to rule #1 which is mostimportant. Just don't break the build and sometimes when you have toyou can commit these kinds of changes one by one. This clumping ofminor details into one commit just keeps the clutter down and makessearching the repository logs much easier to do. Sometimes you'lljust have no choice to do these independently and that's ok but youshould have the intention of keeping the noise down.

Fair enough.

        I personnally don't really like to commit a big shot of code,
        unless it
is really closely connected.
Right understood. Usually I like to try to have a clear concisepicture of what I will do for a specific commit to restrict it toclosely related changes but this is not always possible.Just think how much more useful our logs can be to us when we'rehunting down historical changes if the changes are more coherentinstead of mixed, unrelated, and strewn with bogus non-critical logs.

I would ponder my previous comments, which was not really reflectingwhat I wanted to say : I like to commits related things together, to acertain extent, but sometime, I also like to spilt the commit in morethan one piece. This is typically the case when committing in shared andin apacheds.

        But as I also fix some javadoc, bugs,
        warnings while browsing the code, I like to commit in smaller
        blocks.
Yeah I understand and you can do that. Just add the javadocs and makethe fix while working in the same regions sure. But you don't have tocommit that separately: most likely no one knows about the bug fix youjust made unless they've noticed it. Put the bug fix in a bulletpoint on the overall commit like, "found nasty bug in filter parsingand fixed". Another thing is you loose focus and pick at some thingsas if you had the same luxury when you pick at your arse. Often thesethings are unrelated. Like you start fixing something then startdoing generics all over the place and commit something that obscuresyour main intentions. You gotta stop drinking so much buddy.

:)

I have also slightly changed my commit habits lately. I now try togather the related modifications in a single commits, and avoid singlecommits as much as I can.


<joking>
My ohloh commit-meter is now high enough ;)
</joking>

<seriously>
The collateral changes you make are very valuable and I appreciateyour taking the time and effort to make them. This is all good in thesense that it makes the code more readable so please note that I'm nottalking about absolute rules here. These are guidelines to followthat lead to overall better conditions on a big project. There's alot of code and information here so let's try to make that informationeasy to filter.
</seriously>

It takes time to be infusate with good practice. Last year, I wascommitting like crazy, one change = one commit. And now, I find thisfutile, and almost useless. This was at a time I was 'killing' warnings,and it was not really smart.

I found it more usefull now to limit the number of commits and to addmore precise comments. I think this is reflected in some the latestcommits I did :


URL: http://svn.apache.org/viewvc?rev=637318&view=rev
Log:
o Added the DefaultServerAttribute class missing methods and tests
o Lot of tests added
o Removal of the useless AbstractServerAttribute class


URL: http://svn.apache.org/viewvc?rev=637317&view=rev
Log:
o Addition of the ClientAttribute interface
o Addition of the DefaultClientAttribute implementation
o Lot of tests added for this class
o Removal of the useless AbstractClientAttrbute class, as the inheritence 
scheme has been heavily reworked and simplified
o Other minor related modification

        > I prefer a build on each commit so it's easier to catch the
        offending
        > commit and isolate it to a user who can be informed
        immediately while
        > they still have a mental stack in memory.
        If you kick a build after each commit, you may have many
        builds kicked
when a lot of commits are done.
Well if you are committing like a super sonic hedge hog then yeah.That's the whole point of my email here. Let's not commit every timewe dot our i's or cross our t's.

ok got the message :)

I know this is all about your "oh lo" ratings though :-).

Shit... discovered :)

Which is cool so I might change my mind on this whole topic and startcommitting like a lunatic. Oh lo is showing you out committing me. Ithink I'll setup a cron job to just format, unformat and commit somerandom file just to get back on top.

ah ah ah !!! I wish I'm better at Perl than I'm ...

        I also think that it's quite rare that a
commit break the build (it happens, say, every sic months ...),
Ha ha ha you did not just set me up for this did you? Man if you'reEmmanuel the build breaks when you walk by the computer. That's whywe want to keep the build server here in the US far far way from yourdesk.


Whatever, I do have root access to them ;)

        and when
        it does, being able to point the offending commit does not
        really helps
        to fix the breakage, because the offender is generally already
        sleeping :)
People will break the build at different times. Usually when tiredand sleepy is the case and probably occurs more often so you may havea point there. But you're dead wrong about this not being useful.Just having the shame that comes with breaking the build is a goodthing. It's what brings the arrogance and sense of can't do no wrongof all of us down.

100% agree

When you slip the CI server tells you you f**ked up. A slap in theface from a machine instead of me having to feel bad about telling youthat you screwed up again. So if the CI works at the end of the dayit cannot catch those accountable since several errors could be mixedtogether from different offenders. And it's their responsibility toget their asses up to fix the problem.

Here, I disagree. Having you ( or any other committer) complaining abouta breakage is *way* stronger than if the server complains... It makesyou more carefull (see one of my point up)


Plus how do we know which change broke the build.

As you don't use kick a build on CI every 5 minutes, the preoblem remains...

Remember we all live in the same code base and can give each other ahard time by how we conduct ourselves. These reminders are good whenwe all slip - I personally want immediate notification. I want toknow when I screw up and am ok if others see it. I want the pressureto force me to change.

I would favor a local CI then. I personally thinking about installingone on my laptop, just as a safeguard.

Good engineers architect around their own faults.
We need alarms to tell us we're messing up some how. Why? Because weall do no matter who we are. I like committers who think they willf**k up better than those that think they will not because those folksare more honest and they actually double check there work. That's allI could ask for.

I buy the idea of alarms, but they should remains alarms. Not a way toaugment the risk you can take.


        >
        > I personally would like to know immediately when I goofed
        something
        > while that something is still in my head.
        Well, run the tests before committing should be enough, isn't it ?

I do but I screw up at times. Do you run all the tests every time youcommit? We need a safety net.

Almost every time, but when it's obviously a begnign commit (like fixinga javadoc). Sadly, the evil is in details :)



        Don't get me wrong : I don't say that we should fragment
        commits as much
        as possible, nor I say that knowing which commit has broke a
        build is
        useless, I just say that a CI should be an airbag, when the
        integration

is the safety belt.


I like that comment a lot!

        never commit without fasten your safety belt
        (-Dintegration test), and in case you crash the server, the
        the airbac
        (CI) may save your life !

Just perfect. We're not so much off base. OK I'm going to leave menasty comments above even after reading this because he he I took thetime trying to be funny.

I think we are sharing the same concerns. I'm mostly with you 99% onwhat you wrote.

This is however an interesting convo, because it helps to understand therational behind the rules and guidelines we are using, and also becauseit's like a vaccine : every year, you need a fresh injection !


Thanks Alex !

--
--
cordialement, regards,
Emmanuel Lécharny
www.iktek.com
directory.apache.org

Re: Committing to Subversion (Re: Some questions about CI)

Reply via email to