Re: [OpenAFS] rxk5 Mainline Issues?

Simon Wilkinson Sun, 08 Nov 2009 10:12:16 -0800

Could you expand a bit on the “number of issues” that remain if it’s functionally complete? Is there a published list of things that need to be in place for something to be mainlined, in general, or are these issues only for Rxk5?

Marcus has pointed out that you were asking a more general question, and it occurred to me that, whilst I've talked about these issues at both the Stanford and Rome Workshops, there are folks who won't have been to either. So, here's a rough summary of what I think the process of getting code upstream is. Bear in mind that I'm very much a single voice here, and what I say has no more weight than anyone else's - but I've said this lot a few times now without getting shouted down...

Firstly, any changes to the protocol have to go past the afs3-standardisation list. This effectively means anything on the wire, but may also extend to (for example) the pioctl interfaces when there are multiple implementations of these. Proposals should be presented in the form of an internet draft. We're still coming up to speed on how these get dealt with - at present, the level of comment generally depends on the scale of the change. If the proposal is for a minor tweak to a debugging interface, it's likely to get through easily. If it's a wholesale change to a fundamental bit of AFS, then things will take longer, and probably involve actively soliciting responses.

Secondly, any major changes should be discussed before they're implemented. The openafs-devel list is the best place to do this. It is of course up to an author what to do with the feedback they receive. My general advice would be to listen carefully if there are suggestions that the fundamental approach is wrong, but to be cautious of demands for feature creep. One key point is that the people commenting now are the people who'll be reviewing the finished code. If nobody agrees that an approach is the best solution now, finding reviewers when it comes to getting it into the tree is likely to be difficult. It's at this point that it'll also be possible to get an idea of when merge windows are available for this kind of change. Typically, large scale changes won't be accepted, even into the development branch, around branch points for stable releases.

Thirdly, code gets written, with steps one and two repeating as required. If it's a large project, please keep the community informed, especially if you are significantly changing previously discussed designs. When coding, bear in mind that the tree has a documented coding style, which should be followed as far as possible. Try to keep your code as small modular chunks. Consider, as you write it, how it would make sense to split it for an upstream contribution. If your code requires interface changes, then split off these changes, and make them separate commits. Avoid bundling unrelated changes in with your core work - if you need to fix bugs in existing code, then make those changes as separate patches, and contribute them as you work. Cutting down the size of your main code drop will make the final integration steps much, much easier.

At the end of all of this, there will be a collection of code to contribute as a set of patches. The rough guidance here is that there should be one patch per change, and one change per patch - but good judgement is necessary as 10,000 individual 10 line patches are no more manageable than a single 100,000 line patch.
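
As a rough illustration of that size judgement, here is a minimal Python sketch that counts the changed lines in each patch of a series and flags the ones a reviewer is unlikely to get through in a spare hour. The function names and the 1000-line threshold are invented for this example; this is not part of any OpenAFS tooling, and it assumes each patch is plain unified diff text.

```python
# Hypothetical helper, for illustration only: size up a patch series
# before sending it for review. Assumes plain unified diff text.

def changed_lines(patch_text):
    """Count added and removed lines in one unified diff."""
    count = 0
    for line in patch_text.splitlines():
        # Skip the '--- a/file' and '+++ b/file' header lines. Caveat: a
        # removed line whose own content begins with '--' is skipped too.
        if line.startswith(("+++", "---")):
            continue
        if line.startswith(("+", "-")):
            count += 1
    return count


def review_report(series, too_large=1000):
    """Return (name, changed_line_count, verdict) for each (name, text) pair.

    The too_large threshold is an arbitrary assumption, not a project rule.
    """
    report = []
    for name, text in series:
        n = changed_lines(text)
        verdict = "consider splitting" if n > too_large else "ok"
        report.append((name, n, verdict))
    return report


if __name__ == "__main__":
    sample = "--- a/f.c\n+++ b/f.c\n@@ -1,2 +1,2 @@\n-old\n+new\n ctx\n"
    for name, n, verdict in review_report([("0001-example.patch", sample)]):
        print(name, n, verdict)
```

A line count is only a crude proxy: whether a patch is "one change" still needs a human to decide, and a very long series of tiny patches is its own problem.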

The key thing to consider when splitting code into patches is that the people who review them are volunteering their time to do so - there's no commitment upon them to spend time evaluating any code, so contributors should make it as easy for them as possible. This is one reason why breaking large changes up makes life easier - it's a lot easier for reviewers to find spare hours here and there, than it is to contemplate looking at something that's obviously going to take days to get through. Also, a set of sensibly created patches is far easier to review, and to test, than a single monolithic change. Providing reviewers with a guide to the code is hugely helpful - either through commit comments, in-tree design documentation, or a separate document.

Contributors should expect that their code won't be accepted first time round. There are likely to be nits picked up by the review process, although the more communication that's occurred at the beginning, the less significant those nits are likely to be.

The final issue I'd like to address is that of _when_ code should make its way into the development branches. Maintaining code out of tree is hard - keeping it up to date is a pain - and people will often break code that's not in the tree (or even that is in the tree, but on a different branch) simply because they don't realise it's there, or don't test against it. This leads to a general desire by contributors to get code in as quickly as possible. However, there is a flip side. There are a lot of developers working off the main branch, and committing code that breaks the tree or the build is a bad thing. Once code has gone in, the longer it remains in the tree, the harder it becomes to remove it. This is a real problem when a new 'stable' branch is required - if the development branch is broken, how do you create the new stable branch? This is exactly the state of affairs that demand attach left us in - with a development branch that was too broken to even consider creating a 1.6 from, but with a set of changes that it would be hugely time-consuming to unpick from the rest of the code. There is, of course, the question of branching earlier - you have 'stable', 'development' and 'very bleeding edge'. But this just shifts the problem slightly sideways. Additionally, each new branch creates significantly more work for everyone - as patches need to be created, tested, and committed against each new branch.

So, I think the best answer is 'when it's ready' and 'not near proposed branch points'. Lots of projects have clearer answers to these particular questions - for example, Linux has its merge windows, and Mozilla has code freezes around releases, where no feature commits are permitted. I'd argue that OpenAFS probably wants to adopt something along these lines too, along with a clearer set of data-driven targets for stable release. But that's an argument for another email.

I should round off by saying that absolutely none of this is specific to OpenAFS - it's all just part of a functioning Open Source ecosystem. If you're interested, I'd highly recommend reading:

How to Participate in the Linux Community (A Guide to the Kernel Development Process)

http://ldn.linuxfoundation.org/how-participate-linux-community

Mozilla Hacker's Getting Started Guide
https://developer.mozilla.org/en/Mozilla_Hacker's_Getting_Started_Guide

or any other big projects' developer guidelines - we're all struggling with the same problems!


Hope that helps,

Simon.

