Source Code Versioning: Ideas, Methods, Tools for Specific scenario as a Content Writer?

Nigel Whitaker Wed, 12 Dec 2012 09:55:21 -0800

Hi Alex,

I've just been catching up on my email list reading and noticed you'veraised this issue here and also in opendocument-users and xsl-list.I'll answer here, with a DocBook specific response, but the samearguments apply to other documentation formats - but perhaps thisdiscussion belongs in docbook-apps and should be continued there?

We (DeltaXML) currently provide a general purpose/well-formed xmlcompare and a DocBook specific compare product. We are also workingon 3-way or more generally n-way 'Merge' products that will have morerelevance when used with revision control systems. However, Thomashas already given you some good advice - an existing VCS system, perhapswith additional normalization steps, may meet your requirements.

You asked do people use software revision control for documents - theanswer is certainly yes. As a company, we do this; we version ourproduct documentation with our source code. Until around 2 years ago itwas subversion (svn) and its now mainly mercurial (hg), git would havebeen an appropriate alternative. Other approaches are to use a contentmanagement system (CMS) or a filesystem with webdav/deltaV support. Iwas recently at the DITA Europe 2012 conference (yes I know! - but theanswer I suspect is equally applicable to DocBook) and tried to assesthe use of software version control - when talking to people I oftenasked the question - "do you use a CMS or software version controlsystems" - the result was around 50/50. I suspect that software/ITcompanies have the expertise readily available, and like us, find iteasier to adapt to using software version control.

Your follow-on question/email asked about line-based vs xml-awarecomparison:


On 08/12/2012 16:25, Alex S wrote:

Thank you for your time & responses. I remember reading somewhere thata pure text/ linear comparison based tool/ system may not be ideal tocompare & merge XML tree structure based documents.

When using an XML-aware algorithm as part of the merge/update processthere is a possibility to get better results. For example, consider auser on one branch using an editing or authoring tool which mixes upattributes, for example reordering them or re-indenting them overmultiple lines. When the branches are merged you are likely to get a"false conflict" with a line based algorithm, whereas an XML-awarealgorithm shouldn't identify a change.

Taking this a step further, you can do more if the tools/algorithmsunderstand the grammar or XML format being processed. Here's a DocBook5 example, in the ancestor revision there is a section with a title andan itemized list:


    <sect1>
        <title>Merge example</title>
        <para>In this example...</para>...

In one branch a user adds an indexterm, in another branch a revhistoryis added.

The ancestor revision used in 3 way diff or merge algorithms allows themto work out that different sets of lines have been inserted at the samepoint (relative to the ancestor) and that they are not identical, andhence gives a conflict. This is the conflict from mercurial:


    <sect1>
        <title>Merge example</title>
<<<<<<< mine
        <indexterm><primary>Revision Control</primary></indexterm>
=======
        <revhistory>
            <revision>

<date>2012-12-12</date><revdescription><simpara>Testinghg</simpara></revdescription>

            </revision>
        </revhistory>
>>>>>>> theirs
        <para>In this example...</para>

However, the DocBook 5 grammar allows "one or more of"revhistory/indexterm and so you could argue that this isn't really aconflict here. Conversely, there are places in DocBook where you have achoice of elements without a one-or-more (+), zero-or-more (*)repetition qualifier and adding both of the choices from differentbranches is definitely a conflict irrespective of how they arerepresented as lines. We propose that in an XML 'grammar aware'system, conflict can and should be related to the grammar rules.

We are addressing the software version control use-case with theseenhanced types of conflict detection in our upcoming 'merge' products.Integrations with hg, git and svn (probably in that order) areplanned. One of the problems we found in the past was that softwareversion control usually handles just binary or text files. svn allowedyou to plug-in alternative merge or diff tools, but only for all typesof text file. We are planning to take another look at the interfaces tosee if there are any ways in which we can plug-in our algorithms onlyfor specific types of file.


Thanks,

Nigel

--
Nigel Whitaker, Software Architect, DeltaXML Ltd. "Experts in information 
change"
[email protected]   http://www.deltaxml.com   +44 1684 869035
Registered in England: 02528681 Reg. Office: Monsell House, WR8 0QN, UK


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Re: [docbook] XML/XSL Revision Control/ Source Code Versioning: Ideas, Methods, Tools for Specific scenario as a Content Writer?

Reply via email to