[ 
https://issues.apache.org/jira/browse/NUTCH-881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12897929#action_12897929
 ] 

Alex McLintock commented on NUTCH-881:
--------------------------------------

The "other?" would be DITA. This is in some ways "DocBook Version 2" in that it 
seems to have most of the good features of DocBook - but be better for large 
software projects rather than single documents. 

Basically the documentation is written and stored in XML (in svn/git/cvs 
whatever). XSLT / xsl:fo is used to generate html and pdf from that single 
source. 


There is a precedent too. Apache Derby is using DITA for its documentation

http://db.apache.org/derby/manuals/dita.html

I don't have experience of this, but DITA was recommended to me by a friendly 
documentation professional. 


I'm happy to learn about this and try to set up a framework

> Good quality documentation for Nutch
> ------------------------------------
>
>                 Key: NUTCH-881
>                 URL: https://issues.apache.org/jira/browse/NUTCH-881
>             Project: Nutch
>          Issue Type: Improvement
>          Components: documentation
>    Affects Versions: 2.0
>            Reporter: Andrzej Bialecki 
>
> This is, and has been, a long standing request from Nutch users. This becomes 
> an acute need as we redesign Nutch 2.0, because the collective knowledge and 
> the Wiki will no longer be useful without massive amount of editing.
> IMHO the reference documentation should be in SVN, and not on the Wiki - the 
> Wiki is good for casual information and recipes but I think it's too messy 
> and not reliable enough as a reference.
> I propose to start with the following:
>  1. let's decide on the format of the docs. Each format has its own pros and 
> cons:
>   * HTML: easy to work with, but formatting may be messy unless we edit it by 
> hand, at which point it's no longer so easy... Good toolchains to convert to 
> other formats, but limited expressiveness of larger structures (e.g. book, 
> chapters, TOC, multi-column layouts, etc).
>   * Docbook: learning curve is higher, but not insurmountable... Naturally 
> yields very good structure. Figures/diagrams may be problematic - different 
> renderers (html, pdf) like to treat the scaling and placing somewhat 
> differently.
>   * Wiki-style (Confluence or TWiki): easy to use, but limited control over 
> larger structures. Maven Doxia can format cwiki, twiki, and a host of other 
> formats to e.g. html and pdf.
>   * other?
>  2. start documenting the main tools and the main APIs (e.g. the plugins and 
> all the extension points). We can of course reuse material from the Wiki and 
> from various presentations (e.g. the ApacheCon slides).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to