Good quality documentation for Nutch
------------------------------------
Key: NUTCH-881
URL: https://issues.apache.org/jira/browse/NUTCH-881
Project: Nutch
Issue Type: Improvement
Components: documentation
Affects Versions: 2.0
Reporter: Andrzej Bialecki
This is, and has been, a long-standing request from Nutch users. It becomes
an acute need as we redesign Nutch 2.0, because the collective knowledge and
the Wiki will no longer be useful without a massive amount of editing.
IMHO the reference documentation should be in SVN, and not on the Wiki - the
Wiki is good for casual information and recipes, but I think it's too messy and
not reliable enough to serve as a reference.
I propose to start with the following:
1. let's decide on the format of the docs. Each format has its own pros and
cons:
* HTML: easy to work with, but formatting may be messy unless we edit it by
hand, at which point it's no longer so easy... Good toolchains to convert to
other formats, but limited expressiveness for larger structures (e.g. book,
chapters, TOC, multi-column layouts, etc).
* DocBook: the learning curve is steeper, but not insurmountable... Naturally
yields very good structure. Figures/diagrams may be problematic - different
renderers (HTML, PDF) tend to treat scaling and placement somewhat
differently.
* Wiki-style (Confluence or TWiki): easy to use, but limited control over
larger structures. Maven Doxia can convert cwiki, twiki, and a host of other
formats to e.g. HTML and PDF.
* other?
2. start documenting the main tools and the main APIs (e.g. the plugins and
all the extension points). We can of course reuse material from the Wiki and
from various presentations (e.g. the ApacheCon slides).