At 22:04 09/01/2001 +0000, Gervase Markham wrote:
> > >Just about. A few site guidelines, such as "An H1 at the top, please, and
> > >H2s as major headings blah blah" should do the trick. We already have such
> > >things, of a sort - http://www.mozilla.org/README-style.html .
> >
> > No that doesn't do it at all, where is the project structure?  Where the
> > horizontal structures?
>
>I don't follow.

The point I thought was to organise the documentation so that it was 
structured not just within a particular documents but that the documents 
themselves were structured together in ways which made it feasible to have 
a variety of views depending on how a particular viewer sliced it.

So that, documents were identified as belonging to a particular project 
(XUL for instance) and might also belong to other projects or other layers 
of that project, (UI, Specifications, Guidelines, Tutorials, etc).


> > HTML strict or not is just layout it isn't logical
> > structure at all.  Morphing things like headers as structural content
> > doesn't do the job.
>
>Headers _are_ structural content!

If HTML strict imposed rules on <Hn> then that might be true, but even then 
it would be true relative only to other headers within the same 
document.  Having a convention external to the document that H1 means this 
and H2 means that isn't structuring documents, especially when there is no 
defined order to those tags.  You can have H1 followed by H6 followed by H3 
followed by H1.

Using HTML also presupposes that the document is somewhat fixed, especially 
in relation to other documents and the links to them.  The author, or more 
accurately the document has to know the exact URL to link to (unless you 
use some complicated CGI).

Originally I proposed a very small dialect of XML that would allow the 
logical structuring of documents without imposing that pain on 
authors.   DpcBook is a very large and very complete method to do the same 
thing and probably in structuring terms superior to a custom rolled 
dialect, but it is more complicated.

To illustrate what I mean, imagine a document regarding plugins and the 
browser.  Its overall project is  the Browser (whether SeaMonkey is 
appropriate or not is irrelevant), it happens to be and end user document 
and it contains examples and links to another document on 4.x plugins as 
well as a reference to the plugin SDK.

Using HTML the document, other than within the content or its place within 
a file system has no structured way of defining the project it belongs to, 
that it is an end user document and that it contains examples.  The 
document also has to know within the content the URL of each of the other 
documents which may or may not have been written by the same author, and in 
fact they may not even exist as of yet.

A document with structure tags can resolve those issues, a straightforward 
parser can then take that document and deliver its content appropriately.

You could have something like...

<document='MyDocumentFilename' project name = "Browser" level=enduser 
topic='Plugin Overview'>
         Text..... could include simple modifiers such as <B> <I> etc...
         <connect project='Browser' type = 'document' topic='netscape 4.x 
plugin support'/>
         <example name='Example 1.'>
                 text of an example
         </example>
         more text....
         <references rows = 25>
                 table of references text
                 <connect project='Browser' type = 'document' topic='Plugin 
SDK'/>
                 <connect project='Unknown' type='http' 
topic='www.reallygoodstuff.com/plugins'/>
         </references>
</document>

And so on.  The above structure was without a great deal of  thought and 
I'd certainly agree that there are holes in it, how would topics be defined 
and so on (I'd go for a topic registry).  I'd also not make it too 
restrictive you could have the parser apply a strict DTD but allow tags it 
didn't recognise to just live as part of the content, that way you needn't 
reinvent wheels to cope with vestiges of layout markup a more appropriate 
parser downstream can cope with the output.

It wouldn't be too onerous for someone to take either plain text or god 
forbid HTML and tag it up with something like that kind of logical 
structure, it certainly isn't beyond the wit of us to write a parser that 
can take the above and emit HTML strict out the other side.  Naturally CSS 
would apply just as well as in any other XML dialect.

The advantages are that documents can be integrated in a variety of 
directions without any of the documents needing to know precisely where the 
others are in relation to them, content within documents (examples say), 
can be addressed directly without too much pain and the level of 
documentation itself can be structured so that you can see all the end user 
documentation together or all the documentation for a project together.

A user browsing or downloading the documentation need only define the kind 
of documentation they want and the appropriate documentation is delivered, 
it becomes easier to treat this as a publishing exercise rather than a 
dumping ground for barely laid out text with links that erode over time.

>
> > What on earth is wrong with having authors produce just content, why should
> > an author learn a HTML layout app?
>
>As opposed to Emacs to produce DocBook? Or are you recommending that
>everyone submit as plain text ASCII, and someone translates it into
>something browser-readable? I don't understand.

I've said people either submit plain text (I left out ASCII because of 
double byte characters and such), or they submit it already logically 
structured to whatever format is agreed on.  The process of submitting 
would mean that any document would have to go through some human somewhere 
to either be structurally marked up or for it to be validated by a 
validating engine.  That in itself isn't too bad because documents have to 
go through some kind of review process and as the tagging is not about 
layout but about structure its actually a lot easier and a lot quicker to 
do than to fix someone's badly formed HTML.

Of course if someone submits badly formed HTML in the first place that 
would still take time to fix, hence my preference for submitting plain text.


> > Some of them may, that's true but an
> > awful lot of good authors know 4 things about creating documents on a
> > computer, how to open a document, how to operate a keyboard, how to save a
> > document and how to forget to save a document.
>
>Are we talking about "a lot of authors" or "a lot of authors who are going
>to be contributing documentation/pages to mozilla.org"?

Authors in general, my tongue was firmly lodged in the buccal cavity :-)

Simon



>Gerv

===============================================
The more exotic the Project name the more ordinary the Product
S.P.L.


Reply via email to