On Sat, Jan 05, 2013 at 11:36:15AM +0100, Willi Mann wrote:
> Thanks for looking at this bug. It seems we are getting close to zero
> bugs for the debian package, since the next upstream version will have
> working latex output again and fix the last remaining bug.

Great news.

> > 0.21.2-2 does indeed fix the closing comment tag, but it does still
> > have "--" within the comment, which is potentially problematic.  I
> > won't go into the gory details, but W3 note that:
> > 
> > "Authors should avoid putting two or more adjacent hyphens inside 
> > comments."
> > 
> > http://www.w3.org/TR/html4/intro/sgmltut.html#h-3.2.4
> > 
> > It seems in practice it doesn't cause problems with the browsers I 
> > checked, so I think leaving this bug closed is appropriate as
> > unrtf's output no longer causes the body of the document to appear
> > as a comment.
> 
> Since it is "should" but not "must", a fully conforming HTML parser
> should be able to deal with the hyphens.

Sadly it's not quite that simple.  There's a reasonable summary of the
history here:

http://www.howtocreate.co.uk/SGMLComments.html

> However, fixing this is trivial. Dave, can we fix this in 0.21.3,
> please?

There's another bug in this area - if the \author in the RTF includes
"-->" then that gets copied across verbatim into the HTML, which closes
the comment early, and you get something like this in the output:

    <!-- author: foo-->bar -->

It would be more useful to map \author to a meta tag instead of a
comment (since then it's automatically available to other tools), and
HTML entity encoding of the value then avoids such problems:

    <meta name="author" content="foo--&gt;bar">
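
To illustrate the encoding step, here is a rough sketch in Python (unrtf
itself is C, so this is not a patch, and author_meta_tag is just a name
I've made up for the example).  It assumes the author string has already
been extracted from the RTF:

```python
from html import escape


def author_meta_tag(author: str) -> str:
    """Build a <meta> author tag with the value entity-encoded.

    Hypothetical helper for illustration only; not part of unrtf.
    """
    # escape() with quote=True also encodes " and ', so the value cannot
    # terminate the content="..." attribute early.
    return '<meta name="author" content="{}">'.format(escape(author, quote=True))


print(author_meta_tag("foo-->bar"))
# prints: <meta name="author" content="foo--&gt;bar">
```

The same approach covers an author name containing a double quote, which
would otherwise end the attribute value prematurely.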

Cheers,
    Olly

