In the HTML Tidy configuration file, I'd remove
enclose-block-text: yes
The default is no, and the feature is broken. It will add P elements in
lots of places they are not needed, and in many cases in ways that
change the way pages render. For example, it changes
<td>This is fine strict markup</td> and
<li>This is fine strict markup</td>
to
<td><p>this is fine strict markup</p></td> and
<li><p>this is fine strict markup</p></li>
See
http://search.w3.org/Public/cgi-bin/query?mss=simple&pg=q&what=web&filter=lists&fmt=.&q=%2Bhtml-tidy+enclose-block-text&search=Search
Also, I see you use the undocumented 'doctype: none' configuration. This
is very cool and the way I run Tidy. The effect is to ignore and retain
the existing document declaration.
Since your Perl script resets the document declaration anyway, you might
want to change this to the documented
doctype: omit
Also $quot; is really only needed when you put a double quote inside a
double quoted attribute. My preference is for "quote-marks: no".
Robert
Henri Sivonen wrote:
> I have updated http://www.pp.htv.fi/hsivone1/moz-org2strict.html
>
> I added a new tool for minimizing the diffs. I'd like to hear opinions
> about the process I have described in the document. Is it OK? Can we
> make it official?
>
>