Package: tidy
Version: 1:5.2.0-1.1
Severity: normal

Hello again,
besides bug #830066, another thing that I noticed after the upgrade

  [UPGRADE] tidy:amd64 20091223cvs-1.5 -> 1:5.2.0-1.1

is that tidy started to add too much vertical space.

Please let me explain the steps to reproduce the issue.

Let's start with an HTML document, for instance:

  $ wget http://www.inventati.org/frx/progs/scripts/refresh-pubring.html

Please note that this HTML file is already formatted by an older
version of Tidy (as shown by the <meta name="generator"> item).
Hence, I expect that the re-formatting performed by the current
version of Tidy should not change too much in the file.

Let's set the HTML_TIDY environment variable, in order to work around
bug #830066:

  $ export HTML_TIDY=~/.tidyrc

The ~/.tidyrc file is attached and includes:

  $ grep vertical-space ~/.tidyrc
  vertical-space: yes

Let's start by reformatting the file:

  $ tidy refresh-pubring.html > refresh-pubring_T1.html

Tidy adds two lines of vertical space around <h1> and <h2> headers,
between <li> items, around <div> items, and around <p> items.
I really think that two lines are too much vertical space: I am
convinced that the behavior of older versions of Tidy was saner
(only one line of vertical space was added).
Please restore the previous behavior.

Moreover, each </li> end tag is moved to the next line for no
good reason. Again, I think the behavior of older versions of
Tidy was saner: please fix this regression, too.
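
To put a number on the extra vertical space, a quick check like the
following could be used. This is only a sketch: max_blank_run is a
hypothetical helper name, not part of Tidy, and it assumes that a line
containing only spaces or tabs counts as blank.

```shell
# Hypothetical helper (not part of Tidy): print the length of the
# longest run of consecutive blank lines in the file given as $1.
max_blank_run() {
  awk '
    /^[ \t]*$/ { run++; if (run > max) max = run; next }  # blank line: extend the run
    { run = 0 }                                           # non-blank line: reset the run
    END { print max + 0 }                                 # prints 0 if no blank line was seen
  ' "$1"
}
```

It would be called as "max_blank_run refresh-pubring_T1.html"; running it
on the output of the old and the new Tidy version should make the
difference visible.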

On the other hand, I appreciate that the <!-- start tag for
comments is moved to the beginning of a line: I think this is
an improvement with respect to the behavior of older versions.

Please note that disabling the vertical-space option produces an
output HTML file with no vertical space at all (as expected), which
is not what I want:

  $ tidy --vertical-space no refresh-pubring.html > refresh-pubring_T2.html

Setting the option to auto produces almost unreadable HTML code
(as I expected, since the documentation says that this eliminates
nearly all newline characters):

  $ tidy --vertical-space auto refresh-pubring.html > refresh-pubring_T3.html

Again, this is not what I want.
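
For a rough side-by-side comparison of the output files, the blank
lines in each one can be counted. A minimal sketch, where blank_count
is a hypothetical helper name and lines containing only whitespace are
assumed to count as blank:

```shell
# Hypothetical helper (not part of Tidy): for each file given,
# print "<file> <number of blank lines>".
blank_count() {
  for f in "$@"; do
    # grep -c prints the number of matching lines (0 if none match)
    printf '%s %s\n' "$f" "$(grep -c '^[[:space:]]*$' "$f")"
  done
}
```

For example: "blank_count refresh-pubring_T1.html refresh-pubring_T2.html
refresh-pubring_T3.html".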


In conclusion, the best choice is setting vertical-space to "yes", which
unfortunately adds _too much_ vertical space.
I think this is a regression with respect to the previous versions
of Tidy.

Please fix this regression and/or forward my bug report upstream.

Thanks for your time!
Bye.



-- System Information:
Debian Release: stretch/sid
  APT prefers testing
  APT policy: (800, 'testing'), (500, 'unstable')
Architecture: amd64 (x86_64)

Kernel: Linux 4.6.0-1-amd64 (SMP w/4 CPU cores)
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash
Init: systemd (via /run/systemd/system)

Versions of packages tidy depends on:
ii  libc6     2.23-1
ii  libtidy5  1:5.2.0-1.1

tidy recommends no packages.

tidy suggests no packages.

-- no debconf information
decorate-inferred-ul: no
doctype: auto
drop-empty-paras: yes
drop-proprietary-attributes: yes
enclose-text: yes
hide-comments: no
logical-emphasis: yes
lower-literals: yes
merge-divs: no
merge-spans: no
numeric-entities: no
output-xhtml: yes
add-xml-decl: yes
quote-ampersand: yes
quote-marks: yes
quote-nbsp: yes
show-body-only: auto

accessibility-check: 0

indent: auto
indent-attributes: no
indent-spaces: 2
markup: yes
wrap: 72
wrap-attributes: no
vertical-space: yes

input-encoding: utf8
output-encoding: utf8

quiet: no
write-back: no
