I went a bit further with the few lines below. They add some CSS,
favicon and link checking.
I reduced the check to targets under /usr/share/doc/PACKAGENAME/ since
they ought to exist in the package or a package from the same source.
This helps avoid false positives of cross-package links. Don't
I got to the few lines below so far. speed.pl compares HTML::Parser
against some regexp action. Amend @filenames to list some likely .html
files to measure. The parse includes callbacks for 'a' which are not in
the real code yet.
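For readers without speed.pl to hand, the shape of such a comparison can be sketched roughly as below. This is a Python analogue for illustration only, not the Perl script from this thread. The sample document, the function names and the repeat count are all made up, and the timings say nothing about how HTML::Parser performs in Perl.

```python
import re
import timeit
from html.parser import HTMLParser

# Made-up sample input; a real run would read actual .html files instead.
SAMPLE = ('<html><body>'
          + '<p>text <img src="pic.png"> <a href="x.html">x</a></p>' * 200
          + '</body></html>')


class LinkCollector(HTMLParser):
    """Collect img src and a href values through parser callbacks."""

    def __init__(self):
        super().__init__()
        self.targets = []

    def handle_starttag(self, tag, attrs):
        for name, value in attrs:
            if value is None:
                continue
            if (tag == 'img' and name == 'src') or (tag == 'a' and name == 'href'):
                self.targets.append(value)


def targets_by_parser(text):
    parser = LinkCollector()
    parser.feed(text)
    parser.close()
    return parser.targets


# Deliberately simple pattern: quoted attribute values only.
TARGET_RE = re.compile(
    r'<(?:img\b[^>]*?\bsrc|a\b[^>]*?\bhref)\s*=\s*["\']([^"\']*)["\']',
    re.IGNORECASE)


def targets_by_regexp(text):
    return TARGET_RE.findall(text)


if __name__ == '__main__':
    # Time both approaches on the same input.
    for func in (targets_by_parser, targets_by_regexp):
        seconds = timeit.timeit(lambda: func(SAMPLE), number=50)
        print(f'{func.__name__}: {seconds:.3f}s')
```

The regexp here only handles quoted attribute values, so the two approaches agree on tidy input but can differ on messier HTML.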
New html.pm uses HTML::Parser and adds css file check. Some packages
tickle
I wrote:
File::Map
Actually I see it's a touch slow on mapping lots of small files (I'd used
it on big files).
HTML::Parser is more promising. I measure it faster than slurp and
regexp.
--
To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org
with a subject of unsubscribe.
On 2015-02-25 22:35, Kevin Ryde wrote:
I wrote:
File::Map
Actually I see it's a touch slow on mapping lots of small files (I'd used
it on big files).
HTML::Parser is more promising. I measure it faster than slurp and
regexp.
That sounds good. I am looking forward to seeing the end
On 2015-02-24 11:51, Kevin Ryde wrote:
[...]
Incidentally for performance perhaps you've thought about mmap to slurp
(non-gzipped content). I've had some joy from File::Map (not in debian
currently).
The only disadvantage I've noticed is mmap reads seem higher kernel
priority than plain
Niels Thykier ni...@thykier.net writes:
only a few packages.
In your list of big htmls, five tickle my check even in its present
state, and seem to be real problems, though at most medium badness.
fp-docs-2.6.4 IDE screenshots
libgcj-doc graphics drawing diagrams
On 2015-02-24 01:53, Kevin Ryde wrote:
Niels Thykier ni...@thykier.net writes:
I do have some concerns on the performance front. On some packages,
this will be the second slowest check taking 10s or more.
Hmm. If html files are usually only moderately common then maybe the
totality
Niels Thykier ni...@thykier.net writes:
I do have some concerns on the performance front. On some packages,
this will be the second slowest check taking 10s or more.
Hmm. If html files are usually only moderately common then maybe the
totality across all packages is acceptable, even if a few
Control: tags -1 moreinfo
On 2015-02-22 05:26, Kevin Ryde wrote:
Package: lintian
Version: 2.5.30+deb8u3
Severity: wishlist
Tags: patch
If a .html file is in a package then usually its img files should be
in the package too so it displays nicely. I suggest the few lines below
to check
Package: lintian
Version: 2.5.30+deb8u3
Severity: wishlist
Tags: patch
If a .html file is in a package then usually its img files should be
in the package too so it displays nicely. I suggest the few lines below
to check this.
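The idea of such a check can be sketched roughly as follows. This is a Python illustration only, not the Perl patch proposed for lintian. The names missing_images and package_files are hypothetical, and the sketch assumes the package contents are available as a set of paths without a leading slash.

```python
import posixpath
from html.parser import HTMLParser
from urllib.parse import unquote, urlsplit


class ImgSrcCollector(HTMLParser):
    """Collect the src attribute of every img tag."""

    def __init__(self):
        super().__init__()
        self.sources = []

    def handle_starttag(self, tag, attrs):
        if tag == 'img':
            for name, value in attrs:
                if name == 'src' and value:
                    self.sources.append(value)


def missing_images(html_path, html_text, package_files):
    """Return relative img targets of html_path not found in package_files."""
    collector = ImgSrcCollector()
    collector.feed(html_text)
    collector.close()

    base = posixpath.dirname(html_path)
    missing = []
    for src in collector.sources:
        parts = urlsplit(src)
        # Skip remote and data: URLs; only local files can be checked.
        if parts.scheme or parts.netloc or not parts.path:
            continue
        # Absolute paths are skipped in this sketch.
        if parts.path.startswith('/'):
            continue
        target = posixpath.normpath(posixpath.join(base, unquote(parts.path)))
        if target not in package_files:
            missing.append(target)
    return missing
```

For example, with package_files holding usr/share/doc/foo/index.html and usr/share/doc/foo/img/ok.png, a page that references img/ok.png and img/gone.png would report only usr/share/doc/foo/img/gone.png.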
Without picking on any particular maintainers, missing images can be