Bug#778955: lintian: suggest check html imgs included in package

2015-03-20 Thread Kevin Ryde
I went a bit further with the few lines below. It adds some css, favicon and link checking. I reduced the check to targets under /usr/share/doc/PACKAGENAME/ since they ought to exist in the package or a package from the same source. This helps avoid false positives of cross-package links. Don't
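The restriction to /usr/share/doc/PACKAGENAME/ described above can be sketched roughly as follows. This is not the patch attached to the bug (which is Perl and is not shown in this preview); it is a minimal Python illustration, and the function names `doc_targets` and `missing_targets`, the `installed` set, and the package name `foo` are all hypothetical.

```python
import posixpath
from urllib.parse import unquote, urlsplit


def doc_targets(html_path, refs, package):
    """Resolve refs relative to html_path; keep those under the package's doc dir."""
    doc_root = f"/usr/share/doc/{package}/"
    base_dir = posixpath.dirname(html_path)
    targets = []
    for ref in refs:
        parts = urlsplit(ref)
        # Skip remote URLs (http:, mailto:, ...) and pure "#fragment" links.
        if parts.scheme or parts.netloc or not parts.path:
            continue
        target = posixpath.normpath(posixpath.join(base_dir, unquote(parts.path)))
        # Assumption: only targets inside the package's own doc directory are
        # checked, so links into other packages' docs are ignored.
        if target.startswith(doc_root):
            targets.append(target)
    return targets


def missing_targets(html_path, refs, package, installed):
    """Return in-scope targets that are absent from the set of installed paths."""
    return [t for t in doc_targets(html_path, refs, package) if t not in installed]
```

With `installed` holding the paths shipped by the package (or by packages from the same source), a link such as `../../bar/y.html` resolves outside `/usr/share/doc/foo/` and is skipped rather than reported, which is how the cross-package false positives are avoided in this sketch.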

Bug#778955: lintian: suggest check html imgs included in package

2015-02-26 Thread Kevin Ryde
I got to the few lines below so far. speed.pl compares HTML::Parser against some regexp action. Amend @filenames for likely .html to measure. The parse includes callbacks for 'a' which are not in the real code yet. New html.pm uses HTML::Parser and adds css file check. Some packages tickle
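For readers who want to try a comparison of this kind, here is a rough Python analogue. It is not speed.pl: Python's `html.parser` stands in for Perl's HTML::Parser, the regexp is an illustrative guess, and the names `RefCollector`, `refs_by_parser`, `refs_by_regexp` and `compare` are hypothetical. Timings from this sketch should not be expected to match the Perl measurements reported in the thread.

```python
import re
import timeit
from html.parser import HTMLParser

# Which attribute carries the reference for each tag of interest.
WANTED = {"img": "src", "a": "href", "link": "href"}


class RefCollector(HTMLParser):
    """Collect img src, a href and link href values via parser callbacks."""

    def __init__(self):
        super().__init__()
        self.refs = []

    def handle_starttag(self, tag, attrs):
        wanted = WANTED.get(tag)
        if wanted is None:
            return
        for name, value in attrs:
            if name == wanted and value:
                self.refs.append(value)


def refs_by_parser(text):
    collector = RefCollector()
    collector.feed(text)
    collector.close()
    return collector.refs


# A deliberately simple pattern; unlike the parser it also matches inside comments.
REF_RE = re.compile(
    r'<(?:img|a|link)\b[^>]*?\b(?:src|href)\s*=\s*["\']([^"\']+)["\']',
    re.IGNORECASE,
)


def refs_by_regexp(text):
    return REF_RE.findall(text)


def compare(text, repeat=20):
    """Time both extraction approaches on the same text (seconds for `repeat` runs)."""
    return {
        "parser": timeit.timeit(lambda: refs_by_parser(text), number=repeat),
        "regexp": timeit.timeit(lambda: refs_by_regexp(text), number=repeat),
    }
```

Feeding `compare` the contents of a few large .html files gives a per-approach timing; checking that both functions return the same references on well-formed input is a useful sanity step before trusting the numbers.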

Bug#778955: lintian: suggest check html imgs included in package

2015-02-25 Thread Kevin Ryde
I wrote: File::Map Actually I see it's a touch slow on mapping lots of small files (I'd used it on big files). HTML::Parser is more promising. I measure it faster than slurp and regexp.

Bug#778955: lintian: suggest check html imgs included in package

2015-02-25 Thread Niels Thykier
On 2015-02-25 22:35, Kevin Ryde wrote: I wrote: File::Map Actually I see it's a touch slow on mapping lots of small files (I'd used it on big files). HTML::Parser is more promising. I measure it faster than slurp and regexp. That sounds good. I am looking forward to seeing the end

Bug#778955: lintian: suggest check html imgs included in package

2015-02-25 Thread Niels Thykier
On 2015-02-24 11:51, Kevin Ryde wrote: [...] Incidentally for performance perhaps you've thought about mmap to slurp (non-gzipped content). I've had some joy from File::Map (not in Debian currently). The only disadvantage I've noticed is mmap reads seem higher kernel priority than plain
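The mmap-versus-slurp idea can be illustrated with a small sketch. It uses Python's `mmap` module rather than Perl's File::Map, so it only shows the general technique; the pattern `IMG_SRC` and the function names are hypothetical.

```python
import mmap
import os
import re

# Byte pattern, since both functions scan raw file contents.
IMG_SRC = re.compile(rb'<img\b[^>]*?\bsrc\s*=\s*["\']([^"\']+)["\']', re.IGNORECASE)


def img_srcs_slurp(path):
    """Read the whole file into memory, then scan it."""
    with open(path, "rb") as fh:
        return IMG_SRC.findall(fh.read())


def img_srcs_mmap(path):
    """Map the file read-only and scan the mapping without an explicit read."""
    with open(path, "rb") as fh:
        if os.fstat(fh.fileno()).st_size == 0:
            return []  # an empty file cannot be mapped
        with mmap.mmap(fh.fileno(), 0, access=mmap.ACCESS_READ) as view:
            return IMG_SRC.findall(view)
```

Both functions return the same list of byte strings; which one is faster depends on file sizes and the platform, and the per-file mapping overhead is the kind of cost that can dominate when there are many small files.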

Bug#778955: lintian: suggest check html imgs included in package

2015-02-24 Thread Kevin Ryde
Niels Thykier ni...@thykier.net writes: only a few packages. In your list of big htmls, five tickle my check even in its present state, and seem to be real problems, though at most medium badness. fp-docs-2.6.4 IDE screenshots libgcj-doc graphics drawing diagrams

Bug#778955: lintian: suggest check html imgs included in package

2015-02-23 Thread Niels Thykier
On 2015-02-24 01:53, Kevin Ryde wrote: Niels Thykier ni...@thykier.net writes: I do have some concerns on the performance front. On some packages, this will be the second slowest check taking 10s or more. Hmm. If html files are usually only moderately common then maybe the totality

Bug#778955: lintian: suggest check html imgs included in package

2015-02-23 Thread Kevin Ryde
Niels Thykier ni...@thykier.net writes: I do have some concerns on the performance front. On some packages, this will be the second slowest check taking 10s or more. Hmm. If html files are usually only moderately common then maybe the totality across all packages is acceptable, even if a few

Bug#778955: lintian: suggest check html imgs included in package

2015-02-22 Thread Niels Thykier
Control: tags -1 moreinfo On 2015-02-22 05:26, Kevin Ryde wrote: Package: lintian Version: 2.5.30+deb8u3 Severity: wishlist Tags: patch If a .html file is in a package then usually its img files should be in the package too so it displays nicely. I suggest the few lines below to check

Bug#778955: lintian: suggest check html imgs included in package

2015-02-21 Thread Kevin Ryde
Package: lintian Version: 2.5.30+deb8u3 Severity: wishlist Tags: patch If a .html file is in a package then usually its img files should be in the package too so it displays nicely. I suggest the few lines below to check this. Without picking on any particular maintainers, missing images can be
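The basic idea of the proposed check might look like the sketch below. The actual patch is Perl and is not reproduced in this preview, so this Python version only illustrates the concept; `ImgCollector`, `missing_images` and the `package_files` set are hypothetical names.

```python
import posixpath
from html.parser import HTMLParser
from urllib.parse import urlsplit


class ImgCollector(HTMLParser):
    """Record the src attribute of every img tag."""

    def __init__(self):
        super().__init__()
        self.srcs = []

    def handle_starttag(self, tag, attrs):
        if tag == "img":
            src = dict(attrs).get("src")
            if src:
                self.srcs.append(src)


def missing_images(html_path, html_text, package_files):
    """Return local image paths referenced by the HTML but absent from the package."""
    collector = ImgCollector()
    collector.feed(html_text)
    collector.close()
    base_dir = posixpath.dirname(html_path)
    missing = []
    for src in collector.srcs:
        parts = urlsplit(src)
        # Remote and data: images cannot be checked against the package contents.
        if parts.scheme or parts.netloc or not parts.path:
            continue
        target = posixpath.normpath(posixpath.join(base_dir, parts.path))
        if target not in package_files:
            missing.append(target)
    return missing
```

Here `package_files` is assumed to be the set of paths shipped in the package, written in the same form as `html_path`; each entry returned would correspond to one image the HTML page expects but cannot display.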