Re: Fix and improve reference links (was: GSoC status: classification, output format and more)

2008-08-12 Thread Frank Lichtenheld
Just some small comments: On Tue, Aug 12, 2008 at 01:05:19PM +0200, Jordà Polo wrote: On Tue, Aug 12, 2008 at 06:48:37AM +0200, Frank Lichtenheld wrote: Do you maybe have an idea how we could enable the devref four part section number links. Maybe it could fall back to the three part

Re: Fix and improve reference links (was: GSoC status: classification, output format and more)

2008-08-11 Thread Jordà Polo
On Mon, Aug 11, 2008 at 05:22:37AM +0200, Frank Lichtenheld wrote: Some comments regarding your patches: Thanks for your comments. Everything should be fixed now. I also made a few more improvements: links to manpages.d.n for man references, and support for the remaining manuals (debconf-spec,

Re: GSoC status: classification, output format and more

2008-08-11 Thread Jordà Polo
On Mon, Aug 11, 2008 at 06:08:03AM +0200, Frank Lichtenheld wrote: Is it still your feeling that this isn't ready to get merged? Since we now internally abandonded lenny for the unstable versions, I see no real reason against merging it from our side. Yes. I'll be reviewing and polishing it

Re: Fix and improve reference links (was: GSoC status: classification, output format and more)

2008-08-11 Thread Frank Lichtenheld
On Mon, Aug 11, 2008 at 12:34:00PM +0200, Jordà Polo wrote: On Mon, Aug 11, 2008 at 05:22:37AM +0200, Frank Lichtenheld wrote: Some comments regarding your patches: Thanks for your comments. Everything should be fixed now. I also made a few more improvements: links to manpages.d.n for man

Re: Fix and improve reference links (was: GSoC status: classification, output format and more)

2008-08-10 Thread Frank Lichtenheld
Some comments regarding your patches: On Wed, Aug 06, 2008 at 10:41:40PM +0200, Jordà Polo wrote: +my $ddoc_title = 'title(.+?)\/title'; +my $ddoc_ref = 'a href=(.+?)([A-Z]|[A-Z]?[\d\.]+?)\.?\s+'. + '([\w\s[:punct:]]+?)\/a'; +my @ddoc_fields = [ [ 'url' ], [ 'section' ], [

Re: GSoC status: classification, output format and more

2008-08-10 Thread Frank Lichtenheld
On Fri, Jul 18, 2008 at 03:30:24PM +0200, Jordà Polo wrote: Finally, the changes to the code are available at git.d.o[7][8], but note that it is not yet ready to be merged and I may rearrange commits if needed (so don't be surprised if you see weird things in the log after pulling new stuff, a

Re: GSoC status: classification, output format and more

2008-08-08 Thread Jordà Polo
On Thu, Jul 24, 2008 at 07:40:29PM -0700, Russ Allbery wrote: In that world, there are a whole pile of tags in Lintian right now that have no Ref information at all. These tags fall roughly into the following categories: * Tags we issue for things that are just obviously broken, even if not

Re: Fix and improve reference links (was: GSoC status: classification, output format and more)

2008-08-06 Thread Jordà Polo
On Sat, Aug 02, 2008 at 09:28:25PM +0200, Jordà Polo wrote: The status of links on lintian.d.o has been bothering me for a while as well. lib/manual_refs is terribly outdated and manual_refs_update.pl is broken, so I tried to fix the problems (and implement some new features as well). I have

Fix and improve reference links (was: GSoC status: classification, output format and more)

2008-08-02 Thread Jordà Polo
(I sent this message yesterday, but it looks like it hasn't reached the list yet. I'm sending it again because other mails sent later have been processed, and also because the original mail included suspicious attachments - a patch with ~900 URLs - which I will simply link this time instead.) On

Re: GSoC status: classification, output format and more

2008-07-26 Thread Marc 'HE' Brockschmidt
[EMAIL PROTECTED] writes: Btw, I didn't say much about Source:, but that's because I was thinking of reusing Ref: which already has the relevant information. Though some standardization wouldn't hurt: using the document ID as defined by doc-base instead of its title (and optional debian-

Re: GSoC status: classification, output format and more

2008-07-26 Thread Russ Allbery
Marc 'HE' Brockschmidt [EMAIL PROTECTED] writes: [EMAIL PROTECTED] writes: Btw, I didn't say much about Source:, but that's because I was thinking of reusing Ref: which already has the relevant information. Though some standardization wouldn't hurt: using the document ID as defined by

Re: GSoC status: classification, output format and more

2008-07-24 Thread Jordà Polo
On Sun, Jul 20, 2008 at 09:13:25PM -0700, Russ Allbery wrote: In practice, there are a few checks that take up a *lot* of time (man page processing, for example), and most checks are fairly fast once you have all the data collected anyway. Currently, the split between checks/* scripts is a

Re: GSoC status: classification, output format and more

2008-07-24 Thread Russ Allbery
Jordà Polo [EMAIL PROTECTED] writes: Checks: po-debconf 954.981 fields 776.158 cruft425.962 standards-version277.267 control-file 231.320 patch-systems192.197 debhelper 60.784 rules

Re: GSoC status: classification, output format and more

2008-07-24 Thread Russ Allbery
Jordà Polo [EMAIL PROTECTED] writes: On Tue, Jul 22, 2008 at 10:42:12PM -0700, Russ Allbery wrote: Yes, standardization would be excellent here, as well as adding more keywords to the translator that turns them into nice descriptions for the web and for -i output. The one thing this

Re: GSoC status: classification, output format and more

2008-07-23 Thread Jordà Polo
On Tue, Jul 22, 2008 at 10:42:12PM -0700, Russ Allbery wrote: Yes, standardization would be excellent here, as well as adding more keywords to the translator that turns them into nice descriptions for the web and for -i output. The one thing this doesn't give us is distinguishing between the

Re: GSoC status: classification, output format and more

2008-07-22 Thread Jordà Polo
On Sun, Jul 20, 2008 at 09:13:25PM -0700, Russ Allbery wrote: The plus is that the basic format uses the same terms that people are already familiar with, even though we also have support for tuning the output for things like ftp-master. The drawback is that we're not pushing people towards

Re: GSoC status: classification, output format and more

2008-07-22 Thread Russ Allbery
Jordà Polo [EMAIL PROTECTED] writes: On Sun, Jul 20, 2008 at 09:13:25PM -0700, Russ Allbery wrote: The plus is that the basic format uses the same terms that people are already familiar with, even though we also have support for tuning the output for things like ftp-master. The drawback is

Re: GSoC status: classification, output format and more

2008-07-20 Thread Russ Allbery
Jordà Polo [EMAIL PROTECTED] writes: I have not been explaining much about the Lintian GSoC project. In the following paragraphs I'll try to summarize how it is coming along, as well as what are the current issues and future directions. Thank you for the update! So far, more than 50% of all

GSoC status: classification, output format and more

2008-07-18 Thread Jordà Polo
I have not been explaining much about the Lintian GSoC project. In the following paragraphs I'll try to summarize how it is coming along, as well as what are the current issues and future directions. Tag classification -- So far, more than 50% of all the tags in checks/*.desc