Your message dated Sun, 30 Mar 2008 20:17:11 -0700 (PDT)
with message-id <[EMAIL PROTECTED]>
and subject line Re: Bug#466473: Wget -r -k: strips closing tags from changed 
urls.
has caused the Debian Bug report #466473,
regarding Wget -r -k: strips closing tags from changed urls.
to be marked as done.

This means that you claim that the problem has been dealt with.
If this is not the case it is now your responsibility to reopen the
Bug report if necessary, and/or fix the problem forthwith.

(NB: If you are a system administrator and have no idea what this
message is talking about, this may indicate a serious mail system
misconfiguration somewhere. Please contact [EMAIL PROTECTED]
immediately.)


-- 
466473: http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=466473
Debian Bug Tracking System
Contact [EMAIL PROTECTED] with problems
--- Begin Message ---
Package: wget
Version: 1.10.2-3
Severity: minor

The -k flag to wget mishandles the code for hyperlinks. 

Let's say that the web page http://foo.com/bar/index.html contains a
hyperlink to http://foo.com/bar/baz.html. The html code for this
hyperlink would be:
  
  http://foo.com/bar/baz.html>display text

Now, the command wget -h -k foo.com/bar/ downloads both pages and
*should* change the code of the hyperlink to the following:

  <a href=baz.html>display text</a>

Instead, the closing tag is removed leaving only 

  <a href=baz.html>display text

If there is any text following the display text, it is also included
as part of the hyperlink. This is mainly a cosmetic issue as the text
is still displayed, but it is certainly not desirable. It should be
fairly straightforward to correct the script to prevent the closing
tag from being removed.

--- End Message ---
--- Begin Message ---
>tags 466473 + moreinfo unreproducible
>thanks
>
>Am Montag, den 18.02.2008, 16:08 -0800 schrieb Jacob Emmert-Aronson:
>
>Hello Jacob,
>
>> The -k flag to wget mishandles the code for hyperlinks. 
>> 
>> Let's say that the web page http://foo.com/bar/index.html contains a
>> hyperlink to http://foo.com/bar/baz.html. The html code for this
>> hyperlink would be:
>>   
>>   http://foo.com/bar/baz.html>display text
>> 
>> Now, the command wget -h -k foo.com/bar/ 
downloads both pages and
>> *should* change the code of the hyperlink to the following:
>> 
>>   <a href=baz.html>display text</a>
>> 
>> Instead, the closing tag is removed leaving only 
>> 
>>   <a href=baz.html>display text
>> 
>> If there is any text following the display text, it is also included
>> as part of the hyperlink. This is mainly a cosmetic issue as the text
>> is still displayed, but it is certainly not desirable. It should be
>> fairly straightforward to correct the script to prevent the closing
>> tag from being removed.
>> 
>I'm trying to reproduce your report (with wget 1.10.2 and 1.11.1).
>
>www.koethe.net/wget/index.html and www.koethe.net/wget/bar.html and then
>run:
>
>wget -r -k www.koethe.net/wget/
>
>But the closing tags are correct. Where is my setup wrong?
>Do you have concret and a reproducible setup to show this problem?
>
>thx.
>
>-- 
>Noèl Köthe <noel debian.org>

Debian GNU/Linux, www.debian.orgThe website I was trying to download from is
http://distro.ibiblio.org/pub/linux/distributions/damnsmall/mydsl/uci/
and I just looked at their page source.  As it turns out, they left 
out the closing tags, so it is little wonder that the tags did not 
reappear after the download.

It seems that it is not your setup but mine that was wrong.  I suppose 
I should have checked that before filing a bug report rather than 
after receiving a reply.  At any rate, I'm closing this bug and I 
appologize for any inconvenience I may have caused.


Jacob Emmert-Aronson


--- End Message ---

Reply via email to