Re: [Bug-wget] "Referer" when using spider mode

2018-11-27 Thread Darshit Shah
Hi Fernando,

Once again, the answer is quite the same. You could parse the --debug output of
Wget to do this. Though, remember, parsing the debug is not always safe since
we may change it any point.

However, even for this case, I guess, using Wget2 is a better choice for you.

https://gitlab.com/gnuwget/wget2

* Fernando Gont  [181127 13:08]:
> Folks,
> 
> I'm using wget in a script to find broken and "moved" links in a web site.
> 
> My problem is that, when parsing the output of "wget --spider", I cannot
> tell which page triggered the retrieval of a URL (i.e., the "referer" of
> such URL) -- so, while I can find that there are broken links, I cannot
> easily tell which page contains the broken link.
> 
> Any clues on how to obtain such info?
> 
> Thanks!
> Fernando
> 
> 
> 
> 
> -- 
> Fernando Gont
> SI6 Networks
> e-mail: fg...@si6networks.com
> PGP Fingerprint:  31C6 D484 63B2 8FB1 E3C4 AE25 0D55 1D4E 7492
> 
> 
> 
> 
> 
> 

-- 
Thanking You,
Darshit Shah
PGP Fingerprint: 7845 120B 07CB D8D6 ECE5 FF2B 2A17 43ED A91A 35B6


signature.asc
Description: PGP signature


[Bug-wget] "Referer" when using spider mode

2018-11-27 Thread Fernando Gont
Folks,

I'm using wget in a script to find broken and "moved" links in a web site.

My problem is that, when parsing the output of "wget --spider", I cannot
tell which page triggered the retrieval of a URL (i.e., the "referer" of
such URL) -- so, while I can find that there are broken links, I cannot
easily tell which page contains the broken link.

Any clues on how to obtain such info?

Thanks!
Fernando




-- 
Fernando Gont
SI6 Networks
e-mail: fg...@si6networks.com
PGP Fingerprint:  31C6 D484 63B2 8FB1 E3C4 AE25 0D55 1D4E 7492