Re: [Bug-wget] sudo wget --accept sandwich

Marcel Partap Wed, 10 Feb 2016 12:16:45 -0800

Salut,

> wget downloads 
> "http://www.mfd.mw.tu-dresden.de/mfd/index.php/lehre/wintersemester/videos-winter";,
>  parses it for 
> URLs and in the end this document becomes removed because it doesn't fit your 
> settings.
That's what I consider the flawed logic. Wget shouldn't remove URLs
explicitly given on the command line, IMHO.
*Edit*: --debug revealed that actually, the no-parent (-np) setting
caused the problem. Can someone confirm wget actually does behave like
stated above? It seems so ^^


> Did you try -A '*.pdf' -R '*.asx' ? (Quotes to avoid shell wildcard
> expansion.)
No pdf files on this page, I was just giving an example.. In my case
it's about .asx-files (dreadful stuff^^) but more commonly, it might be
.pdf-files for others.

> Sorry, but I can only reach the login page...
Yeah, horrible habitude to password-shield educational content..
> else I could have a look deeper inside. Right now I just have to guess.

> Your '-A asx' means: Just download/keep *.asx files
> Your '--accept-regex videos' means: Just download/keep *video* files
> 
> That means, you only want to download *video*.asx files.
> Is that what you want ?
No.. I want to include all asx files, plus the initial URL which
contains the actual links thereto.
Guess I could cook up .. hmm *fumble*
> --regex-type pcre --accept-regex '(videos-winter$|asx$)'
This did the trick. But... mmh ...
Ok my copy'n paste derp. The --no-parent option was the problem. Without
it & with -A asx, it just works™.

#Thanks & Best Regards/MPartap

-- 
Gerne können Sie mir auch verschlüsselt schreiben, und sei es nur, um
einen kleinen Triumph über die Geheimdienste zu feiern.
Meinen öffentlichen Schlüssel finden Sie unter
http://pgp.mit.edu/pks/lookup?op=get&search=0x65C6B04A8A39FBDA .
Der PGP-Fingerprint lautet »2E95 89E4 B9A4 460B 66AD 9B8D 65C6 B04A 8A39
FBDA«.

Re: [Bug-wget] sudo wget --accept sandwich

Reply via email to