Salut, > wget downloads > "http://www.mfd.mw.tu-dresden.de/mfd/index.php/lehre/wintersemester/videos-winter", > parses it for > URLs and in the end this document becomes removed because it doesn't fit your > settings. That's what I consider the flawed logic. Wget shouldn't remove URLs explicitly given on the command line, IMHO. *Edit*: --debug revealed that actually, the no-parent (-np) setting caused the problem. Can someone confirm wget actually does behave like stated above? It seems so ^^
> Did you try -A '*.pdf' -R '*.asx' ? (Quotes to avoid shell wildcard > expansion.) No pdf files on this page, I was just giving an example.. In my case it's about .asx-files (dreadful stuff^^) but more commonly, it might be .pdf-files for others. > Sorry, but I can only reach the login page... Yeah, horrible habitude to password-shield educational content.. > else I could have a look deeper inside. Right now I just have to guess. > Your '-A asx' means: Just download/keep *.asx files > Your '--accept-regex videos' means: Just download/keep *video* files > > That means, you only want to download *video*.asx files. > Is that what you want ? No.. I want to include all asx files, plus the initial URL which contains the actual links thereto. Guess I could cook up .. hmm *fumble* > --regex-type pcre --accept-regex '(videos-winter$|asx$)' This did the trick. But... mmh ... Ok my copy'n paste derp. The --no-parent option was the problem. Without it & with -A asx, it just works™. #Thanks & Best Regards/MPartap -- Gerne können Sie mir auch verschlüsselt schreiben, und sei es nur, um einen kleinen Triumph über die Geheimdienste zu feiern. Meinen öffentlichen Schlüssel finden Sie unter http://pgp.mit.edu/pks/lookup?op=get&search=0x65C6B04A8A39FBDA . Der PGP-Fingerprint lautet »2E95 89E4 B9A4 460B 66AD 9B8D 65C6 B04A 8A39 FBDA«.
