How to force wget to download Java Script links?
How to force wget to download Java Script links: http://znik.wbc.lublin.pl/ChemFan/kalkulatory/javascript:wrzenie(): 17:04:44 ERROR 404: Not Found. http://znik.wbc.lublin.pl/ChemFan/kalkulatory/javascript:cisnienia(): 17:04:45 ERROR 404: Not Found. Or maybe it can download it, but there is just an error on the web page? ak
wget tries to get what it shouldn't get... :(
I'm using the following command to mirror a web page: wget -m -nv -k -K -nH -t 100 -D wbc.lublin.pl -o logchemfanpl -P public_html/mirror http://znik.wbc.lublin.pl/ChemFan/ For some strange reason wget tries to download all http:// addresses put by visitors in a Guest Book: http://znik.wbc.lublin.pl/ChemFan/Goscie/Nowi/none: 17:26:48 ERROR 404: Not Found. http://znik.wbc.lublin.pl/ChemFan/Goscie/Nowi/jeszcze%20nie%20mam: 17:26:48 ERROR 404: Not Found. http://znik.wbc.lublin.pl/ChemFan/Goscie/Nowi/republika.pl/kelog: 17:26:48 ERROR 404: Not Found. http://znik.wbc.lublin.pl/ChemFan/Goscie/Nowi/-: 17:26:48 ERROR 404: Not Found. http://znik.wbc.lublin.pl/ChemFan/Goscie/Nowi/: 17:26:49 ERROR 404: Not Found. http://znik.wbc.lublin.pl/ChemFan/Goscie/Nowi/nieee%20mam:(: 17:26:49 ERROR 404: Not Found. http://znik.wbc.lublin.pl/ChemFan/Goscie/Nowi/www.alpha.uwb.edu.pl/szymcio /menue.htm: 17:26:49 ERROR 404: Not Found. http://znik.wbc.lublin.pl/ChemFan/Goscie/Nowi/alpha.uwb.edu.pl/szymcio/men ue.htm: 17:26:50 ERROR 404: Not Found. http://znik.wbc.lublin.pl/ChemFan/Goscie/Nowi/www.czat.of.pl: 17:26:50 ERROR 404: Not Found. http://znik.wbc.lublin.pl/ChemFan/Goscie/Nowi/chemia.pl: 17:26:50 ERROR 404: Not Found. http://znik.wbc.lublin.pl/ChemFan/Goscie/Nowi/mama%20jak%B1%B6%20ma%20ale% 20nie%20wiem: 17:26:51 ERROR 404: Not Found. http://znik.wbc.lublin.pl/ChemFan/Goscie/Nowi/www.kmkkm.tk.krakow.pl: 17:26:51 ERROR 404: Not Found. etc, etc. I tried to prevent it by -X Goscie, but it didn't help. Why wget tries to download all those URLs? ak
wget is mirroring whole internet instead of just my web page!
When I try to mirror web pages using the command: wget -m -nv -k -K -nH -t 100 -o logchemfanpl -P public_html/mirror http://znik.wbc.lublin.pl/ChemFan/ wget is mirroring not just the domain of the web page but just whole internet... There is robot.txt files, but it should not influence wget to download all available domains I suppose? So why is it happening and how to avoid it? Regards Andrzej.
RE: wget is mirroring whole internet instead of just my web page!
On 18 Aug 2003 at 13:49, Post, Mark K wrote: man wget shows: -D domain-list --domains=domain-list Set domains to be followed. domain-list is a comma-separated list of domains. Note that it does not turn on -H. Right, but by default wget should not follow all domains, then why it was happening in this case? I tried also to mirror another web site from the same server, also containing links to other domains: wget -m -nv -k -K -nH -t 100 -o logmineraly -P public_html/mirror http://znik.wbc.lublin.pl/Mineraly/ and in this case it was not downloading from other domains. So that's a mystery really. Anyway, if I add -D wbc.lublin.pl it should run correctly? wget -m -nv -k -K -nH -t 100 -D wbc.lublin.pl -o logchemfanpl -P public_html/mirror http://znik.wbc.lublin.pl/ChemFan/ ak
wget vs mms://*.wmv?
How could I download using wget that: mms://mms.itvp.pl/bush_archiwum/bush.wmv If wget cannot manage it then what can? Cheers! Andy
archives
Hi! I sent a question yesterday to the list without subscribing, hoping that I will read answers in the archives, however none of the 3 archives work! Please forward me any replies to my yesterday's e-mail in private. Cheers! ak