wget stumbled upon the following HTML file:
--- 8
html
head
titlefoo/title
/head
body
SCRIPT language=JavaScript1.2
var sitems=new Array()
var sitemlinks=new Array()
///Edit below/
//extend or shorten this list
sitems[0]=15.html
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1
Hallo specialists,
I used wget 1.8.1 on my system to mirror the site www.europa.eu.int.
Transfer was throug a proxy and DSL over night.
After about 12-13 hours I found following situation:
Totally download about 1.8GB data.
wget process was
Title: Á¦¸ñ¾øÀ½
Csaba Ráduly wrote:
I see that wget handles SCRIPT with tag_find_urls, i.e. it tries to
parse whatever it's inside.
Why was this implemented ? JavaScript is most
used to construct links programmatically. wget is likely to find
bogus URLs until it can properly parse JavaScript.
wget is
On 26 Mar 2002 at 7:05, Tony Lewis wrote:
Csaba Ráduly wrote:
I see that wget handles SCRIPT with tag_find_urls, i.e. it tries to
parse whatever it's inside.
Why was this implemented ? JavaScript is most
used to construct links programmatically. wget is likely to find
bogus URLs
On 26 Mar 2002 at 19:01, Jens Rösner wrote:
I am using wget to parse a local html file which has numerous links into
the www.
Now, I only want hosts that include certain strings like
-H -Daudi,vw,online.de
It's probably worth noting that the comparisons between the -D
strings and the
Title: Àý´ë ¼ºÀθ¸ Ŭ¸¯ÇÏ½Ã°í ¹Ì¼º³âÀÚ´Â Áö¿ì¼¼¿ä.
Àý´ë ¼ºÀθ¸ Ŭ¸¯ÇÏ½Ã°í ¹Ì¼º³âÀÚ´Â Áö¿ì¼¼¿ä.
»õ·Î¿î
½ÅÁ¾ ¼ºÀε¿¿µ»ó À帣
I wrote:
wget is parsing the attributes within the script tag, i.e., script
src=url. It does not examine the content between script and
/script.
and Ian Abbott responded:
I think it does, actually, but that is mostly harmless.
You're right. What I meant was that it does not examine the