wget parsing JavaScript

2002-03-26 Thread csaba . raduly
wget stumbled upon the following HTML file: --- 8 <html> <head> <title>foo</title> </head> <body> <SCRIPT language=JavaScript1.2> var sitems=new Array() var sitemlinks=new Array() ///Edit below/ //extend or shorten this list sitems[0]=15.html

GNU wget 1.8.1 - Bug report memory occupied

2002-03-26 Thread Dipl. Ing. Hermann Rugen
Hello specialists, I used wget 1.8.1 on my system to mirror the site www.europa.eu.int. The transfer ran through a proxy and DSL overnight. After about 12-13 hours I found the following situation: about 1.8GB of data had been downloaded in total. wget process was

Re: wget parsing JavaScript

2002-03-26 Thread Tony Lewis
Csaba Ráduly wrote: I see that wget handles SCRIPT with tag_find_urls, i.e. it tries to parse whatever is inside. Why was this implemented? JavaScript is mostly used to construct links programmatically. wget is likely to find bogus URLs until it can properly parse JavaScript. wget is

Re: wget parsing JavaScript

2002-03-26 Thread Ian Abbott
On 26 Mar 2002 at 7:05, Tony Lewis wrote: Csaba Ráduly wrote: I see that wget handles SCRIPT with tag_find_urls, i.e. it tries to parse whatever is inside. Why was this implemented? JavaScript is mostly used to construct links programmatically. wget is likely to find bogus URLs

Re: spanning hosts: 2 Problems

2002-03-26 Thread Ian Abbott
On 26 Mar 2002 at 19:01, Jens Rösner wrote: I am using wget to parse a local html file which has numerous links into the www. Now, I only want hosts that include certain strings like -H -Daudi,vw,online.de It's probably worth noting that the comparisons between the -D strings and the

Re: wget parsing JavaScript

2002-03-26 Thread Tony Lewis
I wrote: wget is parsing the attributes within the script tag, i.e., <script src=url>. It does not examine the content between <script> and </script>. and Ian Abbott responded: I think it does, actually, but that is mostly harmless. You're right. What I meant was that it does not examine the