Re: Can't wget anidb.net
Hi, * zanzi ([EMAIL PROTECTED]) wrote: I have tried to wget http://anidb.net/perl-bin/animedb.pl?show=main but all I seem to get is a file with unreadable characters (and not the HTML file I'm after). Is it because of some perl-script on the site? This perl script assume HTTP/1.1 and gzip support for any request :( HTTP/1.1 200 OK Date: Mon, 20 Oct 2008 15:25:09 GMT Server: Apache/1.3.41 (Unix) mod_perl/1.30 Set-Cookie: adbuin=1224516309-mqMQ; path=/; expires=Thu, 18-Oct-2018 15:25:09 GMT Cache-control: no-cache Pragma: no-cache Content-Type: text/html; charset=UTF-8 Expires: Mon, 20 Oct 2008 15:25:09 GMT X-Cache: MISS from anidb.net Connection: close Content-Encoding: gzip ^ Content-Length: 5489 You can manually decompress the data: $ wget http://anidb.net/perl-bin/animedb.pl?show=main; -O page.gz $ gzip -d page.gz page.html Sincerly, Saint Xavier.
Re: Where is program_name?
Hi, * Gisle Vanem ([EMAIL PROTECTED]) wrote: 'program_name' is used in lib/error.c, but it is not allocated anywhere. Should it be added to main.c and initialised to exec_name? $cd wget-mainline $find . -name '*.[ch]' -exec fgrep -H -n 'program_name' '{}' \; ./lib/error.c:63:# define program_name program_invocation_name ^^^ ./lib/error.c:95:/* The calling program should define program_name and set it to the ./lib/error.c:97:extern char *program_name; ./lib/error.c:248: __fxprintf (NULL, %s: , program_name); ./lib/error.c:250: fprintf (stderr, %s: , program_name); ./lib/error.c:307: __fxprintf (NULL, %s:, program_name); ./lib/error.c:309: fprintf (stderr, %s:, program_name); ./src/netrc.c:463: char *program_name, *file, *target; ./src/netrc.c:472: program_name = argv[0]; Google for that and you will find the corresponding man page. Like it's written here http://www.tin.org/bin/man.cgi?section=3topic=PROGRAM_INVOCATION_NAME These variables are automatically initialised by the glibc run-time startup code. I've also opened Wget with GDB: the variable exists but seems to point to a bad memory area... Sincerly, Saint Xavier.
Re: How to debug wget ?
Hi * Jinhui Li ([EMAIL PROTECTED]) wrote: I am browsing the source code. And want to debug it to figure out how it works. So, somebody please tell me how to debug ( with GDB ) or where can I find information that I need. Compile Wget with debug informations (-g flag for GCC) and then run wget into GDB. You can now use and see variables name, structs, function name instead of unfriendly adresses and raw bytes, ... I can't show every aspect of GDB. Use your favorite search engine for more usage info. Hints: http://www.gnu.org/software/gdb/documentation/ http://www.google.com/search?hl=enq=gdb+howtobtnG=Search Quick example: $ CFLAGS=-g ./configure $ make $ cd src/ $ gdb ./wget (gdb) set args -P tmpdir/ -p www.google.com (gdb) break retrieve_url Breakpoint 1 at 0x420260: file retr.c, line 601. (gdb) break url_parse Breakpoint 2 at 0x4231a0: file url.c, line 644. (gdb) run Starting program: /home/xav/xav/sxav/src/wget -P tmpdir/ -p www.google.com [Thread debugging using libthread_db enabled] [New Thread 0x7f1ad48566e0 (LWP 6580)] [Switching to Thread 0x7f1ad48566e0 (LWP 6580)] Breakpoint 2, url_parse (url=0x665810 http://www.google.com;, error=0x7fffdc866aa0, iri=0x665830) at url.c:644 644 { (gdb) continue Continuing. Breakpoint 1, retrieve_url (origurl=0x665ae0 http://www.google.com/;, file=0x7fffdc866a70, newloc=0x7fffdc866a78, refurl=0x0, dt=0x7fffdc866a98, recursive=false, iri=0x665830) at retr.c:601 601 { (gdb) bt #0 retrieve_url (origurl=0x665ae0 http://www.google.com/;, file=0x7fffdc866a70, newloc=0x7fffdc866a78, refurl=0x0, dt=0x7fffdc866a98, recursive=false, iri=0x665830) at retr.c:601 #1 0x0041ed48 in retrieve_tree (start_url=0x665810 http://www.google.com;, pi=0x0) at recur.c:292 #2 0x0041bc31 in main (argc=5, argv=0x7fffdc866d28) at main.c:1201 (gdb) p iri $1 = (struct iri *) 0x665830 (gdb) p *iri $2 = {uri_encoding = 0x665850 UTF-8, content_encoding = 0x0, utf8_encode = false} (gdb) p iri-uri_encoding $3 = 0x665850 UTF-8 (gdb) continue Continuing. Breakpoint 2, url_parse (url=0x665b20 http://www.google.com/;, error=0x7fffdc8669e0, iri=0x665830) at url.c:644 644 { (gdb) continue Continuing. --2008-09-01 14:57:15-- http://www.google.com/ Resolving www.google.com (www.google.com)... 66.249.93.99 Connecting to www.google.com (www.google.com)|66.249.93.99|:80... connected. HTTP request sent, awaiting response... 302 Found Location: http://www.google.be/ [following] Breakpoint 2, url_parse (url=0x669660 http://www.google.be/;, error=0x7fffdc8669e0, iri=0x665830) at url.c:644 644 { (gdb) continue Continuing. --2008-09-01 14:57:18-- http://www.google.be/ Resolving www.google.be (www.google.be)... 66.249.93.104 Connecting to www.google.be (www.google.be)|66.249.93.104|:80... connected. HTTP request sent, awaiting response... 200 OK Length: unspecified [text/html] Saving to: `tmpdir/www.google.be/index.html' [ = ] 6,146 --.-K/s in 0.07s 2008-09-01 14:57:18 (82.1 KB/s) - `tmpdir/www.google.be/index.html' saved [6146] Breakpoint 2, url_parse (url=0x665ae0 http://www.google.com/;, error=0x0, iri=0x0) at url.c:644 644 { (gdb) q The program is running. Exit anyway? (y or n) y Regards, Saint Xavier.
Re: WGET :: [Correction de texte]
* Tom ([EMAIL PROTECTED]) wrote: Bonjour ! bonjour, Je souhaite vous informer d'une touche restée appuyée un quart de seconde trop longtemps semble-t-il ! ... Téléchargement récursif: -r, --recursive spécifer un téléchargement récursif. -l, --level=NOMBRE *profondeeur* maximale de récursion (inf ou 0 Juste un e à enlever de profondeeur, et ca sera réglé ! En effet, merci ! Micah, instead of profondeeur it should be profondeur. Where do you forward that info, French GNU translation team ? (./po/fr.po around line 1472) Saint Xavier.
Re: retrieval of data from a database
* Philip Stephens ([EMAIL PROTECTED]) wrote: Hello wget users and developers, [...] [EMAIL PROTECTED]:~/Documents/projects/Boston/psdiss/code$ wget -dv http://www.bostonresourcenet.org/detail.cfm?StartRow=2EndRow=99SearchType=detailedVarZip=VarMiles=0keyword=ServID=17OrgTypeID=LangID=SpecialID=LocationID= Setting --verbose (verbose) to 1 DEBUG output created by Wget 1.11.2 on linux-gnu. --2008-06-09 14:08:08-- http://www.bostonresourcenet.org/detail.cfm?StartRow=2 Resolving www.bostonresourcenet.org... [1] 12857 [2] 12858 [3] 12859 [4] 12860 [5] 12861 [6] 12862 [7] 12863 [8] 12864 [9] 12865 [10] 12866 [2] DoneEndRow=99 [3] DoneSearchType=detailed [4] DoneVarZip= [5] DoneVarMiles=0 [6] Donekeyword= [7] DoneServID=17 [8] DoneOrgTypeID= [9] DoneLangID= [10]+ DoneSpecialID= Well, you'd better escape the '' in the shell (\)... Regards, Saint Xavier.
Re: retrieval of data from a database
* Philip Stephens ([EMAIL PROTECTED]) wrote: Hello wget users and developers, [...] [EMAIL PROTECTED]:~/Documents/projects/Boston/psdiss/code$ wget -dv http://www.bostonresourcenet.org/detail.cfm?StartRow=2EndRow=99SearchType=detailedVarZip=VarMiles=0keyword=ServID=17OrgTypeID=LangID=SpecialID=LocationID= Setting --verbose (verbose) to 1 DEBUG output created by Wget 1.11.2 on linux-gnu. --2008-06-09 14:08:08-- http://www.bostonresourcenet.org/detail.cfm?StartRow=2 Resolving www.bostonresourcenet.org... [1] 12857 [2] 12858 [3] 12859 [4] 12860 [5] 12861 [6] 12862 [7] 12863 [8] 12864 [9] 12865 [10] 12866 [2] DoneEndRow=99 [3] DoneSearchType=detailed [4] DoneVarZip= [5] DoneVarMiles=0 [6] Donekeyword= [7] DoneServID=17 [8] DoneOrgTypeID= [9] DoneLangID= [10]+ DoneSpecialID= Well, you'd better escape the '' in your shell (\) Regards, Saint Xavier.
Google SoC 2008
Hi all, I would like to participate in GSOC 2008. I saw some Wget project ideas on http://www.gnu.org/software/soc-projects/ideas.html. I'm particulary interested to improve international support and HTTP/1.1 headers support and some not-too-big tasks like FTP proxy authentication. Do you have specific requirements for applying ? Does someone already pick these tasks or similar ones ? Is that enough for applying ? BTW, I'm an electronic engineer student in Belgium (West Europe) and a GNU enthusiast. Regards, Saint Xavier