Re: Can't wget anidb.net

2008-10-20 Thread Saint Xavier
Hi,

* zanzi ([EMAIL PROTECTED]) wrote:
 I have tried to wget http://anidb.net/perl-bin/animedb.pl?show=main but all
 I seem to get is a file with unreadable characters (and not the HTML file
 I'm after).
 Is it because of some perl-script on the site?

This perl script assume HTTP/1.1 and gzip support for any request :(

  HTTP/1.1 200 OK
  Date: Mon, 20 Oct 2008 15:25:09 GMT
  Server: Apache/1.3.41 (Unix) mod_perl/1.30
  Set-Cookie: adbuin=1224516309-mqMQ; path=/; expires=Thu, 18-Oct-2018 15:25:09 
GMT
  Cache-control: no-cache
  Pragma: no-cache
  Content-Type: text/html; charset=UTF-8
  Expires: Mon, 20 Oct 2008 15:25:09 GMT
  X-Cache: MISS from anidb.net
  Connection: close
  Content-Encoding: gzip
   ^
  Content-Length: 5489


You can manually decompress the data:
 $ wget http://anidb.net/perl-bin/animedb.pl?show=main;  -O page.gz
 $ gzip -d page.gz  page.html


Sincerly,
Saint Xavier.


Re: Where is program_name?

2008-09-09 Thread Saint Xavier

Hi,

* Gisle Vanem ([EMAIL PROTECTED]) wrote:
 'program_name' is used in lib/error.c, but it is not allocated anywhere. 
 Should it be added to main.c and initialised to exec_name?

$cd wget-mainline
$find . -name '*.[ch]' -exec fgrep -H -n 'program_name' '{}' \;
./lib/error.c:63:# define program_name program_invocation_name
   ^^^
./lib/error.c:95:/* The calling program should define program_name and set it 
to the
./lib/error.c:97:extern char *program_name;
./lib/error.c:248:  __fxprintf (NULL, %s: , program_name);
./lib/error.c:250:  fprintf (stderr, %s: , program_name);
./lib/error.c:307:  __fxprintf (NULL, %s:, program_name);
./lib/error.c:309:  fprintf (stderr, %s:, program_name);
./src/netrc.c:463:  char *program_name, *file, *target;
./src/netrc.c:472:  program_name = argv[0];

Google for that and you will find the corresponding man page. Like it's
written here 
http://www.tin.org/bin/man.cgi?section=3topic=PROGRAM_INVOCATION_NAME
These variables are automatically initialised by the glibc run-time
 startup code.

I've also opened Wget with GDB: the variable exists but seems to point to
a bad memory area... 

Sincerly,
Saint Xavier.


Re: How to debug wget ?

2008-09-01 Thread Saint Xavier
Hi
* Jinhui Li ([EMAIL PROTECTED]) wrote:
 I am browsing the source code. And want to debug it to figure out how it
 works.

 So, somebody please tell me how to debug ( with GDB ) or where can I find
 information that I need.

Compile Wget with debug informations (-g flag for GCC) and then run wget into
GDB. You can now use and see variables name, structs, function name instead
of unfriendly adresses and raw bytes, ...

I can't show every aspect of GDB. Use your favorite search engine for
more usage info.

Hints:
http://www.gnu.org/software/gdb/documentation/
http://www.google.com/search?hl=enq=gdb+howtobtnG=Search

Quick example:
$ CFLAGS=-g ./configure
$ make
$ cd src/
$ gdb ./wget
(gdb) set args -P tmpdir/ -p www.google.com
(gdb) break retrieve_url
Breakpoint 1 at 0x420260: file retr.c, line 601.
(gdb) break url_parse
Breakpoint 2 at 0x4231a0: file url.c, line 644.
(gdb) run
Starting program: /home/xav/xav/sxav/src/wget -P tmpdir/ -p www.google.com
[Thread debugging using libthread_db enabled]
[New Thread 0x7f1ad48566e0 (LWP 6580)]
[Switching to Thread 0x7f1ad48566e0 (LWP 6580)]

Breakpoint 2, url_parse (url=0x665810 http://www.google.com;, 
error=0x7fffdc866aa0, iri=0x665830) at url.c:644
644 {
(gdb) continue
Continuing.

Breakpoint 1, retrieve_url (origurl=0x665ae0 http://www.google.com/;, 
file=0x7fffdc866a70, newloc=0x7fffdc866a78, refurl=0x0, dt=0x7fffdc866a98, 
recursive=false, iri=0x665830) at retr.c:601
601 {
(gdb) bt
#0  retrieve_url (origurl=0x665ae0 http://www.google.com/;, 
file=0x7fffdc866a70, newloc=0x7fffdc866a78, refurl=0x0, dt=0x7fffdc866a98, 
recursive=false, iri=0x665830) at retr.c:601
#1  0x0041ed48 in retrieve_tree (start_url=0x665810 
http://www.google.com;, pi=0x0) at recur.c:292
#2  0x0041bc31 in main (argc=5, argv=0x7fffdc866d28) at main.c:1201
(gdb) p iri
$1 = (struct iri *) 0x665830
(gdb) p *iri
$2 = {uri_encoding = 0x665850 UTF-8, content_encoding = 0x0, utf8_encode = 
false}
(gdb) p iri-uri_encoding
$3 = 0x665850 UTF-8
(gdb) continue
Continuing.

Breakpoint 2, url_parse (url=0x665b20 http://www.google.com/;, 
error=0x7fffdc8669e0, iri=0x665830) at url.c:644
644 {
(gdb) continue
Continuing.
--2008-09-01 14:57:15--  http://www.google.com/
Resolving www.google.com (www.google.com)... 66.249.93.99
Connecting to www.google.com (www.google.com)|66.249.93.99|:80... connected.
HTTP request sent, awaiting response... 302 Found
Location: http://www.google.be/ [following]

Breakpoint 2, url_parse (url=0x669660 http://www.google.be/;, 
error=0x7fffdc8669e0, iri=0x665830) at url.c:644
644 {
(gdb) continue
Continuing.
--2008-09-01 14:57:18--  http://www.google.be/
Resolving www.google.be (www.google.be)... 66.249.93.104
Connecting to www.google.be (www.google.be)|66.249.93.104|:80... connected.
HTTP request sent, awaiting response... 200 OK
Length: unspecified [text/html]
Saving to: `tmpdir/www.google.be/index.html'

[ =  ] 6,146   --.-K/s   in 0.07s   

2008-09-01 14:57:18 (82.1 KB/s) - `tmpdir/www.google.be/index.html'
saved [6146]


Breakpoint 2, url_parse (url=0x665ae0 http://www.google.com/;, error=0x0, 
iri=0x0) at url.c:644
644 {
(gdb) q
The program is running.  Exit anyway? (y or n) y

Regards,
Saint Xavier.


Re: WGET :: [Correction de texte]

2008-08-11 Thread Saint Xavier
* Tom ([EMAIL PROTECTED]) wrote:
 Bonjour !

bonjour,

 Je souhaite vous informer d'une touche restée appuyée un quart de seconde
 trop longtemps semble-t-il !
...
 Téléchargement récursif:
   -r,  --recursive  spécifer un téléchargement récursif.
   -l,  --level=NOMBRE   *profondeeur* maximale de récursion (inf ou 0
 Juste un e à enlever de profondeeur, et ca sera réglé !

En effet, merci !

Micah, instead of profondeeur it should be profondeur.
Where do you forward that info, French GNU translation team ?
(./po/fr.po around line 1472)

Saint Xavier.


Re: retrieval of data from a database

2008-06-14 Thread Saint Xavier
* Philip Stephens ([EMAIL PROTECTED]) wrote:
 Hello wget users and developers,
 
[...]

 [EMAIL PROTECTED]:~/Documents/projects/Boston/psdiss/code$ wget -dv
 http://www.bostonresourcenet.org/detail.cfm?StartRow=2EndRow=99SearchType=detailedVarZip=VarMiles=0keyword=ServID=17OrgTypeID=LangID=SpecialID=LocationID=
 Setting --verbose (verbose) to 1
 DEBUG output created by Wget 1.11.2 on linux-gnu.
 
 --2008-06-09 14:08:08--
 http://www.bostonresourcenet.org/detail.cfm?StartRow=2
 Resolving www.bostonresourcenet.org... [1] 12857
 [2] 12858
 [3] 12859
 [4] 12860
 [5] 12861
 [6] 12862
 [7] 12863
 [8] 12864
 [9] 12865
 [10] 12866
 [2]   DoneEndRow=99
 [3]   DoneSearchType=detailed
 [4]   DoneVarZip=
 [5]   DoneVarMiles=0
 [6]   Donekeyword=
 [7]   DoneServID=17
 [8]   DoneOrgTypeID=
 [9]   DoneLangID=
 [10]+  DoneSpecialID=

Well, you'd better escape the '' in the shell (\)...

Regards,
Saint Xavier.


Re: retrieval of data from a database

2008-06-09 Thread Saint Xavier
* Philip Stephens ([EMAIL PROTECTED]) wrote:
 Hello wget users and developers,
[...]

 [EMAIL PROTECTED]:~/Documents/projects/Boston/psdiss/code$ wget -dv
 http://www.bostonresourcenet.org/detail.cfm?StartRow=2EndRow=99SearchType=detailedVarZip=VarMiles=0keyword=ServID=17OrgTypeID=LangID=SpecialID=LocationID=
 Setting --verbose (verbose) to 1
 DEBUG output created by Wget 1.11.2 on linux-gnu.
 
 --2008-06-09 14:08:08--
 http://www.bostonresourcenet.org/detail.cfm?StartRow=2
 Resolving www.bostonresourcenet.org... [1] 12857
 [2] 12858
 [3] 12859
 [4] 12860
 [5] 12861
 [6] 12862
 [7] 12863
 [8] 12864
 [9] 12865
 [10] 12866
 [2]   DoneEndRow=99
 [3]   DoneSearchType=detailed
 [4]   DoneVarZip=
 [5]   DoneVarMiles=0
 [6]   Donekeyword=
 [7]   DoneServID=17
 [8]   DoneOrgTypeID=
 [9]   DoneLangID=
 [10]+  DoneSpecialID=
Well, you'd better escape the '' in your shell (\)

Regards,
Saint Xavier.


Google SoC 2008

2008-03-20 Thread Saint Xavier

Hi all,

I would like to participate in GSOC 2008. I saw some Wget project ideas on
http://www.gnu.org/software/soc-projects/ideas.html. I'm particulary interested
to improve international support and HTTP/1.1 headers support and some 
not-too-big
tasks like FTP proxy authentication.

Do you have specific requirements for applying ? Does someone already pick these
tasks or similar ones ? Is that enough for applying ?

BTW, I'm an electronic engineer student in Belgium (West Europe) and a GNU
enthusiast.

Regards,
Saint Xavier