Gabor,

These files are created when there is no index.html file in a directory using Apache.

Example: http://foo.org/dir1/

If no index.html file exists in dir1, Apache generates a directory listing which Wget thinks is the index.html file. The listing contains links to "?C=D;O=A" etc. Wget follows these links, not knowing that this file is created by Apache.

It would be nice if Wget could somehow know that the returned web page is dynamically generated by Apache and not from index.html, but so far I haven't heard of any discussion about implementing this.

Frank


Gabor Istvan wrote:
Dear wget users:

I would like to mirror the following url:
http://www.biolinux.org/soft/suse/9.1/

I used the following command:
wget -m -np -l0 -N -P BIOLINUX-YaST http://www.biolinux.org/
soft/suse/9.1/

This works but it also creates the following files that do not exist at the original place:

index.html
index.html?C=D;O=A
index.html?C=D;O=D
index.html?C=M;O=A
index.html?C=M;O=D
index.html?C=N;O=A
index.html?C=N;O=D
index.html?C=S;O=A
index.html?C=S;O=D

These files are created in every directory/subdirectory.
My questions are why these files are created in the local mirror by wget and how could I tell wget not to create these files.

TIA,
IG

_______________________________________________________________________
[freemail] extra 1GB-os postafiókkal, Önnek már van? http://freemail.hu



--
Frank McCown
Old Dominion University
http://www.cs.odu.edu/~fmccown

Reply via email to