Hello,

When I run wget to download any URL, for example

        http://www.expert.ru/expert/current/data/raznoe.shtml

I *always* want wget to name the final main .html file "index.html".

Long story short, I am downloading webpages and then putting them under
an Apache htdocs dir.  After that I want to be able to point people to
the correct URL under my Apache htdocs dir to view the downloaded webpage.
I have control over the directory name BUT not over the names of the
files wget downloads; wget decides the final name.

An example of what I *ideally want*:

1) I run wget on the URL
http://www.expert.ru/expert/current/data/raznoe.shtml

wget -nd -e robots=off -v -t 2 -S -T 30 -w 1 --random-wait -E \
     -U "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.0)" \
     -k -p -H -R '*exe,*vbs,*pdf,*doc,*bat' \
     http://www.expert.ru/expert/current/data/raznoe.shtml

2) When wget is done and we go to the dir where wget dumped all the files,
it looks like this on the UNIX command line:

/eval//apache_1.3.28/htdocs/wget/test1
[ 11:25 : Thu,18 Sep 2003 ] $ ls

600090.gif?0-0-15560021-1-a:0p:0&            html.js
advert.gif                                   lock.gif
blic2.ii                                     logo.gif
budgetrf.gif                                 rambler.gif
cnt?cid=16612&p=1                            raznoe.shtml.html
counter?id=6386;t=84;l=1                     red.gif
empty.gif                                    red_folder.gif
empty.gif.1                                  rle.cgi?sid=23705&bt=33&pz=0&rnd=898701491
folder.gif                                   scripart.js
grey.gif                                     top100.cnt?134760
hot_folder.gif                               yell.gif
hot_red_folder.gif

/eval/apache_1.3.28/htdocs/wget/test1 ]
[ 11:25 : Thu,18 Sep 2003 ] $ 

As we can see above, wget has saved the main file as "raznoe.shtml.html".
This is *not* what I want; I *always* want the main file to be named
"index.html".

I want this so that when people go to my Apache web server they always
know to put "index.html" on the end of the URL:

http://webserver_machine_name:6060/wget/test1/index.html

and *NOT* this:

http://webserver_machine_name:6060/wget/test1/raznoe.shtml.html


Ideally I want to do this with some parameter I feed wget on the command
line.
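
Failing that, the only fallback I can think of is renaming the main file
myself after wget finishes. Below is a rough sketch of that idea, not a
wget feature. The function name main_to_index is made up for illustration,
and it assumes wget was run with -nd and -E as above, so that the main
file is the last path component of the URL, possibly with ".html"
appended, and that the URL has no query string.

```shell
#!/bin/sh
# Hypothetical helper (NOT a wget option): rename the main page that
# wget saved in a directory to index.html.
# Assumption: wget ran with -nd and -E, so the main file is the last
# path component of the URL, possibly with ".html" appended.
main_to_index() {
    mti_dir=$1
    mti_url=$2
    mti_base=${mti_url##*/}    # e.g. raznoe.shtml

    # URL ended in "/" or already names index.html: nothing to rename.
    [ -n "$mti_base" ] || return 0
    [ "$mti_base" != "index.html" ] || return 0

    # Try the name with the -E suffix first, then the plain name.
    for mti_f in "$mti_dir/$mti_base.html" "$mti_dir/$mti_base"; do
        if [ -f "$mti_f" ]; then
            mv -- "$mti_f" "$mti_dir/index.html"
            return 0
        fi
    done

    echo "main_to_index: no main file found for $mti_url" >&2
    return 1
}
```

For the example above it would be called as
main_to_index /eval/apache_1.3.28/htdocs/wget/test1 http://www.expert.ru/expert/current/data/raznoe.shtml
after the wget run. Any other downloaded page that links to the old file
name would still point at the old name, which is another reason I would
prefer a proper wget option.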

Any help would be MUCH appreciated!

Thanks!



Pete

