Hello,
When I run wget to download any URL, for example:
http://www.expert.ru/expert/current/data/raznoe.shtml
I *always* want wget to name the final main .html file "index.html".
Long story short, I am downloading webpages and then putting them under
an Apache htdocs dir. After that I want to be able to point people to
the correct URL under my Apache htdocs dir to view the downloaded webpage.
I have control over the directory name BUT not over the names of the files
wget downloads; wget decides the final names.
An example of what I *ideally want*:
1) I run wget on the URL
http://www.expert.ru/expert/current/data/raznoe.shtml
wget -nd -e robots=off -v -t 2 -S -T 30 -w 1 --random-wait -E \
  -U "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.0)" -k -p -H \
  -R *exe,*vbs,*pdf,*doc,*bat \
  http://www.expert.ru/expert/current/data/raznoe.shtml
2) When wget is done and we go to the dir where wget dumped all the files,
it looks like this on the UNIX command line:
/eval//apache_1.3.28/htdocs/wget/test1
[ 11:25 : Thu,18 Sep 2003 ] $ ls
600090.gif?0-0-15560021-1-a:0p:0&   html.js
advert.gif                          lock.gif
blic2.ii                            logo.gif
budgetrf.gif                        rambler.gif
cnt?cid=16612&p=1                   raznoe.shtml.html
counter?id=6386;t=84;l=1            red.gif
empty.gif                           red_folder.gif
empty.gif.1                         rle.cgi?sid=23705&bt=33&pz=0&rnd=898701491
folder.gif                          scripart.js
grey.gif                            top100.cnt?134760
hot_folder.gif                      yell.gif
hot_red_folder.gif
/eval/apache_1.3.28/htdocs/wget/test1 ]
[ 11:25 : Thu,18 Sep 2003 ] $
As we can see above, wget saved the main file as "raznoe.shtml.html". This
is *not* what I want; I *always* want the main file to be named "index.html".
I want this so that when people go to my Apache web server they always
know to put "index.html" on the end of the URL:
http://webserver_machine_name:6060/wget/test1/index.html
and *NOT* this:
http://webserver_machine_name:6060/wget/test1/raznoe.shtml.html
Ideally I want to do this with some parameter I feed wget on the command
line.
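Failing such a parameter, one possible workaround is a post-download step
that copies the saved main page to "index.html". The following is only a
minimal sketch, not a wget feature: main_page_name is a hypothetical helper
that guesses the name wget gives the main page under -nd and -E. It assumes
the URL has no query string, that the page is served as HTML, and that -E
appends ".html" unless the name already ends in ".htm" or ".html" (which
matches the "raznoe.shtml.html" result in the listing above).

```shell
# Hypothetical helper (not part of wget): guess the local file name that
# "wget -nd -E" gives to the main page of a URL.
# Assumptions: no query string in the URL, and the page is served as HTML.
main_page_name() {
    name=${1##*/}                       # keep only the last path component
    case "$name" in
        "") name=index.html ;;          # URL ended in "/": assume index.html
        *.[Hh][Tt][Mm] | *.[Hh][Tt][Mm][Ll]) ;;   # already .htm/.html: unchanged
        *) name=$name.html ;;           # assume -E appends ".html"
    esac
    printf '%s\n' "$name"
}

# Usage, run in the download directory after wget has finished:
#   url=http://www.expert.ru/expert/current/data/raznoe.shtml
#   cp -p "$(main_page_name "$url")" index.html
```

A copy is used in the usage line rather than a symlink so that it does not
depend on the Apache configuration allowing symlinks to be followed. Because
-nd puts every file in one directory, the relative links that -k writes into
the page should keep working from the copy.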
Any help would be MUCH appreciated!
thanks!
Pete