Bastiaan,
I too looked at CACHE.IDX, but didn't copy it to a .TXT file to open it with
a text editor.
As I see it, the CACHE.IDX database relates the <http://....> image to the
stored cryptic cached name. There should be a way to extract the
<http://....> names from the database and then do a search and replace on
the saved web page, replacing each <http://....> with the cryptic cached
image name.
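The search-and-replace idea could look roughly like the Python sketch below. It assumes the URL-to-cached-name pairs have already been pulled out of CACHE.IDX by some means, since the real layout of that file is not known here. The function name rewrite_image_links and the example.com URLs are made up for illustration; only the cached file names come from this thread.

```python
def rewrite_image_links(html, url_to_cached):
    """Replace each original URL in the page text with its cached file name."""
    # Handle longer URLs first so that a URL which is a prefix of another
    # one does not clobber the longer match.
    for url in sorted(url_to_cached, key=len, reverse=True):
        html = html.replace(url, url_to_cached[url])
    return html


# Hypothetical mapping: the URLs are invented for the example.
mapping = {
    "http://example.com/img/updated.gif": "54930741.gif",
    "http://example.com/img/eng-vlag.jpg": "54930740.tmp",
}
page = '<img src="http://example.com/img/updated.gif">'
print(rewrite_image_links(page, mapping))
```

Plain string replacement is used rather than an HTML parser, which matches the "search and replace" approach described here but would also rewrite a URL that appears in visible text.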
I do not know what database program CACHE.IDX uses, but I tried to access it
with FOXPRO2 and got the error message that CACHE.IDX is not a database, even
after I copied it to CACHE.DBF. I also did a web search on the .IDX
extension to see what database programs it might be associated with, but had
no success there either.
With FOXPRO2, you can copy fields to a text file with the command:
COPY TO [path]filename.txt FIELDS field1, field2, ... DELIMITED WITH '
That would put each record on its own line instead of giving one long line,
and you would only need the two fields that hold the image location on the
original web page and the cryptic cached name.
A .BAT file could then do the search and replace and, if necessary, use a
routine like STRINGS.COM to step through the text file.
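As an alternative to a .BAT file, a text export of that kind could be read with a few lines of Python. This is only a sketch: it assumes the export has one record per line, fields separated by commas, and character fields wrapped in the ' delimiter, with the original URL in the first field and the cached name in the second. The function name load_mapping and the sample rows are hypothetical.

```python
import csv
import io


def load_mapping(lines):
    """Build {original_url: cached_name} from comma-separated, '-quoted rows."""
    mapping = {}
    for row in csv.reader(lines, quotechar="'"):
        if len(row) >= 2:  # skip blank or malformed rows
            mapping[row[0].strip()] = row[1].strip()
    return mapping


# Invented sample standing in for the exported text file.
sample = io.StringIO(
    "'http://example.com/img/updated.gif','54930741.gif'\n"
    "'http://example.com/index.html','54930731.htm'\n"
)
mapping = load_mapping(sample)
print(mapping)
```

With a real file, the same function could be given an open file object in place of the io.StringIO sample.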
Roger Turk
Tucson, Arizona
Bastiaan wrote:
Hi List,
I did some trials in saving two pages.
1- empty the cache (F8)... I have my cache written to the HD
2- go on line to the desired website
3- disconnect after downloading the page(s) (Alt H)
4- leave Arachne (Alt X) so no events are added to the cache
5- go to cache.idx. COPY cache.idx website.txt
6- view or print website.txt
This file is one very long line... not all viewers/editors like
this, and it is not easy to read.
cache.idx is a list that states:
original filename => cache converted => cache + original extension
I made cache.idx (website.txt) more readable by searching for ">" in the
editor and pressing Enter at each one to break up the very long line.
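The same line breaking can be done without an editor. The Python sketch below is a minimal illustration that assumes the break should go right after each ">". The function name break_after_gt and the sample string are made up and do not reflect the real contents of cache.idx.

```python
def break_after_gt(text):
    """Insert a line break after every '>' so each piece starts a new line."""
    return text.replace(">", ">\n")


# Invented sample text; a real cache.idx copy would be read from disk.
sample = "first>second>third"
print(break_after_gt(sample))
```

For a real file, reading it with open("website.txt", encoding="latin-1") should avoid decoding errors if the index contains non-text bytes.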
"Cache converted" refers to the conversion Arachne performs on JPG files.
The default is JPG => BMP; I use JPG => GIF.
7- Use Norton Commander and make two windows:
********* LEFT *************      *********** RIGHT ****************
New directory for webpage         Arachne\cache
****************************      **********************************
index.html                   <=   54930731.htm
kopframe.htm                 <=   54930735.htm
inhoud.htm                   <=   54930737.htm
eng-vlag.bmp                 <=   54930740.tmp   <<<
updated.gif                  <=   54930741.gif
*
remainder clipped
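The manual copy between the two Norton Commander windows could also be scripted. The Python sketch below is one possible way to do it, not the method used in this thread: the function name copy_with_original_names and the directory arguments are hypothetical, and the name pairs are the ones from the listing above.

```python
import shutil
from pathlib import Path


def copy_with_original_names(cache_dir, target_dir, cached_to_original):
    """Copy each cached file into target_dir under its original name."""
    cache_dir, target_dir = Path(cache_dir), Path(target_dir)
    target_dir.mkdir(parents=True, exist_ok=True)
    copied = []
    for cached, original in cached_to_original.items():
        source = cache_dir / cached
        if source.is_file():  # skip entries that are missing from the cache
            shutil.copy(source, target_dir / original)
            copied.append(original)
    return copied


# Cached name => original name, taken from the listing above.
names = {
    "54930731.htm": "index.html",
    "54930735.htm": "kopframe.htm",
    "54930737.htm": "inhoud.htm",
    "54930740.tmp": "eng-vlag.bmp",
    "54930741.gif": "updated.gif",
}
```

Calling copy_with_original_names with the cache directory, a new directory for the web page, and names would return the list of files actually copied, so missing cache entries are easy to spot.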