Cache.idx can only be 'decoded' (so to speak), using the latest version of wwwman.exe
Did you try viewing it with these key-strokes ? "U" and then link to "Cache Index". There you are. ;-) All of the original filenames along with their corresponding names in your cache directory. Here's a screen-cap of my cache index after loading the images at http://arachne.cz/ http://www.cisnet.com/glennmcc/images/cacheidx.gif And this is the .HTM file itself which was generated by wwwman.exe http://www.cisnet.com/glennmcc/images/cacheidx.htm On Fri, 6 Jun 2003 21:19:56 -0400, Roger Turk wrote: > Bastiaan, > I too looked at CACHE.IDX, but didn't copy it to a .TXT file to open it with > a text editor. > As I see it, the CACHE.IDX database relates the <http://....> image to the > stored cryptic cached name. There should be a way to extract the > <http://....> names from the database do a search and replace on the saved > web page, replacing <http://....> with the cryptic cached image name. > I do not know what database program CACHE.IDX uses, but I tried to access it > with FOXPRO2 and got the error message that CACHE.IDX is not a database, even > after I copied it to CACHE.DBF. I also did a web search on the extension, > ..IDX to see what database programs it might be associated with, but had no > success there either. > With FOXPRO2, you can copy fields to a text file with the command, > COPY TO [path]filename.txt FIELDS field1 field2 ... DELIMITED WITH ' > That would not give a long line and you would only need the fields that have > the image location on the original web page and the cryptic cached name. > A .BAT file then could do the search and replace and, if necessary, use a > routine like STRINGS.COM to increment thru the text file. > Roger Turk > Tucson, Arizona > Bastiaan wrote: > Hi List, > I did some trials in saving two pages. > 1- empty the cache (F8)... I have my cache written to the HD > 2- go on line to the desired website > 3- disconnect after downloading the page(s) (Alt H) > 4- leave Arachne (Alt X) so no events are added to the cache > 5- go to cache.idx. COPY cache.idx website.txt > 6- view or print website.txt > This file is one very long line... not all viewers/editors like > this and it is not easy to understand it. > cache.idx is a list that states: > original filename => cache converted => cache + original extension > I had the cache.idx (website.txt) made better readable by searching for > ">" by the editor and giving an enter to break up the very long line. > Cache converted = conversion by Arachne on JPG files. Default JPG => > BMP. I use JPG => GIF. > 7- Use Norton Commander and make two windows: > ********* LEFT ************* *********** RIGHT **************** > New directory for webpage Arachne\cache > **************************** ********************************** > index.html <= 54930731.htm > kopframe.htm <= 54930735.htm > inhoud.htm <= 54930737.htm > eng-vlag.bmp <= 54930740.tmp <<< > updated.gif <= 54930741.gif > * > remainder clipped -- Glenn http://arachne.cz/ http://www.delorie.com/listserv/mime/ http://www.cisnet.com/glennmcc/ http://www.cisnet.com/glennmcc/aqc/
