Micha <[EMAIL PROTECTED]> writes:

> Is there a feature to refresh all cached files, and delete anything 
> that doesn't exist anymore ?
> 
> Best with a list of domains to exclude from that, maybe.

There is no built-in feature to do this.

Refreshing everything would be quite a task.  It could be done by
getting a list of the URLs that are cached for a host (using
wwwoffle-ls perhaps) and then making requests for all of them.  To
check whether a URL no longer exists you would need to check the HTTP
status value of the response.  It could probably be done with a Perl
script without too much trouble.
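
As a rough illustration of that idea, here is a minimal sketch in
Python (rather than Perl).  It is not part of WWWOFFLE.  It assumes
that you already have the list of cached URLs from somewhere, that the
proxy is listening on localhost port 8080, and that a 404 or 410
status means the page no longer exists.  The names fetch_status,
classify and refresh are made up for this example, and whether the
no-cache headers actually force a refresh depends on how the proxy is
configured.

```python
import urllib.error
import urllib.request
from urllib.parse import urlsplit

# Assumption: the caching proxy is reachable at this address.
PROXY = "http://localhost:8080"


def fetch_status(url, proxy=PROXY, timeout=30):
    """Request url through the proxy; return the HTTP status or None."""
    opener = urllib.request.build_opener(
        urllib.request.ProxyHandler({"http": proxy})
    )
    # Ask for a fresh copy; the proxy may or may not honour this.
    request = urllib.request.Request(
        url, headers={"Pragma": "no-cache", "Cache-Control": "no-cache"}
    )
    try:
        with opener.open(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # The server answered, but with an error status such as 404.
        return err.code
    except (urllib.error.URLError, OSError):
        # No usable answer at all (DNS failure, timeout, proxy down).
        return None


def classify(status):
    """Map an HTTP status (or None) to 'ok', 'gone' or 'error'."""
    if status is None:
        return "error"
    if status in (404, 410):
        return "gone"
    if 200 <= status < 400:
        return "ok"
    return "error"


def refresh(urls, fetch=fetch_status, excluded_hosts=()):
    """Re-request each URL and group the URLs by outcome."""
    results = {"ok": [], "gone": [], "error": [], "skipped": []}
    for url in urls:
        host = urlsplit(url).hostname or ""
        if host in excluded_hosts:
            # Hosts the user asked to leave alone are not requested.
            results["skipped"].append(url)
            continue
        results[classify(fetch(url))].append(url)
    return results
```

Calling refresh() with the URLs for one host and a set of hosts to
exclude returns the URLs grouped as "ok", "gone", "error" and
"skipped".  The sketch only reports the "gone" URLs; removing them
from the cache is left as a separate step.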

The problem that you would get is that lots of pages would have
changed and you would need to get new images and other things for
them.  You would end up with lots more in the cache than you had
before and no way of knowing what had changed and what had stayed the
same.

It would depend a lot on the type of content that the web pages have.
For pages that change a lot in layout or content it might not be much
use.  For a local cache of Wikipedia pages (for example) it might work
well, since the pages are unlikely to move around, the format will
stay constant and, since it is reference material, you won't care too
much what has changed; you only care that it is up to date.

-- 
Andrew.
----------------------------------------------------------------------
Andrew M. Bishop                             [EMAIL PROTECTED]
                                      http://www.gedanken.demon.co.uk/

WWWOFFLE users page:
        http://www.gedanken.demon.co.uk/wwwoffle/version-2.9/user.html
