Hi,

I have been running a plucker-build script that plucks news from a couple of
sites.  However, this script generally takes 35-40 minutes (!) to run, and
since I live in Sweden and pay for even local phone calls by the minute, I'd
like to shorten this.

Just for the heck of it, I've just installed wwwoffle, which caches HTML
documents from the Web, and played with it a little bit.  I can easily get
it to download and cache the documents I'd like to pluck (which should take
less time than downloading *and* parsing them, right?) -- but it stores them
in a hashed form in special directories, and they're only accessible through
a proxy server on my local machine.

Is there any way to make plucker use this proxy server?  I checked the docs
and tried setting up a .pluckerrc file with the "http_proxy=" option in the
[DEFAULT] section, but when I try to pluck a document that's in the wwwoffle
cache, the system still brings up the Internet connection, indicating that
plucker isn't trying to use the proxy server to access that document.

Or can anyone suggest some other intelligent way to do what I need to do?
Any help is welcome.

Regards,
Tim Kynerd

Sunrise in Stockholm today:  7:20
Sunset in Stockholm today:  17:47
My rail transit photos at http://www.kynerd.nu

Reply via email to