Hi David, Have you thought of using webstemmer[1]? I'm actually in the process of designing a solution for this at the moment. Interested in sharing ideas?
Thanks, Tim [1] http://www.unixuser.org/~euske/python/webstemmer/ On Wed, 2008-07-30 at 14:20 -0500, David Bain wrote: > Martin, > That looks a bit more sane, but when I attempt to pull the site down > with wget I end up having a <base href=mysite> issue. Where everty > html files points back to the original site as a result. > > On Wed, Jul 30, 2008 at 5:59 AM, Martin Stadler <[EMAIL PROTECTED]> wrote: > > > > Am 28.07.2008 um 18:38 schrieb David Bain: > > > >> I need to save an all html version of a plone site, so that it can be > >> stored on a thumbdrive or CD for archiving purposes. What's the best > >> approach? > >> In the past I've used httrack, but it does some strange things with images > >> creating a directory named after the image for example image.jpg would > >> become image.jpg/thumb, image.jpg/preview etc... > >> > > > > You can also try wget (find a hint how to grab a site here: > > http://en.wikipedia.org/wiki/Wget#Using_Wget). Don't know how good it works > > with a Plone site. > > > > Martin > > > > > > _______________________________________________ > Product-Developers mailing list > [email protected] > http://lists.plone.org/mailman/listinfo/product-developers _______________________________________________ Product-Developers mailing list [email protected] http://lists.plone.org/mailman/listinfo/product-developers
