Re: [Zope] Making lots of external data searchable?
Anselm Lingnau writes:
> indexing big mail archives from Zope

You can use the Zope product "LocalFS" (maybe also "ExternalFile") to wrap external files as Zope objects without storing them in the ZODB. Newer versions of "LocalFS" allow you to catalog its contents with ZCatalog.

However, before you commit to ZCatalog, take some performance measurements. ZCatalog tends to require large amounts of time and memory to build its indexes.

Dieter

___
Zope maillist - [EMAIL PROTECTED]
http://lists.zope.org/mailman/listinfo/zope
** No cross posts or HTML encoding! **
(Related lists -
 http://lists.zope.org/mailman/listinfo/zope-announce
 http://lists.zope.org/mailman/listinfo/zope-dev )
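One way to act on the advice to measure first is to time a trial indexing run on a sample of the archive and record its peak memory. The following is only a sketch in modern Python 3: `measure` and `toy_index` are hypothetical names, `toy_index` is a stand-in for whatever call actually feeds the catalog, and `tracemalloc` only sees memory allocated by Python itself, so the figures are a rough lower bound.

```python
import time
import tracemalloc


def measure(build_index, sample):
    """Run build_index(sample) and return (elapsed_seconds, peak_bytes)."""
    tracemalloc.start()
    started = time.perf_counter()
    build_index(sample)
    elapsed = time.perf_counter() - started
    _, peak = tracemalloc.get_traced_memory()
    tracemalloc.stop()
    return elapsed, peak


def toy_index(messages):
    # Stand-in for the real indexing call: a tiny in-memory inverted index
    # mapping each word to the numbers of the messages that contain it.
    index = {}
    for number, text in enumerate(messages):
        for word in set(text.lower().split()):
            index.setdefault(word, []).append(number)
    return index


if __name__ == "__main__":
    sample = ["zope catalog test message %d" % i for i in range(1000)]
    seconds, peak = measure(toy_index, sample)
    print("indexed %d messages in %.3fs, peak %.1f KiB"
          % (len(sample), seconds, peak / 1024))
```

Running this on samples of a few different sizes should show whether time and memory grow roughly in proportion to the archive or faster.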
[Zope] Making lots of external data searchable?
Anselm Lingnau <[EMAIL PROTECTED]> wrote:
> I'm using Zope to re-vamp a web site, one of whose most
> important features is an archive of a reasonably busy mailing
> list, which is accessed using home-grown Perl CGI code. I've
> written Python code to let users browse the archive sorted by
> users, subject etc., but now I'm looking at allowing text
> searches. The »old« instance of the web site used Glimpse and a
> simple CGI script (in Perl) to do this across the whole site
> (including the mail archive) and ideally this would be what I'm
> after for the new version as well.
>
> However, the mail archive now weighs in at about 45 MB in
> individual text files (one per message), and I don't really see
> myself putting this into the ZODB so I can use ZCatalog.
> ZCatalog, however, looks good for indexing the rest of the site
> (I haven't done this yet). Is there a reasonable way of
> interfacing Glimpse with the Zope searching machinery so I
> could again have one-stop searching of the whole site? (It
> would probably be straightforward to search just the mail
> archive by calling out to Glimpse and massaging the results.)

You could probably use ZCatalog in conjunction with LocalFS to accomplish this; I think LocalFS was recently revved to allow cataloguing.

Note that the actual mass-indexing process is going to be *painful*, as ZCatalog is intended to ease incremental indexing rather than bulk loads. I think I would write a script which walked the hierarchy, calling a method to index one (or a few) messages at a time. This script might also need to pack the database at intervals; the catalog is a bit space-inefficient across multiple index/reindex operations.

Tres.
--
===
Tres Seaver [EMAIL PROTECTED]
Digital Creations "Zope Dealers" http://www.zope.org
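The walk-and-index script described above could be shaped roughly as follows. This is a minimal sketch, not a tested Zope script: `index_in_batches`, `index_one` and `checkpoint` are hypothetical names. In a real setup `index_one` would presumably wrap the catalog call (for example ZCatalog's `catalog_object`), and `checkpoint` would commit the transaction and, every so often, pack the database.

```python
import os


def index_in_batches(root, index_one, checkpoint, batch_size=100):
    """Walk root, call index_one(path) for every file, and call
    checkpoint() after every batch_size files.

    Returns the number of files indexed.
    """
    count = 0
    for dirpath, dirnames, filenames in os.walk(root):
        dirnames.sort()  # visit subdirectories in a stable order
        for name in sorted(filenames):
            index_one(os.path.join(dirpath, name))
            count += 1
            if count % batch_size == 0:
                checkpoint()
    if count % batch_size != 0:
        checkpoint()  # flush the final, partial batch
    return count
```

Keeping the batch size small bounds how much work is lost if a run fails partway, at the cost of more frequent commits.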
[Zope] Making lots of external data searchable?
I'm using Zope to re-vamp a web site, one of whose most important features is an archive of a reasonably busy mailing list, which is accessed using home-grown Perl CGI code. I've written Python code to let users browse the archive sorted by users, subject etc., but now I'm looking at allowing text searches. The »old« instance of the web site used Glimpse and a simple CGI script (in Perl) to do this across the whole site (including the mail archive) and ideally this would be what I'm after for the new version as well.

However, the mail archive now weighs in at about 45 MB in individual text files (one per message), and I don't really see myself putting this into the ZODB so I can use ZCatalog. ZCatalog, however, looks good for indexing the rest of the site (I haven't done this yet). Is there a reasonable way of interfacing Glimpse with the Zope searching machinery so I could again have one-stop searching of the whole site? (It would probably be straightforward to search just the mail archive by calling out to Glimpse and massaging the results.)

Anselm
--
Anselm Lingnau . [EMAIL PROTECTED]
To those who doom Unix at the hands of NT, well, not this decade, pal. -- Ade Barkah
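For the fallback of calling out to Glimpse and massaging the results, a sketch along these lines might do. It assumes that `glimpse` is on the PATH with its index already built in the default location, and that it prints grep-style `path: matched line` output; both are assumptions to check against the local installation. `run_glimpse` and `group_hits` are hypothetical helper names.

```python
import subprocess


def run_glimpse(pattern):
    """Run `glimpse pattern` and return its standard output as text.

    Assumes glimpse is installed and its index already exists.
    """
    result = subprocess.run(["glimpse", pattern],
                            capture_output=True, text=True)
    return result.stdout


def group_hits(output):
    """Group 'path: matched line' output into {path: [matched lines]}.

    Lines without a ': ' separator are skipped. A path that itself
    contains ': ' would be split in the wrong place.
    """
    hits = {}
    for line in output.splitlines():
        path, sep, text = line.partition(": ")
        if not sep:
            continue  # not a hit line
        hits.setdefault(path, []).append(text.strip())
    return hits
```

The resulting mapping of file names to matching lines could then be rendered next to the ZCatalog results for the rest of the site.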