I seem to be getting to message not long after Gilles writes in...
On Mon, 19 Feb 2001, Gilles Detillieux wrote:
> According to Michael Olds:
> set you use, so that htdig can know what's a letter and what isn't.
> See http://www.htdig.org/FAQ.html#q4.10 for more information. I don't
> know what's involved in making your own locale definition.
This obviously depends a lot on the OS (and C library), but it sounds like
you have a fairly complex mapping. Are you using more than 8-bit
characters?
> > I am wondering also, if it might be worth waiting for the next generation,
> > which I understand will create the indexes on the fly so as to save disk
> > usage. What is the ratio of original material to indexed space used?
>
> I don't think you will save any disk space using the 3.2 betas.
> If anything, they may use a bit more. For a small site like yours, that
> shouldn't be a big concern, though, should it? (My databases are under
> 13 MB.)
OK, there are two issues here:
1) Even though 3.2 can build databases "on-the-fly" do you really want do
do this to save disk space? What do you do if your only copy of the
databases dies? I don't recommend anyone going this approach even though
it might be feasible.
2) The 3.2 code introduces word database compression, which generally more
than offsets the increase in size due to phrase searching. So most people
will see smaller databases. (I'm not sure 13MB will necessarily get
smaller, but it's possible.)
--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
Information: http://lists.sourceforge.net/lists/listinfo/htdig-general
FAQ: http://htdig.sourceforge.net/FAQ.html