At 3:07 PM -0500 3/16/01, Peterman, Timothy P wrote:
>- there are about 64 million words in it. I would think that
>might contribute to performance problems!
Yes and no. It is certainly the root cause of your problem. On the
other hand, this is not to be an excuse for ht://Dig. :-)
>- Most of them are garbage, 'zzzzzzzzzz' for example.
Index enough files and some of them have tons of garbage.
>it is just a text file, can I simply write a perl script to remove
>the unwanted words?
Certainly, if you like. You'll want to do it before running htmerge.
--
--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html