At 3:07 PM -0500 3/16/01, Peterman, Timothy P wrote:
>- there are about 64 million words in it.  I would think that
>might contribute to performance problems!

Yes and no. It is certainly the root cause of your problem. On the 
other hand, that is no excuse for ht://Dig. :-)

>- Most of them are garbage, 'zzzzzzzzzz' for example.

Index enough files and some of them will have tons of garbage.

>it is just a text file, can I simply write a perl script to remove
>the unwanted words?

Certainly, if you like. You'll want to do it before running htmerge.
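
For what it's worth, here is a minimal sketch of such a filter, in Python 
rather than Perl. It assumes the word is the first whitespace-delimited 
field on each line of the word list; check that against your own file 
before trusting it. The names (is_garbage, filter_lines) and both 
thresholds are made up for illustration and are not ht://Dig settings.

```python
import re
import sys

# Hypothetical heuristics, not ht://Dig settings: tune them for your data.
MAX_WORD_LEN = 30                      # drop words longer than this
REPEAT_RUN = re.compile(r"(.)\1{3,}")  # same character 4+ times in a row


def is_garbage(word):
    """Return True for words that look like junk, e.g. 'zzzzzzzzzz'."""
    return len(word) > MAX_WORD_LEN or REPEAT_RUN.search(word) is not None


def filter_lines(lines):
    """Yield only the lines whose first field is not a garbage word.

    Assumption: the word is the first whitespace-delimited field.
    Blank lines are passed through unchanged.
    """
    for line in lines:
        fields = line.split(None, 1)
        if fields and is_garbage(fields[0]):
            continue
        yield line


# Usage: python clean_wordlist.py INPUT OUTPUT
if __name__ == "__main__" and len(sys.argv) == 3:
    # latin-1 maps every byte to a character, so odd bytes in the
    # word list round-trip without decode errors.
    with open(sys.argv[1], encoding="latin-1") as src, \
         open(sys.argv[2], "w", encoding="latin-1") as dst:
        dst.writelines(filter_lines(src))
```

Write the result to a new file and keep the original until you have 
checked the output, then run htmerge on the cleaned list.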

-- 
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/
