Leon Messerschmidt writes:
 > I've got a client that is interested in indexing a large number of Word
 > .doc files. 

Do you mean for free-text search, or indexing for some other purpose?

If you just want to index the text content of these docs, the best trick
is to convert them to plain text or HTML and index that. This is the same
technique we used for indexing Word docs and PDFs with our navigation
engine (see nzone.dcs.bbk.ac.uk). It's also the same technique Google
use (or at least I'm 90% certain it is). If you need more details, let
me know,
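
To make the "convert, then index" idea concrete, here is a rough sketch
in Python. It is not our engine's code, just an illustration. It assumes
the .doc files have already been converted to plain text by some
external converter (that step is not shown), and the file names, sample
text and function names are all made up for the example. It builds a
simple inverted index (word -> documents containing it) and answers
all-words queries against it.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lower-case the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(texts):
    """Map each word to the set of document names that contain it.

    `texts` maps a document name to its already-converted plain text.
    """
    index = defaultdict(set)
    for name, text in texts.items():
        for word in tokenize(text):
            index[word].add(name)
    return index


def search(index, query):
    """Return the names of documents that contain every word in the query."""
    words = tokenize(query)
    if not words:
        return set()
    result = set(index.get(words[0], set()))
    for word in words[1:]:
        result &= index.get(word, set())
    return result


# Hypothetical output of the conversion step: document name -> plain text.
converted = {
    "report.doc": "Quarterly sales report for the northern region",
    "memo.doc": "Memo about the sales meeting on Friday",
}

index = build_index(converted)
```

With this in place, search(index, "sales report") returns only the
documents that contain both words. A real setup would more likely hand
the converted text to an existing search library than roll its own
index, but the shape of the pipeline is the same.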

Richard
