According to Christian Fredrickson: > This same problem also occurs if I attempt to run parse_doc.pl as my Word > parser. While running, I get the following error: > !! Cannot load charset cp1251 - file not found > When I use the doc2html.pl to parse, the .DOC files are placed into the > index, however the only portion of the .DOC file that is indexed is the name > of the .DOC file. So the body of the .DOC files are not parsed at all. > > I can attach anything you need to help me solve this problem. I have > followed all of the steps I could find in the list.
The error comes from "catdoc", which you most likely have not installed correctly. It comes with a directory of several *.txt files that contain the charset definitions, and you must install these where it expects to find them. Read through the directions for compiling and installing catdoc. -- Gilles R. Detillieux E-mail: <[EMAIL PROTECTED]> Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil Dept. Physiology, U. of Manitoba Phone: (204)789-3766 Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930 _______________________________________________ htdig-general mailing list <[EMAIL PROTECTED]> To unsubscribe, send a message to <[EMAIL PROTECTED]> with a subject of unsubscribe FAQ: http://htdig.sourceforge.net/FAQ.html

