I'm trying to parse microsoft word documents. When I try to parse a microsoft word document
I get an error that tells me catdoc is missing.
The following url, as referenced in parse_doc.pl, where i should be able to download catdoc from, isn't available.
ftp://ftp.ice.ru/pub/vitus/catdoc-0.90.3.tar.gz
Also the link to ftp.htdig.org where the parsers live isn't available.
Does anyone know of a mirror where I can download these parsers?
Thanks,
JP
Jean-Paul Cozzatti
Sapient, S.p.A
via Crocefisso, 19
Milano, ITALIA 20122
cell | +39 0348 254 9361

