Title: laola2html.pl
I have no experience with laola, or other Perl software for MS files.  I use a combination of wp2html and catdoc, and though I can't recall ever seeing anything like ��ࡱ�, I could well believe that catdoc could produce such output.  However, I've no explanation for what you are seeing.  Htdig launches an external parser and waits for it to complete, as you know from your expereince with word2x :)
 
If your Word documents are Word97 and later you might find the best solution is to spend a few pounds on wp2html.
 
I don't recall you every giving a reason for rejecting catdoc?
 
--
David Adams
Computing Services
Southampton University
----- Original Message -----
Sent: Thursday, July 05, 2001 2:28 PM
Subject: RE: [htdig] laola2html.pl

David:
 
I knew I forgot something :)  Thanks, I will install Sys::AlarmCall.
 
I am having one problem with laola2html, sometimes I get an excerpt of:
 
��ࡱ�
 
in the search results for some Word documents, but not all documents and not every time I dig.  If I just run doc2html or laola2html on the very same *.doc file on the command line, I get a normal looking HTML page outputted.
 
Have you ever encountered anything like this?  Does htdig launch multiple instances of the external parsers/converters, and could this be a factor, since I can't reproduce the effect on the same documents one at a time?
 
Thanks,
 
-Greg
-----Original Message-----
From: David Adams [mailto:[EMAIL PROTECTED]]
Sent: Thursday, July 05, 2001 8:30 AM
To: Holmes, Gregory; [EMAIL PROTECTED]
Subject: Re: [htdig] laola2html.pl

That looks interesting, I will certainly look to including your code in the next release of doc2html, at the moment I'm too pushed for time to do much.
 
Doc2html version 3.0 does have provision for coping when a utility hangs.  You just have to install the small Perl module Sys::AlarmCall 
 
--
David Adams
Computing Services
Southampton University
 

Reply via email to