Thanks Dave for the info. What I needed to do was install and enable Sys::AlarmCall and that mostly solved my problem. What I mean by mostly is that after completing a successful "rundig" I still get a process hung. Example:
 
  PID USERNAME PRI NICE  SIZE   RES STATE WAIT     TIME    CPU COMMAND
27817 root      64    0   88M   89M run   -       50:40 53.12% xlhtml
25943 root      -5    0   91M   39M sleep biowai  54:11 15.23% ppthtml
12512 root      -5    0   88M   12M sleep biowai  50:43  5.13% ppthtml
 
Do you know why that is, or how to prevent that? Thanks!
Bryan
-----Original Message-----
From: David Adams [mailto:[EMAIL PROTECTED]]
Sent: Tuesday, June 11, 2002 11:54 PM
To: Bryan C. Woods; [EMAIL PROTECTED]
Subject: Re: [htdig] htdig/ppthtml hangs

Bryan,
 
You don't say which process is using 99% of the processor, htdig or pptHtml.  
 
Exactly how have you configured htdig to use pptHtml?  The script doc2html.pl has a timeout feature which could be the solution.  You must install the Sys::AlarmCall Perl module for it to work.
 
Have you tried the test of executing pptHtml from the command line to convert this document directly?
 
It looks as though you need to use the max_doc_size configuration statement as the document is larger than the default 200000 characters.
 
--
David Adams
Computing Services
Southampton University
----- Original Message -----
Sent: Wednesday, June 12, 2002 1:52 AM
Subject: [htdig] htdig/ppthtml hangs

Hi, I am using htdig 3.1.6 and ppthml which came with the source for xlhtml-0.4 and running them on OpenBSD 3.1
 
Everything works fine except for parsing ppt files. Htdig hangs every time I run rundig, in what appears as an infinite loop, because the process uses up to 99% of the processor and I even let it run for over a day with no results. 
 
I have been trying to test this on a directory of about 15 misc office documents, and they all are parsed and indexed until I add support for the ppt files, then it hangs on next rundig.
 
 
 This is where rundig -vvv hangs:
 
Header line: HTTP/1.1 200 OK
Header line: Server: Microsoft-IIS/5.0
Header line: MicrosoftOfficeWebServer: 5.0_Collab
Header line: Date: Wed, 12 Jun 2002 00:49:19 GMT
Header line: Content-Type: application/vnd.ms-powerpoint
Header line: Accept-Ranges: bytes
Header line: Last-Modified: Wed, 29 May 2002 20:26:54 GMT
Converted Wed, 29 May 2002 20:26:54 GMT to Wed, 29 May 2002 20:26:54
Header line: ETag: "80ea682c4f7c21:860"
Header line: Content-Length: 204288
Header line:
returnStatus = 0
Read 8192 from document
Read 8192 from document
Read 8192 from document
Read 8192 from document
Read 8192 from document
Read 8192 from document
Read 8192 from document
Read 8192 from document
Read 8192 from document
Read 8192 from document
Read 8192 from document
Read 8192 from document
Read 8192 from document
Read 8192 from document
Read 8192 from document
Read 8192 from document
Read 8192 from document
Read 8192 from document
Read 8192 from document
Read 8192 from document
Read 8192 from document
Read 8192 from document
Read 8192 from document
Read 8192 from document
Read 3392 from document
Read a total of 200000 bytes
 
 
 
 
 
-Thanks for any help!
 
Bryan

Reply via email to