Sorry, but most people on the list don't speak German. Would you mind restating your question/comment in English? Or can someone else translate this? Also, I don't see the connection between Olivier's tip on htsearch logging and my two e-mails to Brett about doc2html.pl. My two e-mail do indeed talk about pdf parsers but Olivier's doesn't.
According to Ute Wehner@home: > WElcher pdf-Parser steht in letzter Nachricht > > > > Message: 1 > > From: Gilles Detillieux <[EMAIL PROTECTED]> ... > > According to Brett Simpson: > > > I'm currently using the RPM version of Htdig 3.2.0-1.b3.6 on Redhat 7.2 > > > with apache. What do I need to perform text searching of pdf files? If > > > I copy a pdf file into /var/www/html and run "htdig -iv" it lists the > > > pdf file as not Parsable. I am able to do a search with the default > > > stuff that comes loaded in /var/www/html. Do I need to add some sort > > > of external parser? Does anyone know of any? Thanks. > > > > I recommend doc2html.pl. See http://www.htdig.org/FAQ.html#q4.9 ... > > Message: 5 ... > > From: Olivier Korn <[EMAIL PROTECTED]> ... > > At 15:37 06/12/2001 -0500, Geoff Hutchison wrote: > > >On Thu, 6 Dec 2001, B.G. Mahesh wrote: > > > > > > > Is there any htdig-log processing utility like webalizer [mrunix.net]? I > > > > > >Not to my knowledge. I believe one issue is that the htsearch log output > > >isn't in a form that webalizer et al find easy to parse. This is an issue > > >we'd like to correct, but it's pretty far down the TODO list unless > > >someone contributes something in this direction. > > > > There is a patch for ht://Dig 3.1.5 which modify the way htsearch log > > output is written. I found it rather useful (but I don't use any kind of > > log processing utility. Sorry). ... > > Message: 10 > > From: Gilles Detillieux <[EMAIL PROTECTED]> ... > > According to Brett Simpson: > > > I updated to the latest rpms from redhat and got conv_doc.pl to > > > work. I'm going to give doc2html.pl a try. > > > > I use conv_doc.pl myself and am happy with it. I only recommend > > doc2html.pl because it's more configurable, and has hooks for more > > document types, so it's more generally useful. However if conv_doc.pl > > meets all your external converter needs, you probably don't need to > > switch. -- Gilles R. Detillieux E-mail: <[EMAIL PROTECTED]> Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil Dept. Physiology, U. of Manitoba Phone: (204)789-3766 Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930 _______________________________________________ htdig-general mailing list <[EMAIL PROTECTED]> To unsubscribe, send a message to <[EMAIL PROTECTED]> with a subject of unsubscribe FAQ: http://htdig.sourceforge.net/FAQ.html

