Sorry, but most people on the list don't speak German.  Would you
mind restating your question/comment in English?  Or can someone else
translate this?  Also, I don't see the connection between Olivier's
tip on htsearch logging and my two e-mails to Brett about doc2html.pl.
My two e-mail do indeed talk about pdf parsers but Olivier's doesn't.

According to Ute Wehner@home:
> WElcher pdf-Parser steht in letzter Nachricht
> 
> 
> > Message: 1
> > From: Gilles Detillieux <[EMAIL PROTECTED]>
...
> > According to Brett Simpson:
> > > I'm currently using the RPM version of Htdig 3.2.0-1.b3.6 on Redhat 7.2
> > > with apache. What do I need to perform text searching of pdf files? If
> > > I copy a pdf file into /var/www/html and run "htdig -iv" it lists the
> > > pdf file as not Parsable. I am able to do a search with the default
> > > stuff that comes loaded in /var/www/html. Do I need to add some sort
> > > of external parser? Does anyone know of any? Thanks.
> > 
> > I recommend doc2html.pl.  See http://www.htdig.org/FAQ.html#q4.9
...
> > Message: 5
...
> > From: Olivier Korn <[EMAIL PROTECTED]>
...
> > At 15:37 06/12/2001 -0500, Geoff Hutchison wrote:
> > >On Thu, 6 Dec 2001, B.G. Mahesh wrote:
> > >
> > > > Is there any htdig-log processing utility like webalizer [mrunix.net]? I
> > >
> > >Not to my knowledge. I believe one issue is that the htsearch log output
> > >isn't in a form that webalizer et al find easy to parse. This is an issue
> > >we'd like to correct, but it's pretty far down the TODO list unless
> > >someone contributes something in this direction.
> > 
> > There is a patch for ht://Dig 3.1.5 which modify the way htsearch log
> > output is written. I found it rather useful (but I don't use any kind of
> > log processing utility. Sorry).
...
> > Message: 10
> > From: Gilles Detillieux <[EMAIL PROTECTED]>
...
> > According to Brett Simpson:
> > > I updated to the latest rpms from redhat and got conv_doc.pl to
> > > work. I'm going to give doc2html.pl a try.
> > 
> > I use conv_doc.pl myself and am happy with it.  I only recommend
> > doc2html.pl because it's more configurable, and has hooks for more
> > document types, so it's more generally useful.  However if conv_doc.pl
> > meets all your external converter needs, you probably don't need to
> > switch.


-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930

_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to