aseek-devel  

[aseek-devel] PDF Converter

Elsner
Wed, 29 Aug 2001 01:06:03 -0700

Hello,

aspseek 1.2.4 supports external converters (there is no doc
for 1.2.5, yet?).

I tried the Linux binaries for xpdf from http://www.foolabs.com/xpdf/
and installed it somewhere, say /usr/local/pdftotext.

$ASPSEEK=<my aspseek root directory>
I added/changed the following lines in $ASPSEEK/etc/aspseek.conf:

#Disallow \.rtf$  \.pdf$  \.cdf$  \.ps$
Disallow \.rtf$  \.cdf$  \.ps$

AddType application/pdf \.pdf$  \.PDF$
...
Converter application/pdf text/plain /usr/local/bin/pdftotext $in $out

When I indexed my test site kalypso.rz.uni-osnabrueck.de,
the conversion was successful,
search in the text version seems ok, but the link "text version"
in the results does not output the text, but gives internal server error.

Try:
http://kalypso/cgi-bin/s.cgi
q: sample size

Resulting text version:
http://kalypso/cgi-bin/s.cgi?cs=&q=sample+size&ch=http:%2F%2Fkalypso.rz.uni-osnabrueck.de%2Fxx.pdf&fm=off

Any ideas?

In my opinion, the use of converters should be clarified with some
more working examples, a contrib directory with links to converters
and working configurations for PS, PDF, ...would be helpful.

Frank

#-------------------------------------------------------#
Dipl.-Math. Frank Elsner
Universitaet Osnabrueck (University of Osnabrueck)
- Rechenzentrum - (Computing Center)
Albrechstrasse 28, AVZ
D-49076 Osnabrueck
Deutschland (Germany)

Tel. (Phone): ++49 (0)541/969-2343 Fax: -2470
E-Mail: [EMAIL PROTECTED]
#-------------------------------------------------------#

  • [aseek-devel] PDF Converter Elsner