Try running it with

rundig -i -vvv

as we need to see what MIME type your server gives for files of type .djvu
(By the way, what is a .djvu file?)
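
You can also ask the web server directly which MIME type it sends, without
going through htdig. The following is only a minimal sketch: it assumes curl
is installed, and content_type is a hypothetical helper name, not part of
ht://Dig. It reads HTTP response headers on standard input and prints the
value of the Content-Type header.

```shell
# Print the value of the Content-Type header from HTTP response
# headers read on standard input (hypothetical helper, not part of ht://Dig).
content_type() {
    # Strip carriage returns, then match the header name case-insensitively.
    tr -d '\r' | awk -F': *' 'tolower($1) == "content-type" { print $2; exit }'
}

# Usage, assuming curl is installed and the URL from the log is reachable:
#   curl -sI http://localhost/files/test2.djvu | content_type
```

Whatever type this prints is the one that has to appear on the left-hand side
of the matching external_parsers entry.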

You don't seem to have any .pdf files to be indexed: the only one in your
log, tty.pdf, is reported as "not found".

I second Adrian Bolzan's recommendation that you move from parse_doc.pl to
doc2html.pl.
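
As a rough sketch of what that change might look like in the config file,
assuming doc2html.pl is installed under /usr/share/htdig (that path is a
guess, so adjust it to wherever the script actually lives). doc2html.pl is
used as a converter, so each entry names the output type after "->":

```
external_parsers: application/msword->text/html /usr/share/htdig/doc2html.pl \
                  application/postscript->text/html /usr/share/htdig/doc2html.pl \
                  application/pdf->text/html /usr/share/htdig/doc2html.pl
```

The script itself also has to be edited to point at the helper programs it
calls; see the comments at the top of the script for the details.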

--
David Adams
Computing Services
Southampton University


----- Original Message -----
From: "Tom Sawyer" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Sent: Thursday, October 24, 2002 3:53 AM
Subject: [htdig] pdf and djvu indexing problems


> i'm trying to get ht://dig configured and working. but for the life of
> me i can't get it to index my pdf and djvu documents.
>
> i'm running debian woody so i thought the default configuration would
> work for at least the pdfs. here are the relevant parts of my config file:
>
>
> max_doc_size:     9999999
>
> external_parsers: application/msword /usr/share/htdig/parse_doc.pl \
>                   application/postscript /usr/share/htdig/parse_doc.pl \
>                   application/pdf /usr/share/htdig/parse_doc.pl \
>                   application/djvu->text/plain /usr/local/bin/djvutxt
>
> debian_pdf_parser: xpdf
>
> WHEN I RUN:
>
> rundig -i -v
>
> I GET THIS:
>
> New server: localhost, 80
> 0:0:0:http://localhost/files/: ++++-++++ size = 1340
> 1:1:1:http://localhost/files/?N=D: +***-**** size = 1340
> 2:2:1:http://localhost/files/?M=A: *+**-**** size = 1340
> 3:3:1:http://localhost/files/?S=A: **+*-**** size = 1340
> 4:4:1:http://localhost/files/?D=A: ***+-**** size = 1340
> 5:5:1:http://localhost/files/test2.djvu:  not HTML
> 6:6:1:http://localhost/files/text1.djvu:  not HTML
> 7:7:1:http://localhost/files/tty.pdf:  not found
> 8:8:1:http://localhost/files/word.rhtml:  size = 796
> 9:9:2:http://localhost/files/?N=A: ****-**** size = 1340
> 10:10:2:http://localhost/files/?M=D: ****-**** size = 1340
> 11:11:2:http://localhost/files/?S=D: ****-**** size = 1340
> 12:12:2:http://localhost/files/?D=D: ****-**** size = 1340
> htmerge: Sorting...
> htmerge: Removing doc #5
> htmerge: Removing doc #6
> htmerge: Removing doc #7
> htmerge: Merging...
>
> Deleted, no excerpt: 5/http://localhost/files/test2.djvu
> Deleted, no excerpt: 6/http://localhost/files/text1.djvu
> Deleted, no excerpt: 7/http://localhost/files/tty.pdf
> htmerge: 10
>
> WHAT AM I DOING WRONG? IS THERE SOMETHING I HAVE TO DO TO GET MY CONFIG
> FILE TO REGISTER EACH TIME I CHANGE IT? PLEASE HELP. THANKS.
>
> --
> tom sawyer, aka transami
> [EMAIL PROTECTED]
>
>
>
>
> _______________________________________________
> htdig-general mailing list <[EMAIL PROTECTED]>
> To unsubscribe, send a message to <[EMAIL PROTECTED]> with a subject of
> unsubscribe
> FAQ: http://htdig.sourceforge.net/FAQ.html
>


