Hello,
I'm an Italian engineer (so sorry for my English) :-) and I'm evaluating
htdig for use on the University of Naples website. It's a very good search
engine, but I've run into a problem: I managed to index .doc, .rtf, .pdf,
.ps and .ppt files, but I can't index .xls files.
I'm currently testing on RedHat 7.1. These are the options I added to my
htdig.conf file:
external_parsers: application/rtf->text/html /usr/local/scripts/doc2html.pl \
    text/rtf->text/html /usr/local/scripts/doc2html.pl \
    application/pdf->text/html /usr/local/scripts/doc2html.pl \
    application/postscript->text/html /usr/local/scripts/doc2html.pl \
    application/msword->text/html /usr/local/scripts/doc2html.pl \
    application/msexcel->text/html /usr/local/scripts/doc2html.pl \
    application/vnd.ms-excel->text/html /usr/local/scripts/doc2html.pl \
    application/vnd.ms-powerpoint->text/html /usr/local/scripts/doc2html.pl
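As a sanity check on the attribute above, here is a minimal sketch (in Python, purely for illustration) that splits an external_parsers value into its type/command pairs, so a missing or misplaced entry stands out. The function name parse_external_parsers and the SAMPLE string are hypothetical, and the sketch assumes every command is a single path with no embedded spaces or quoting.

```python
def parse_external_parsers(value):
    """Split an external_parsers value into (source, target, command) triples.

    Assumes each entry is a 'type->type' spec followed by a single-token command.
    """
    # Join backslash-continued lines, then split on any whitespace.
    tokens = value.replace("\\\n", " ").split()
    if len(tokens) % 2 != 0:
        raise ValueError("expected pairs of 'type->type' and command")
    entries = []
    for spec, command in zip(tokens[0::2], tokens[1::2]):
        source, sep, target = spec.partition("->")
        # target is None when the spec has no '->' part.
        entries.append((source, target if sep else None, command))
    return entries


# Hypothetical sample: just the two Excel entries from the attribute above.
SAMPLE = (
    "application/msexcel->text/html /usr/local/scripts/doc2html.pl \\\n"
    "application/vnd.ms-excel->text/html /usr/local/scripts/doc2html.pl"
)

for source, target, command in parse_external_parsers(SAMPLE):
    print(source, "->", target, "via", command)
```

Both Excel MIME types should come out paired with doc2html.pl; an odd number of tokens raises a ValueError instead of silently shifting the pairs.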
I installed xlHtml-0.2.6-2 as the Excel parser. The package also includes a
PowerPoint parser (pptHtml). What I can't understand is that pptHtml works
fine when it's called from doc2html.pl, but xlHtml doesn't :-(((.
I tested xlHtml from the command line and it works great, but I don't know
why it fails when called from doc2html.pl. In the Perl script, the lines
that handle the parsing are almost the same for .ppt and .xls files.
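One way to narrow this down might be to call doc2html.pl by hand the way htdig would, rather than calling xlHtml directly. The sketch below (Python, for illustration only) assumes the external parser receives four arguments (input file, content type, URL, configuration file), as the external_parsers documentation describes. The helper names, the sample file, the URL and the config path in the comment are all hypothetical.

```python
import subprocess
import sys


def build_parser_argv(command, infile, content_type, url, config_file):
    """Build the argument list assumed for an external parser call."""
    return [command, infile, content_type, url, config_file]


def run_parser(argv, env=None):
    """Run the command and return (exit_code, stdout, stderr) as text."""
    result = subprocess.run(
        argv,
        env=env,  # pass a small dict to imitate a stripped-down environment
        stdout=subprocess.PIPE,
        stderr=subprocess.PIPE,
        universal_newlines=True,
    )
    return result.returncode, result.stdout, result.stderr


# Example call (hypothetical paths; adjust to the local setup):
#
#   argv = build_parser_argv(
#       "/usr/local/scripts/doc2html.pl",
#       "/tmp/sample.xls",
#       "application/vnd.ms-excel",
#       "http://localhost/sample.xls",
#       "/etc/htdig/htdig.conf",
#   )
#   code, out, err = run_parser(argv, env={"PATH": "/bin:/usr/bin"})
#   print(code, len(out), err, file=sys.stderr)
```

If the script prints an error or empty output here while the .ppt case works, the difference is more likely in how doc2html.pl locates or invokes xlHtml than in htdig itself.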
Please help me!
--
See you soon!
Pietro Palladino
<[EMAIL PROTECTED]>