I've had problems getting htdig 3.1.6 (plus 3.2 see below) to run,
(aborts, core dumps).
So I thought the problem maybe PDF files so I created a test.conf file
with
#
bad_extensions: .pdf .wav .gz .z .sit .au .zip .tar .hqx .exe
.com .gif \
.jpg .jpeg .aiff .class .map .ram .tgz .bin .rpm .mpg .mov .avi
.css
I then delete all the database files and ran
/opt/htdig/bin/rundig -a /opt/htdig/conf/test.conf -c -v -v -v >
runtest.txt
The error output gave 5 aborts (If the above is run as a cron, I get
core dumps !)
Abort
Abort
Abort
Abort
Abort
The runtest.txt produced the following:-
/opt/htdig/db/htdig19798.pdf: The file is damaged and could not be
repaired.
/opt/htdig/db/htdig19798.pdf: The file is damaged and could not be
repaired.
/opt/htdig/db/htdig19798.pdf: The file is damaged and could not be
repaired.
PDF::parse: error running pdf_parser on
http://www.itri.bton.ac.uk/projects/agile/deliverables/agile-wp3-corp/ag
ile-wp3-corp.pdf
PDF::parse: error running pdf_parser on
http://www.itri.bton.ac.uk/projects/agile/deliverables/agile-wp5-texs2/a
gile-wp5-texs2.pdf
PDF::parse: cannot open acroread output from
http://www.itri.bton.ac.uk/projects/agile/deliverables/agile-wp5-texs3/a
gile-wp5-texs3.pdf
PDF::parse: cannot open acroread output from
http://www.itri.bton.ac.uk/projects/agile/deliverables/agile-wp6-spec2/a
gile-wp6-spec2.pdf
PDF::parse: cannot open acroread output from
http://www.itri.bton.ac.uk/projects/agile/deliverables/agile-wp6-7-spec3
-impl3/agile-wp6-7-spec3-impl3.pdf
PDF::parse: error running pdf_parser on
http://www.itri.bton.ac.uk/projects/agile/deliverables/agile-wp7-impl1/a
gile-wp7-impl1.pdf
PDF::parse: error running pdf_parser on
http://www.itri.bton.ac.uk/projects/agile/deliverables/agile-wp7-impl2/a
gile-wp7-impl2.pdf
If you look at the PDF files there fine, but htdig should not have
scanned them ?
I don''t have any external parsers set up (well I don't think so)
HTDIG 3.2 Problem
Tried a 3.2 Solaris 9 binary (20020920) by Neil Khol (Thanks Neil) but
got the libstdc++ message.
Added LD_LIBRARY_PATH=/usr/local/lib and then the rundig worked fine.
BUT running htsearch from the web page failed, error loags showed that
again, libstdc++ could not be found.
Tried adding variables to htsearch but nothing worked.
So downloaded 3.2.0b4 soure and complied as
>htdig-3.2.0b4-20030119]# ./configure --without-zlib
--prefix=/opt/htdig32
(I put the --without-zlib in because the configure complained that it
could not find the zlib, even though it's in /usr/local/include)
>checking if zlib is wanted... no
OK
>checking zlib.h usability... yes
>checking zlib.h presence... yes
??? I said not to use it ??
Don't know why/what this means
>updating cache /dev/null
>loading cache /dev/null
>./configure: .: /dev/null: not a regular file
So come to make and I get this error
>echo timestamp > mp_bh.lo
>/bin/bash ./libtool --mode=compile gcc -DHAVE_CONFIG_H -I. -I. -I.
-I./../htlib -D_REENTRANT -g -O2 -Wall -c mp_cmpr.c
>gcc -DHAVE_CONFIG_H -I. -I. -I. -I./../htlib -D_REENTRANT -g -O2 -Wall
-Wp,-MD,.deps/mp_cmpr.pp -c mp_cmpr.c -o mp_cmpr.o
>mp_cmpr.c: In function `CDB___memp_cmpr':
>mp_cmpr.c:140: `Z_DEFAULT_COMPRESSION' undeclared (first use in this
function)
>mp_cmpr.c:140: (Each undeclared identifier is reported only once
>mp_cmpr.c:140: for each function it appears in.)
>mp_cmpr.c: In function `CDB___memp_cmpr_read':
>mp_cmpr.c:214: warning: implicit declaration of function `memcpy'
>mp_cmpr.c: In function `CDB___memp_cmpr_page':
>mp_cmpr.c:508: warning: implicit declaration of function `memset'
>mp_cmpr.c: In function `CDB___memp_cmpr_open':
>mp_cmpr.c:761: warning: implicit declaration of function `strlen'
>make[1]: *** [mp_cmpr.lo] Error 1
>make[1]: Leaving directory `/backup/src/htdig-3.2.0b4-20030119/db'
>make: *** [all-recursive] Error 1
Z_DEFAULT_COMPRESSION !!, it this anything to do with zlib ?
Any ideas ?
Leigh Dodd (2 months of on/off trying to get htdig to work with solaris
9)
ITRI
Ext 2925
-------------------------------------------------------
This SF.NET email is sponsored by:
SourceForge Enterprise Edition + IBM + LinuxWorld = Something 2 See!
http://www.vasoftware.com
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html