As a reminder, this setup works fine on .DOC files but not on PDF. Running
pdf2html from the command line outputs the text perfectly. I ran htdig
with -vvvv -s, with the start URL set to a single PDF file; the output is
below.

Here are my external parser lines in the .conf file:


external_parsers: application/msword->text/html
/home/ch/staff/fredrick/htdig/contrib/doc2html/doc2html.pl \
                     application/pdf->text/html
/home/ch/staff/fredrick/htdig/contrib/doc2html/doc2html.pl \
                     application/postscript->text/html
/home/ch/staff/fredrick/htdig/contrib/doc2html/doc2html.pl
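
Since mail clients tend to re-wrap these lines, here is a minimal Python sketch of one way to sanity-check the setting. It assumes the value is a whitespace-separated list of "type command" pairs joined by trailing backslashes; the function name parse_external_parsers is made up for illustration and is not part of ht://Dig.

```python
import re
import shlex


def parse_external_parsers(value):
    """Split an external_parsers value into (content_type, command) pairs.

    Assumption: the value is a flat list of alternating content types and
    commands, with a trailing backslash marking a continued line.
    """
    # Join continued lines: a backslash, optional blanks, then a newline.
    joined = re.sub(r"\\[ \t]*\n", " ", value)
    # shlex keeps a quoted command with arguments together as one token.
    tokens = shlex.split(joined)
    if len(tokens) % 2 != 0:
        raise ValueError("odd number of tokens: a type or a command is missing")
    return [(tokens[i], tokens[i + 1]) for i in range(0, len(tokens), 2)]


if __name__ == "__main__":
    setting = (
        "application/msword->text/html "
        "/home/ch/staff/fredrick/htdig/contrib/doc2html/doc2html.pl \\\n"
        "    application/pdf->text/html "
        "/home/ch/staff/fredrick/htdig/contrib/doc2html/doc2html.pl"
    )
    for content_type, command in parse_external_parsers(setting):
        print(content_type, "=>", command)
```

An odd token count would suggest that a type and its command ended up on separate lines without a continuation backslash, which is worth ruling out before looking elsewhere.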


Here is the output from htdig:

1:1:http://www.mydomain.com/minutes/facultyMeetingMinutes11_09_2000.pdf
New server: www.mydomain.com, 80
Retrieval command for http://www.mydomain.com/robots.txt: GET /robots.txt
HTTP/1.0
User-Agent: htdig/3.1.6 ([EMAIL PROTECTED])
Authorization: Basic d2ViOmNoZXdlYjY2Ng==
Host: www.mydomain.com

Header line: HTTP/1.1 404 Not Found
Header line: Date: Mon, 08 Apr 2002 15:40:06 GMT
Header line: Server: Apache/1.3.9 (Unix) PHP/3.0.14 mod_perl/1.21
mod_ssl/2.4.10 OpenSSL/0.9.4
Header line: Last-Modified: Mon, 25 Feb 2002 21:25:43 GMT
Converted Mon, 25 Feb 2002 21:25:43 GMT to Mon, 25 Feb 2002 21:25:43
Header line: ETag: "24867-52-3c7aabd7"
Header line: Accept-Ranges: bytes
Header line: Content-Length: 82
Header line: Connection: close
Header line: Content-Type: text/html
Header line:
returnStatus = 1
 pushed
pick: www.mydomain.com, # servers = 1
0:0:0:http://www.mydomain.com/minutes/facultyMeetingMinutes11_09_2000.pdf:
Retrieval command for
http://www.mydomain.com/minutes/facultyMeetingMinutes11_09_2000.pdf: GET
/minutes/facultyMeetingMinutes11_09_2000.pdf HTTP/1.0
User-Agent: htdig/3.1.6 ([EMAIL PROTECTED])
Authorization: Basic d2ViOmNoZXdlYjY2Ng==
Host: www.mydomain.com

Header line: HTTP/1.1 200 OK
Header line: Date: Mon, 08 Apr 2002 15:40:06 GMT
Header line: Server: Apache/1.3.9 (Unix) PHP/3.0.14 mod_perl/1.21
mod_ssl/2.4.10 OpenSSL/0.9.4
Header line: Last-Modified: Tue, 02 Apr 2002 19:22:36 GMT
Converted Tue, 02 Apr 2002 19:22:36 GMT to Tue, 02 Apr 2002 19:22:36
Header line: ETag: "2d53e-22ea-3caa04fc"
Header line: Accept-Ranges: bytes
Header line: Content-Length: 8938
Header line: Connection: close
Header line: Content-Type: application/pdf
Header line:
returnStatus = 0
Read 8192 from document
Read 746 from document
Read a total of 8938 bytes
PDF::setContents(8938 bytes)
PDF::parse(http://www.mydomain.com/minutes/facultyMeetingMinutes11_09_2000.pdf)
PDF::parseNonTextLine: title is "CHEMICAL AND FUELS ENGINEERING"

title: CHEMICAL AND FUELS ENGINEERING
PDF::parseNonTextLine: total pages is 1
PDF::parseNonTextLine: start page 1
PDF::parseNonTextLine: begin text block
PDF::parseTextLine("/N18 1 Tf") cmd=Tf
PDF::parseTextLine("9.95919 0 0 9.95919 90.03318 747.00379 Tm") cmd=Tm
PDF::parseTextLine("/N21 /ColorSpace findRes cs") cmd=cs
PDF::parseTextLine("0 0 0 sc") cmd=sc
PDF::parseTextLine("/N25 /ExtGState findRes gs") cmd=gs
PDF::parseTextLine("0 Tc") cmd=Tc
PDF::parseTextLine("0 Tw") cmd=Tw
PDF::parseTextLine("( )Tj ") cmd=
PDF::parseTextLine("0 -71.13249 TD") cmd=TD
PDF::parseTextLine("( )Tj ") cmd=
PDF::parseTextLine("/N20 1 Tf") cmd=Tf
PDF::parseTextLine("11.99899 0 0 11.99899 90.03318 708.96688 Tm") cmd=Tm
PDF::parseTextLine("0.00178 Tc") cmd=Tc
PDF::parseTextLine("0.00039 Tw") cmd=Tw
PDF::parseTextLine("(C)Tj [4 ]TJ (H)Tj [4 ]TJ (E)Tj [-1.19999 ]TJ (M)Tj
[4.79998 ]TJ (I)Tj [-0.39999 ]TJ (C)Tj [-16 ]TJ (A)Tj [24 ]TJ (L )Tj
[-20 ]TJ (A)Tj [24 ]TJ (N)Tj [4 ]TJ (D)Tj [4 ]TJ ( FU)Tj [4 ]TJ (E)Tj
[-11.19999 ]TJ (L)Tj [2.59999 ]TJ (S EN)Tj [4 ]TJ (GIN)Tj [4 ]TJ (EER)Tj
[4 ]TJ (IN)Tj [4 ]TJ (G )Tj ") cmd=
PDF::parseTextLine("0 -1.14999 TD") cmd=TD
PDF::parseTextLine("0.00219 Tc") cmd=Tc
PDF::parseTextLine("-0.00009 Tw") cmd=Tw
PDF::parseTextLine("(F)Tj [-7 ]TJ (A)Tj [24.39999 ]TJ (C)Tj [-5.59999 ]TJ
(U)Tj [4.39999 ]TJ (LT)Tj [-7 ]TJ (Y)Tj [9.19999 ]TJ ( M)Tj [5.19999 ]TJ
(EETING \226 11)Tj [8.29998 ]TJ (/09)Tj [8.29998 ]TJ (/)Tj [0 ]TJ (00 )Tj ")
cmd=
PDF::parseTextLine("T*") cmd=T*
PDF::parseTextLine("0 Tc") cmd=Tc
PDF::parseTextLine("0 Tw") cmd=Tw
PDF::parseTextLine("( )Tj ") cmd=
PDF::parseTextLine("/N22 1 Tf") cmd=Tf
PDF::parseTextLine("T*") cmd=T*
PDF::parseTextLine("0.00209 Tc") cmd=Tc
PDF::parseTextLine("0.00009 Tw") cmd=Tw
PDF::parseTextLine("(Pr)Tj [5.09999 ]TJ (esent)Tj [9.89999 ]TJ (:  S)Tj
[9.09999 ]TJ (a)Tj [-1.69999 ]TJ (r)Tj [5.09999 ]TJ (o)Tj [8.19999 ]TJ (f)Tj
[-10.09999 ]TJ (i)Tj [4.29998 ]TJ (m)Tj [5.09999 ]TJ (,)Tj [-0.09999 ]TJ (
P)Tj [9.09999 ]TJ (u)Tj [-1.79998 ]TJ (g)Tj [8.19999 ]TJ (m)Tj [-4.89999 ]TJ
(ir)Tj [5.09999 ]TJ (e, Silcox)Tj [12.09999 ]TJ (, de)Tj [8.19999 ]TJ (
Nev)Tj [12.09999 ]TJ (e)Tj [-1.79998 ]TJ (r)Tj [5.09999 ]TJ (s)Tj
[2.09999 ]TJ (, Rose, )Tj [10 ]TJ (Eddi)Tj [14.29998 ]TJ (ng)Tj [8.19999 ]TJ
(s, Bodi)Tj [14.29998 ]TJ (ly)Tj [12.09999 ]TJ (, B. T)Tj [-7.09999 ]TJ
(y)Tj [12.09999 ]TJ (ler)Tj [5.09999 ]TJ (,)Tj [-0.09999 ]TJ ( )Tj ") cmd=
PDF::parseTextLine("T*") cmd=T*
PDF::parseTextLine("0.00299 Tc") cmd=Tc
PDF::parseTextLine("-0.00079 Tw") cmd=Tw
PDF::parseTextLine("(R)Tj [5.19999 ]TJ (appa)Tj [9.09999 ]TJ (por)Tj [6 ]TJ
(t, )Tj [10 ]TJ (P. S)Tj [10 ]TJ (m)Tj [-4 ]TJ (i)Tj [5.19999 ]TJ (t)Tj
[0.79998 ]TJ (h)Tj [9.09999 ]TJ (,)Tj [0.79998 ]TJ ( S)Tj [10 ]TJ (l)Tj
[5.19999 ]TJ (aug)Tj [9.09999 ]TJ (hter)Tj [6 ]TJ (, Kr)Tj [6 ]TJ (a)Tj
[9.09999 ]TJ (h)Tj [-0.89999 ]TJ (e)Tj [9.09999 ]TJ (nbu)Tj [9.09999 ]TJ
(hl)Tj [5.19999 ]TJ (, )Tj [10 ]TJ (Zmi)Tj [5.19999 ]TJ (e)Tj [-0.89999 ]TJ
(r)Tj [6 ]TJ (c)Tj [3 ]TJ (z)Tj [13 ]TJ (a)Tj [-0.89999 ]TJ (k, M)Tj [6 ]TJ
(a)Tj [-0.79998 ]TJ (g)Tj [9.09999 ]TJ (d)Tj [-0.89999 ]TJ (a, P)Tj [10 ]TJ
(e)Tj [9.09999 ]TJ (ter)Tj [6 ]TJ (s)Tj [3 ]TJ (on, )Tj ") cmd=
PDF::parseTextLine("T*") cmd=T*
PDF::parseTextLine("0.00158 Tc") cmd=Tc
PDF::parseTextLine("0.00059 Tw") cmd=Tw
PDF::parseTextLine("(Lig)Tj [7.69999 ]TJ (h)Tj [-2.29998 ]TJ (ty)Tj
[11.59999 ]TJ (; Absent:)Tj [9.39999 ]TJ (  Deo)Tj [7.69999 ]TJ ( \()Tj
[4.59999 ]TJ (m)Tj [-5.39999 ]TJ (edic)Tj [11.59999 ]TJ (a)Tj [-2.29998 ]TJ
(l\))Tj [4.59999 ]TJ (, JDS \()Tj [4.59999 ]TJ (o)Tj [7.69999 ]TJ (ff se)Tj
[7.69999 ]TJ (m)Tj [4.59999 ]TJ (e)Tj [-2.09999 ]TJ (ster)Tj [4.59999 ]TJ
(\))Tj [4.59999 ]TJ (,)Tj [-0.59999 ]TJ ( EM)Tj [14.59999 ]TJ (T)Tj
[-7.59999 ]TJ ( \()Tj [4.59999 ]TJ (s)Tj [1.59999 ]TJ (ab)Tj [7.69999 ]TJ
(batica)Tj [7.69999 ]TJ (l\))Tj [4.59999 ]TJ (, JVF \()Tj [4.59999 ]TJ
(out )Tj [10 ]TJ (o)Tj [7.69999 ]TJ (f)Tj [-10.59999 ]TJ ( )Tj ") cmd=
PDF::parseTextLine("T*") cmd=T*
PDF::parseTextLine("-0.00028 Tc") cmd=Tc
PDF::parseTextLine("0.00248 Tw") cmd=Tw
PDF::parseTextLine("(to)Tj [-4.19999 ]TJ (w)Tj [11.89999 ]TJ (n)Tj
[-4.19999 ]TJ (\), G. Sm)Tj [-7.29998 ]TJ (ith)Tj [-4.19999 ]TJ (,)Tj
[-2.5 ]TJ ( F)Tj [10.5 ]TJ (V)Tj [-3.29998 ]TJ (H, )Tj [10 ]TJ (HL)Tj
[-4.19999 ]TJ (CM, T)Tj [-9.5 ]TJ (A)Tj [-3.29998 ]TJ (R, MS)Tj
[-3.19999 ]TJ ( )Tj ") cmd=
PDF::parseTextLine("T*") cmd=T*
PDF::parseTextLine("0 Tc") cmd=Tc
PDF::parseTextLine("0 Tw") cmd=Tw
PDF::parseTextLine("( )Tj ") cmd=
PDF::parseTextLine("T*") cmd=T*
PDF::parseTextLine("0.00199 Tc") cmd=Tc
PDF::parseTextLine("0.00019 Tw") cmd=Tw
PDF::parseTextLine("(In)Tj [8.09999 ]TJ (f)Tj [-10.19999 ]TJ (o)Tj
[-1.89999 ]TJ (r)Tj [15 ]TJ (m)Tj [-5 ]TJ (atio)Tj [8.09999 ]TJ (n ite)Tj
[8.09999 ]TJ (m)Tj [-5 ]TJ (s c)Tj [12 ]TJ (o)Tj [-1.89999 ]TJ (v)Tj [12 ]TJ
(e)Tj [-1.89999 ]TJ (r)Tj [5 ]TJ (ed.  A)Tj [9 ]TJ ( sug)Tj [8.09999 ]TJ
(g)Tj [8.09999 ]TJ (e)Tj [-1.89999 ]TJ (stion w)Tj [14.19999 ]TJ (a)Tj
[-1.89999 ]TJ (s ma)Tj [8.09999 ]TJ (de t)Tj [9.79998 ]TJ (o)Tj
[-1.89999 ]TJ ( inv)Tj [12 ]TJ (i)Tj [4.19999 ]TJ (te SAC c)Tj [12 ]TJ (h)Tj
[8.09999 ]TJ (air)Tj [5 ]TJ (s)Tj [2 ]TJ ( to )Tj ") cmd=
PDF::parseTextLine("T*") cmd=T*
PDF::parseTextLine("0.00329 Tc") cmd=Tc
PDF::parseTextLine("-0.00109 Tw") cmd=Tw
PDF::parseTextLine("(atte)Tj [9.39999 ]TJ (nd )Tj [10 ]TJ (facul)Tj [5.5 ]TJ
(t)Tj [1.09999 ]TJ (y)Tj [13.29998 ]TJ ( me)Tj [9.39999 ]TJ (eti)Tj [5.5 ]TJ
(n)Tj [-0.59999 ]TJ (g)Tj [19.39999 ]TJ (s)Tj [3.29998 ]TJ (.  C)Tj [5.5 ]TJ
(o)Tj [-0.59999 ]TJ (m)Tj [6.29998 ]TJ (m)Tj [6.29998 ]TJ (ents:)Tj
[11.09999 ]TJ (  )Tj [20 ]TJ (W)Tj [-22.89999 ]TJ (e)Tj [9.39999 ]TJ (l)Tj
[5.5 ]TJ (c)Tj [3.29998 ]TJ (o)Tj [9.39999 ]TJ (m)Tj [-3.69999 ]TJ (e i)Tj
[5.5 ]TJ (t)Tj [11.09999 ]TJ (ems)Tj [13.29998 ]TJ ( )Tj [10 ]TJ (f)Tj
[-8.89999 ]TJ (o)Tj [-0.5 ]TJ (r)Tj [6.29998 ]TJ ( ag)Tj [9.39999 ]TJ (e)Tj
[9.39999 ]TJ (n)Tj [-0.59999 ]TJ (da)Tj [9.39999 ]TJ ( )Tj [10 ]TJ (f)Tj
[-8.89999 ]TJ (r)Tj [6.29998 ]TJ (om )Tj [10 ]TJ (SAC)Tj [5.5 ]TJ ( )Tj ")
cmd=
PDF::parseTextLine("T*") cmd=T*
PDF::parseTextLine("0.00219 Tc") cmd=Tc
PDF::parseTextLine("0 Tw") cmd=Tw
PDF::parseTextLine("(me)Tj [8.29998 ]TJ (mb)Tj [8.29998 ]TJ (er)Tj
[5.19999 ]TJ (s; inv)Tj [12.19999 ]TJ (i)Tj [4.39999 ]TJ (te SAC t)Tj
[10 ]TJ (o)Tj [-1.69999 ]TJ ( all m)Tj [5.19999 ]TJ (eeti)Tj [14.39999 ]TJ
(n)Tj [-1.69999 ]TJ (g)Tj [8.29998 ]TJ (s)Tj [2.19999 ]TJ ( w)Tj [14.5 ]TJ
(i)Tj [4.39999 ]TJ (th special inv)Tj [12.19999 ]TJ (i)Tj [4.39999 ]TJ
(tation to )Tj [10 ]TJ (m)Tj [5.19999 ]TJ (e)Tj [-1.59999 ]TJ (etin)Tj
[8.29998 ]TJ (g)Tj [8.29998 ]TJ (s)Tj [2.19999 ]TJ ( that )Tj [10 ]TJ
(mig)Tj [8.29998 ]TJ (h)Tj [-1.69999 ]TJ (t )Tj ") cmd=
PDF::parseTextLine("T*") cmd=T*
PDF::parseTextLine("0.00289 Tc") cmd=Tc
PDF::parseTextLine("-0.00068 Tw") cmd=Tw
PDF::parseTextLine("(hav)Tj [12.89999 ]TJ (e ag)Tj [9 ]TJ (end)Tj [9 ]TJ
(a)Tj [-1 ]TJ ( i)Tj [5.09999 ]TJ (t)Tj [0.69999 ]TJ (e)Tj [9 ]TJ (m)Tj
[-4.09999 ]TJ (s s)Tj [12.89999 ]TJ (p)Tj [9 ]TJ (eci)Tj [5.09999 ]TJ (f)Tj
[-9.29998 ]TJ (i)Tj [5.09999 ]TJ (c)Tj [12.89999 ]TJ (al)Tj [5.09999 ]TJ
(l)Tj [5.09999 ]TJ (y)Tj [12.89999 ]TJ ( i)Tj [5.09999 ]TJ (m)Tj
[-4.09999 ]TJ (pacti)Tj [5.09999 ]TJ (ng)Tj [9 ]TJ ( stu)Tj [9 ]TJ (dents)Tj
[12.89999 ]TJ (.)Tj [0.69999 ]TJ (  Ex)Tj [12.89999 ]TJ (cuse )Tj [10 ]TJ
(SAC)Tj [5.09999 ]TJ ( )Tj [10 ]TJ (m)Tj [5.89999 ]TJ (e)Tj [-1 ]TJ (mb)Tj
[9 ]TJ (er)Tj [5.89999 ]TJ (s )Tj ") cmd=
PDF::parseTextLine("T*") cmd=T*
PDF::parseTextLine("0.00318 Tc") cmd=Tc
PDF::parseTextLine("-0.01098 Tw") cmd=Tw
PDF::parseTextLine("(w)Tj [15.39999 ]TJ (hen/i)Tj [5.39999 ]TJ (f)Tj [-9 ]TJ
( ag)Tj [9.29998 ]TJ (end)Tj [9.29998 ]TJ (a )Tj [-10 ]TJ (i)Tj [5.39999 ]TJ
(t)Tj [1 ]TJ (e)Tj [9.29998 ]TJ (m)Tj [-3.79998 ]TJ (s ar)Tj [6.19999 ]TJ
(e )Tj [-10 ]TJ (o)Tj [9.29998 ]TJ (f)Tj [-9 ]TJ ( a )Tj [-10 ]TJ (co)Tj
[9.29998 ]TJ (n)Tj [9.29998 ]TJ (f)Tj [-9 ]TJ (i)Tj [5.39999 ]TJ (d)Tj
[9.29998 ]TJ (enti)Tj [5.39999 ]TJ (a)Tj [-0.69999 ]TJ (l)Tj [5.39999 ]TJ
( na)Tj [9.29998 ]TJ (tur)Tj [6.19999 ]TJ (e)Tj [-0.69999 ]TJ (. )Tj ") cmd=
PDF::parseTextLine("T*") cmd=T*
PDF::parseTextLine("0 Tc") cmd=Tc
PDF::parseTextLine("0 Tw") cmd=Tw
PDF::parseTextLine("( )Tj ") cmd=
PDF::parseTextLine("T*") cmd=T*
PDF::parseTextLine("-0.00459 Tc") cmd=Tc
PDF::parseTextLine("0.00679 Tw") cmd=Tw
PDF::parseTextLine("(P)Tj [-7.59999 ]TJ (r)Tj [-1.59999 ]TJ (o)Tj [-8.5 ]TJ
(p)Tj [-8.39999 ]TJ (o)Tj [-8.5 ]TJ (s)Tj [5.39999 ]TJ (e)Tj [-8.5 ]TJ (d)Tj
[-8.5 ]TJ ( Cop)Tj [-8.5 ]TJ (y)Tj [5.39999 ]TJ (ri)Tj [-12.39999 ]TJ (gh)Tj
[-8.5 ]TJ (t)Tj [-6.79998 ]TJ ( Po)Tj [-8.5 ]TJ (lic)Tj [-4.59999 ]TJ (y:)Tj
[-6.79998 ]TJ (  Dis)Tj [-4.59999 ]TJ (c)Tj [-4.59999 ]TJ (u)Tj [-8.5 ]TJ
(s)Tj [-4.59999 ]TJ (s)Tj [-4.59999 ]TJ (e)Tj [-8.5 ]TJ (d)Tj [-8.5 ]TJ (
a)Tj [-8.5 ]TJ (t)Tj [-6.79998 ]TJ ( lengt)Tj [-6.79998 ]TJ (h)Tj [-8.5 ]TJ
( \(b)Tj [-8.5 ]TJ (o)Tj [-8.5 ]TJ (t)Tj [-6.79998 ]TJ (h)Tj [-8.5 ]TJ (
i)Tj [7.59999 ]TJ (n)Tj [-8.39999 ]TJ ( t)Tj [-6.79998 ]TJ (h)Tj [-8.5 ]TJ
(is)Tj [-4.59999 ]TJ ( )Tj [10 ]TJ (m)Tj [-11.59999 ]TJ (e)Tj [1.5 ]TJ (e)Tj
[-8.5 ]TJ (t)Tj [-6.79998 ]TJ (ing a)Tj [-8.39999 ]TJ (n)Tj [-8.5 ]TJ (d)Tj
[-8.39999 ]TJ ( in)Tj [-8.5 ]TJ ( )Tj ") cmd=
PDF::parseTextLine("T*") cmd=T*
PDF::parseTextLine("0.00299 Tc") cmd=Tc
PDF::parseTextLine("-0.00079 Tw") cmd=Tw
PDF::parseTextLine("(Acade)Tj [9.09999 ]TJ (mi)Tj [5.19999 ]TJ (c S)Tj
[10 ]TJ (e)Tj [-0.89999 ]TJ (n)Tj [9.09999 ]TJ (a)Tj [-0.89999 ]TJ (te )Tj
[10 ]TJ (me)Tj [9.09999 ]TJ (e)Tj [9.09999 ]TJ (t)Tj [0.79998 ]TJ (i)Tj
[5.19999 ]TJ (n)Tj [-0.89999 ]TJ (g)Tj [9.09999 ]TJ (\))Tj [6 ]TJ (.  C)Tj
[5.19999 ]TJ (H)Tj [5.19999 ]TJ (F)Tj [3.79998 ]TJ (EN)Tj [5.19999 ]TJ (
Facul)Tj [5.19999 ]TJ (t)Tj [0.79998 ]TJ (y)Tj [13 ]TJ ( ag)Tj [9.09999 ]TJ
(r)Tj [6 ]TJ (eed to)Tj [9.09999 ]TJ ( i)Tj [5.19999 ]TJ (n)Tj [9.09999 ]TJ
(f)Tj [-9.19999 ]TJ (o)Tj [-0.79998 ]TJ (r)Tj [16 ]TJ (m)Tj [-4 ]TJ ( D)Tj
[5.19999 ]TJ (M)Tj [6 ]TJ (B )Tj [10 ]TJ (o)Tj [9.19999 ]TJ (f)Tj
[-9.19999 ]TJ ( th)Tj [9.09999 ]TJ (ei)Tj [5.19999 ]TJ (r)Tj [6 ]TJ ( )Tj ")
cmd=
PDF::parseTextLine("T*") cmd=T*
PDF::parseTextLine("0.00318 Tc") cmd=Tc
PDF::parseTextLine("-0.00099 Tw") cmd=Tw
PDF::parseTextLine("(opi)Tj [5.39999 ]TJ (ni)Tj [5.39999 ]TJ (ons)Tj
[13.19999 ]TJ (/sug)Tj [9.29998 ]TJ (g)Tj [9.29998 ]TJ (e)Tj [-0.59999 ]TJ
(sti)Tj [5.39999 ]TJ (ons )Tj [10 ]TJ (o)Tj [9.29998 ]TJ (n)Tj [-0.69999 ]TJ
( th)Tj [9.29998 ]TJ (e m)Tj [6.19999 ]TJ (a)Tj [-0.69999 ]TJ (tt)Tj [11 ]TJ
(er)Tj [6.19999 ]TJ ( \()Tj [6.19999 ]TJ (i)Tj [5.39999 ]TJ (n)Tj
[9.29998 ]TJ (f)Tj [-9 ]TJ (or)Tj [6.19999 ]TJ (m)Tj [6.19999 ]TJ (a)Tj
[-0.69999 ]TJ (ti)Tj [15.39999 ]TJ (on d)Tj [9.29998 ]TJ (e)Tj [-0.69999 ]TJ
(tai)Tj [5.39999 ]TJ (l)Tj [5.39999 ]TJ (e)Tj [9.29998 ]TJ (d i)Tj
[5.39999 ]TJ (n)Tj [-0.69999 ]TJ ( )Tj [10 ]TJ (pap)Tj [9.29998 ]TJ (e)Tj
[-0.69999 ]TJ (r)Tj [6.19999 ]TJ ( pr)Tj [16.19999 ]TJ (esent)Tj [11 ]TJ
(ed )Tj [10 ]TJ (to )Tj ") cmd=
PDF::parseTextLine("T*") cmd=T*
PDF::parseTextLine("-0.00109 Tc") cmd=Tc
PDF::parseTextLine("0.00329 Tw") cmd=Tw
PDF::parseTextLine("(th)Tj [-5 ]TJ (e)Tj [-5 ]TJ ( CHFE)Tj [-4.09999 ]TJ
(N)Tj [1.09999 ]TJ ( )Tj [10 ]TJ (fa)Tj [-5 ]TJ (cu)Tj [-5 ]TJ (lty)Tj
[8.89999 ]TJ ( b)Tj [-5 ]TJ (y)Tj [8.89999 ]TJ ( P)Tj [-4.09999 ]TJ (h)Tj
[-5 ]TJ (il S)Tj [-4.09999 ]TJ (m)Tj [-8.09999 ]TJ (it)Tj [6.69999 ]TJ (h)Tj
[-5 ]TJ (,)Tj [-3.29998 ]TJ ( w)Tj [11.09999 ]TJ (h)Tj [-5 ]TJ (o)Tj [-5 ]TJ
( is on)Tj [-5 ]TJ ( t)Tj [6.69999 ]TJ (h)Tj [-5 ]TJ (e)Tj [-5 ]TJ ( Un)Tj
[-4.89999 ]TJ (iv)Tj [8.89999 ]TJ (e)Tj [-4.89999 ]TJ (r)Tj [1.89999 ]TJ
(sity)Tj [8.89999 ]TJ ( co)Tj [-5 ]TJ (m)Tj [-8.09999 ]TJ (m)Tj
[-8.09999 ]TJ (i)Tj [1.09999 ]TJ (tt)Tj [6.69999 ]TJ (ee)Tj [-5 ]TJ (\) )Tj
[10 ]TJ (f)Tj [-13.29998 ]TJ (o)Tj [-5 ]TJ (r in)Tj [-5 ]TJ (pu)Tj
[-4.89999 ]TJ (t )Tj [10 ]TJ (a)Tj [-4.89999 ]TJ (t)Tj [-3.29998 ]TJ ( )Tj
") cmd=
PDF::parseTextLine("T*") cmd=T*
PDF::parseTextLine("-0.00309 Tc") cmd=Tc
PDF::parseTextLine("0.00529 Tw") cmd=Tw
PDF::parseTextLine("(th)Tj [-7 ]TJ (e)Tj [-7 ]TJ ( En)Tj [-7 ]TJ (gin)Tj
[-6.89999 ]TJ (e)Tj [-7 ]TJ (e)Tj [-7 ]TJ (r)Tj [-0.09999 ]TJ (in)Tj [-7 ]TJ
(g Co)Tj [-7 ]TJ (lle)Tj [-7 ]TJ (ge)Tj [-7 ]TJ ( Co)Tj [-7 ]TJ (un)Tj
[-7 ]TJ (cil m)Tj [-10.09999 ]TJ (e)Tj [3.19999 ]TJ (e)Tj [-7 ]TJ (t)Tj
[-5.29998 ]TJ (in)Tj [-7 ]TJ (g. )Tj ") cmd=
PDF::parseTextLine("T*") cmd=T*
PDF::parseTextLine("0 Tc") cmd=Tc
PDF::parseTextLine("0 Tw") cmd=Tw
PDF::parseTextLine("( )Tj ") cmd=
PDF::parseTextLine("T*") cmd=T*
PDF::parseTextLine("0.00318 Tc") cmd=Tc
PDF::parseTextLine("-0.00099 Tw") cmd=Tw
PDF::parseTextLine("(Gr)Tj [6.19999 ]TJ (adua)Tj [9.29998 ]TJ (t)Tj [1 ]TJ
(e )Tj [10 ]TJ (Stu)Tj [9.29998 ]TJ (dent)Tj [11 ]TJ ( R)Tj [5.39999 ]TJ
(e)Tj [-0.69999 ]TJ (c)Tj [13.19999 ]TJ (r)Tj [6.19999 ]TJ (ui)Tj
[5.39999 ]TJ (tme)Tj [9.29998 ]TJ (n)Tj [-0.69999 ]TJ (t:  F)Tj [14 ]TJ
(a)Tj [-0.69999 ]TJ (cul)Tj [5.39999 ]TJ (t)Tj [1 ]TJ (y)Tj [13.19999 ]TJ
( w)Tj [15.39999 ]TJ (e)Tj [-0.69999 ]TJ (r)Tj [6.19999 ]TJ (e)Tj
[-10.69999 ]TJ ( r)Tj [6.19999 ]TJ (e)Tj [-0.69999 ]TJ (mi)Tj [5.39999 ]TJ
(n)Tj [9.29998 ]TJ (d)Tj [-0.69999 ]TJ (ed)Tj [9.29998 ]TJ ( o)Tj
[9.29998 ]TJ (f)Tj [-9 ]TJ ( g)Tj [9.29998 ]TJ (r)Tj [6.19999 ]TJ (ad)Tj
[9.39999 ]TJ (uat)Tj [11 ]TJ (e)Tj [-0.59999 ]TJ ( )Tj [10 ]TJ (stude)Tj
[9.39999 ]TJ (nt g)Tj [9.29998 ]TJ (e)Tj [-0.69999 ]TJ (t-)Tj ") cmd=
PDF::parseTextLine("T*") cmd=T*
PDF::parseTextLine("-0.00088 Tw") cmd=Tw
PDF::parseTextLine("(tog)Tj [9.19999 ]TJ (e)Tj [-0.79998 ]TJ (ther)Tj
[6.09999 ]TJ ( )Tj [10 ]TJ (on N)Tj [5.29998 ]TJ (o)Tj [-0.79998 ]TJ (v)Tj
[13.09999 ]TJ (e)Tj [-0.79998 ]TJ (m)Tj [6.09999 ]TJ (ber)Tj [6.09999 ]TJ
( )Tj [10 ]TJ (22 e)Tj [9.19999 ]TJ (t)Tj [0.89999 ]TJ (hni)Tj [5.29998 ]TJ
(c )Tj [10 ]TJ (pot-)Tj [6.09999 ]TJ (l)Tj [5.29998 ]TJ (uck,)Tj
[10.89999 ]TJ ( Par)Tj [6.09999 ]TJ (l)Tj [15.29998 ]TJ (or)Tj [6.09999 ]TJ
( A, U)Tj [5.29998 ]TJ (n)Tj [-0.79998 ]TJ (i)Tj [5.29998 ]TJ (o)Tj
[9.19999 ]TJ (n Bl)Tj [5.29998 ]TJ (dg)Tj [9.19999 ]TJ (.  F)Tj
[13.89999 ]TJ (a)Tj [-0.79998 ]TJ (c)Tj [13.09999 ]TJ (u)Tj [-0.69999 ]TJ
(l)Tj [5.29998 ]TJ (t)Tj [0.89999 ]TJ (y)Tj [13.09999 ]TJ ( ar)Tj
[6.09999 ]TJ (e )Tj ") cmd=
PDF::parseTextLine("T*") cmd=T*
PDF::parseTextLine("0.00338 Tc") cmd=Tc
PDF::parseTextLine("-0.00119 Tw") cmd=Tw
PDF::parseTextLine("(ur)Tj [6.39999 ]TJ (g)Tj [9.5 ]TJ (ed to c)Tj
[13.39999 ]TJ (o)Tj [-0.5 ]TJ (m)Tj [6.39999 ]TJ (m)Tj [-3.59999 ]TJ (u)Tj
[9.5 ]TJ (n)Tj [-0.5 ]TJ (i)Tj [5.59999 ]TJ (c)Tj [3.39999 ]TJ (at)Tj
[11.19999 ]TJ (e )Tj [10 ]TJ (to Bo)Tj [9.5 ]TJ (nni)Tj [5.59999 ]TJ (e)Tj
[9.5 ]TJ ( )Tj [10 ]TJ (T)Tj [-5.79998 ]TJ (y)Tj [13.39999 ]TJ (l)Tj
[5.59999 ]TJ (e)Tj [-0.5 ]TJ (r)Tj [6.39999 ]TJ ( and/or)Tj [6.39999 ]TJ
( )Tj [10 ]TJ (Adel)Tj [5.59999 ]TJ ( S)Tj [10.39999 ]TJ (a)Tj [-0.5 ]TJ
(r)Tj [6.39999 ]TJ (o)Tj [9.5 ]TJ (f)Tj [-8.79998 ]TJ (i)Tj [5.59999 ]TJ
(m)Tj [6.39999 ]TJ ( opi)Tj [15.59999 ]TJ (n)Tj [-0.5 ]TJ (i)Tj [5.59999 ]TJ
(o)Tj [-0.39999 ]TJ (ns/)Tj [11.19999 ]TJ (c)Tj [3.39999 ]TJ (om)Tj
[6.39999 ]TJ (me)Tj [9.5 ]TJ (nts )Tj ") cmd=
PDF::parseTextLine("T*") cmd=T*
PDF::parseTextLine("0.00259 Tc") cmd=Tc
PDF::parseTextLine("-0.00039 Tw") cmd=Tw
PDF::parseTextLine("(on r)Tj [5.59999 ]TJ (e)Tj [-1.29998 ]TJ (cr)Tj
[5.59999 ]TJ (ui)Tj [4.79998 ]TJ (ti)Tj [4.79998 ]TJ (ng)Tj [8.69999 ]TJ (
o)Tj [8.69999 ]TJ (ffer)Tj [5.59999 ]TJ (s)Tj [2.69999 ]TJ (, )Tj [10 ]TJ
(as)Tj [12.59999 ]TJ (si)Tj [4.79998 ]TJ (g)Tj [8.69999 ]TJ (n)Tj
[-1.29998 ]TJ (ments,)Tj [10.39999 ]TJ ( )Tj [10 ]TJ (f)Tj [-9.59999 ]TJ
(i)Tj [4.79998 ]TJ (na)Tj [8.69999 ]TJ (nci)Tj [4.79998 ]TJ (a)Tj
[-1.29998 ]TJ (l)Tj [4.79998 ]TJ ( ai)Tj [14.79998 ]TJ (d, etc)Tj
[12.59999 ]TJ (.)Tj [0.39999 ]TJ ( )Tj [10 ]TJ (f)Tj [-9.59999 ]TJ (o)Tj
[-1.29998 ]TJ (r)Tj [5.59999 ]TJ ( )Tj [10 ]TJ (fur)Tj [5.59999 ]TJ (t)Tj
[0.39999 ]TJ (h)Tj [8.69999 ]TJ (e)Tj [-1.29998 ]TJ (r)Tj [5.59999 ]TJ (
di)Tj [4.79998 ]TJ (scu)Tj [8.69999 ]TJ (ssi)Tj [4.79998 ]TJ (on at)Tj
[10.39999 ]TJ ( )Tj ") cmd=
PDF::parseTextLine("T*") cmd=T*
PDF::parseTextLine("0.00299 Tc") cmd=Tc
PDF::parseTextLine("-0.00079 Tw") cmd=Tw
PDF::parseTextLine("(D)Tj [5.19999 ]TJ (e)Tj [-0.89999 ]TJ (cem)Tj [6 ]TJ
(ber)Tj [6 ]TJ ( 7 )Tj [10 ]TJ (facul)Tj [5.19999 ]TJ (t)Tj [0.79998 ]TJ
(y)Tj [13 ]TJ ( m)Tj [6 ]TJ (eeti)Tj [5.19999 ]TJ (n)Tj [-0.89999 ]TJ (g)Tj
[9.09999 ]TJ (.)Tj [0.89999 ]TJ ( )Tj ") cmd=
PDF::parseTextLine("T*") cmd=T*
PDF::parseTextLine("0 Tc") cmd=Tc
PDF::parseTextLine("0 Tw") cmd=Tw
PDF::parseTextLine("( )Tj ") cmd=
PDF::parseTextLine("T*") cmd=T*
PDF::parseTextLine("0.00279 Tc") cmd=Tc
PDF::parseTextLine("-0.00059 Tw") cmd=Tw
PDF::parseTextLine("(T)Tj [-6.39999 ]TJ (e)Tj [8.89999 ]TJ (achi)Tj [5 ]TJ
(ng)Tj [8.89999 ]TJ ( For)Tj [5.79998 ]TJ (m)Tj [5.79998 ]TJ (u)Tj
[-1.09999 ]TJ (l)Tj [5 ]TJ (a)Tj [-1.09999 ]TJ ( )Tj [10 ]TJ (and)Tj
[8.89999 ]TJ ( Sche)Tj [8.89999 ]TJ (dul)Tj [5 ]TJ (e:)Tj [10.59999 ]TJ (
N)Tj [5 ]TJ (o)Tj [-1.09999 ]TJ (t c)Tj [12.79998 ]TJ (o)Tj [-1.09999 ]TJ
(v)Tj [12.79998 ]TJ (e)Tj [-1.09999 ]TJ (r)Tj [5.79998 ]TJ (ed du)Tj
[8.89999 ]TJ (e t)Tj [10.59999 ]TJ (o)Tj [-1 ]TJ ( ti)Tj [5 ]TJ (m)Tj
[5.79998 ]TJ (e)Tj [-1.09999 ]TJ ( co)Tj [8.89999 ]TJ (nstr)Tj [5.79998 ]TJ
(ai)Tj [5 ]TJ (n)Tj [8.89999 ]TJ (t)Tj [0.59999 ]TJ (s.  Ge)Tj [8.89999 ]TJ
(o)Tj [8.89999 ]TJ (f)Tj [0.59999 ]TJ (f)Tj [-9.39999 ]TJ ( )Tj ") cmd=
PDF::parseTextLine("T*") cmd=T*
PDF::parseTextLine("-0.00109 Tc") cmd=Tc
PDF::parseTextLine("0.00329 Tw") cmd=Tw
PDF::parseTextLine("(S)Tj [-4.09999 ]TJ (ilco)Tj [-5 ]TJ (x)Tj [8.89999 ]TJ
( )Tj [-10 ]TJ (w)Tj [11.09999 ]TJ (ill d)Tj [-4.89999 ]TJ (i)Tj
[1.09999 ]TJ (sse)Tj [-5 ]TJ (m)Tj [-8.09999 ]TJ (i)Tj [1.09999 ]TJ (n)Tj
[-5 ]TJ (a)Tj [-5 ]TJ (t)Tj [-3.29998 ]TJ (e)Tj [-5 ]TJ ( )Tj [10 ]TJ (in)Tj
[-5 ]TJ (fo)Tj [-5 ]TJ (rma)Tj [-5 ]TJ (tio)Tj [-5 ]TJ (n)Tj [-5 ]TJ ( )Tj
[10 ]TJ (to)Tj [-5 ]TJ ( )Tj [10 ]TJ (fa)Tj [-5 ]TJ (cu)Tj [-5 ]TJ (lty)Tj
[8.89999 ]TJ ( v)Tj [8.89999 ]TJ (i)Tj [-8.89999 ]TJ (a)Tj [-5 ]TJ ( em)Tj
[-8.09999 ]TJ (a)Tj [-5 ]TJ (il an)Tj [-5 ]TJ (d)Tj [-4.89999 ]TJ ( )Tj
[10 ]TJ (d)Tj [-4.89999 ]TJ (i)Tj [1.09999 ]TJ (scu)Tj [-4.89999 ]TJ (ss
at )Tj [10 ]TJ (a)Tj [-5 ]TJ ( la)Tj [-5 ]TJ (te)Tj [-5 ]TJ (r )Tj ") cmd=
PDF::parseTextLine("T*") cmd=T*
PDF::parseTextLine("-0.00479 Tc") cmd=Tc
PDF::parseTextLine("0 Tw") cmd=Tw
PDF::parseTextLine("(m)Tj [-11.79998 ]TJ (e)Tj [-8.69999 ]TJ (et)Tj [-7 ]TJ
(in)Tj [-8.69999 ]TJ (g.)Tj [-6.89999 ]TJ ( )Tj ") cmd=
PDF::parseTextLine("T*") cmd=T*
PDF::parseTextLine("0 Tc") cmd=Tc
PDF::parseTextLine("( )Tj ") cmd=
PDF::parseTextLine("T*") cmd=T*
PDF::parseTextLine("0.00309 Tc") cmd=Tc
PDF::parseTextLine("-0.00088 Tw") cmd=Tw
PDF::parseTextLine("(Budg)Tj [9.19999 ]TJ (ets: )Tj [10 ]TJ ( N)Tj
[5.29998 ]TJ (o)Tj [-0.79998 ]TJ (t c)Tj [13.09999 ]TJ (o)Tj [-0.69999 ]TJ
(v)Tj [13.09999 ]TJ (e)Tj [-0.79998 ]TJ (r)Tj [6.09999 ]TJ (ed )Tj [10 ]TJ
(due )Tj [10 ]TJ (to ti)Tj [15.29998 ]TJ (me c)Tj [13.09999 ]TJ (onstr)Tj
[6.09999 ]TJ (ai)Tj [15.29998 ]TJ (nts)Tj [13.09999 ]TJ (.)Tj [0.89999 ]TJ
(  Jer)Tj [6.09999 ]TJ (i)Tj [5.29998 ]TJ ( Schr)Tj [6.09999 ]TJ (y)Tj
[13.09999 ]TJ (v)Tj [13.09999 ]TJ (e)Tj [-0.69999 ]TJ (r)Tj [6.09999 ]TJ
( )Tj [-10 ]TJ (w)Tj [15.29998 ]TJ (i)Tj [5.29998 ]TJ (ll)Tj [5.29998 ]TJ
( pr)Tj [6.09999 ]TJ (esent )Tj [10 ]TJ (at a)Tj [9.19999 ]TJ ( )Tj ") cmd=
PDF::parseTextLine("T*") cmd=T*
PDF::parseTextLine("-0.00158 Tc") cmd=Tc
PDF::parseTextLine("0.00379 Tw") cmd=Tw
PDF::parseTextLine("(la)Tj [-5.5 ]TJ (te)Tj [-5.5 ]TJ (r me)Tj [-5.5 ]TJ
(e)Tj [-5.5 ]TJ (t)Tj [-3.79998 ]TJ (in)Tj [-5.5 ]TJ (g. )Tj ") cmd=
PDF::parseTextLine("T*") cmd=T*
PDF::parseTextLine("0 Tc") cmd=Tc
PDF::parseTextLine("0 Tw") cmd=Tw
PDF::parseTextLine("( )Tj ") cmd=
PDF::parseTextLine("T*") cmd=T*
PDF::parseTextLine("-0.00498 Tc") cmd=Tc
PDF::parseTextLine("0.00718 Tw") cmd=Tw
PDF::parseTextLine("(Me)Tj [-8.89999 ]TJ (e)Tj [-8.89999 ]TJ (t)Tj
[-7.19999 ]TJ (in)Tj [-8.89999 ]TJ (g a)Tj [-8.89999 ]TJ (d)Tj [-8.89999 ]TJ
(j)Tj [-2.79998 ]TJ (ou)Tj [-8.89999 ]TJ (rn)Tj [-8.89999 ]TJ (ed)Tj
[-8.89999 ]TJ ( a)Tj [-8.89999 ]TJ (t)Tj [2.79998 ]TJ ( 10)Tj [-8.89999 ]TJ
(:)Tj [-7.19999 ]TJ (3)Tj [-8.89999 ]TJ (0 a)Tj [-8.89999 ]TJ (.)Tj
[2.79998 ]TJ (m)Tj [-12 ]TJ (.)Tj [-7.09999 ]TJ ( )Tj ") cmd=
PDF::parseTextLine("T*") cmd=T*
PDF::parseTextLine("0 Tc") cmd=Tc
PDF::parseTextLine("0 Tw") cmd=Tw
PDF::parseTextLine("( )Tj ") cmd=
PDF::parseTextLine("ET") cmd=ET
PDF::parse: head = ""
PDF::parse: 5539 lines parsed
PDF::parse ends normally
 size = 8938
pick: www.mydomain.com, # servers = 1
htdig: Run complete
htdig: 1 server seen:
htdig:     www.mydomain.com:80 1 document
htmerge: Sorting...
DB2 problem...: missing or empty key value specified

htmerge: Total word count: 0
Deleted, no excerpt:
0/http://www.mydomain.com/minutes/facultyMeetingMinutes11_09_2000.pdf

htmerge: Total documents: 0
htmerge: Total size of documents (in K): 0
Preamble text:


Postamble text:
Note: This message will be sent again if you do not change or
take away the notification of the above mentioned HTML page.

Find out more about the notification service at

    http://www.htdig.org/meta.html

Cheers!

ht://Dig Notification Service

-----Original Message-----
From: [EMAIL PROTECTED]
[mailto:[EMAIL PROTECTED]] On Behalf Of Gilles Detillieux
Sent: Thursday, April 04, 2002 7:39 PM
To: Christian Fredrickson
Cc: Rzepa Henry; [EMAIL PROTECTED]
Subject: Re: [htdig] PDF problems


According to Christian Fredrickson:
> Well, I tried this to no avail. I still receive no errors, but do see:
> Deleted, no excerpt:
> for every PDF file. All my Word docs are parsed fine using doc2html. Yes
> this is version 3.1.6. Any other ideas? This is driving me nuts and many
> documents are PDF format so I have to have them parsed.

OK, but have you determined for sure that htdig is actually
calling doc2html for PDF files too, or is it just doing it for
Word docs?  What does your external_parsers attribute setting
look like?  Are you sure there aren't any problems with it (see
http://www.htdig.org/FAQ.html#q5.31)?

It would really be helpful to see the output of htdig -ivvvv
when start_url is the URL for a single PDF file - that would
tell us a lot about what htdig is actually doing when it gets
a PDF file.
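
One rough way to read such a log is sketched below in Python. It rests on two unverified assumptions: that lines starting with "PDF::" are printed by htdig's own PDF handling code, and that a run of the external converter would show the script name (doc2html) somewhere in the verbose output. The function name summarize_parsers is hypothetical.

```python
def summarize_parsers(log_text):
    """Count log lines that hint at which parser handled a document.

    Assumptions (not confirmed against the htdig source):
    - lines starting with "PDF::" come from htdig's own PDF code;
    - an external converter run would mention "doc2html" in the log.
    """
    builtin = 0
    external = 0
    for line in log_text.splitlines():
        if line.startswith("PDF::"):
            builtin += 1
        elif "doc2html" in line:
            external += 1
    return {"builtin_pdf_lines": builtin, "doc2html_lines": external}


if __name__ == "__main__":
    import sys

    # Usage: htdig -ivvvv ... > htdig.log 2>&1; python check_log.py htdig.log
    with open(sys.argv[1], encoding="latin-1") as handle:
        print(summarize_parsers(handle.read()))
```

Many "PDF::" lines and no mention of doc2html would be consistent with the external_parsers entry for application/pdf never being used, though that reading depends on the assumptions above.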

--
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:
http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930

_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to
<[EMAIL PROTECTED]> with a subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

