David, thanks for your suggestion
The trouble is, htdig's output looks fine to me, seems to get the Content-Type correct, the length looks sensible at 29122 bytes, it just doesn't put anything it finds into its database scratch files. It lists the text from the pdf when in -vvvv mode, so it's not one of those pdf-image issues.
Output is listed below
Any other thoughts?
Steve
title: Atelier Ten Web Graphics
image: http://192.168.1.2/pdfs/TSB_Exterior_thumb.gif
href: http://192.168.1.2/pdfs/phoenix.pdf (support images)
resolving 'http://192.168.1.2/pdfs/phoenix.pdf'
pushing http://192.168.1.2/pdfs/phoenix.pdf
+ size = 1186
pick: 192.168.1.2, # servers = 1
1:1:1:http://192.168.1.2/pdfs/phoenix.pdf: Retrieval command for http://192.168.1.2/pdfs/phoenix.pdf: GET /pdfs/phoenix.pdf HTTP/1.0
User-Agent: htdig/3.1.6 ([EMAIL PROTECTED])
Referer: http://192.168.1.2/
Host: 192.168.1.2
Header line: HTTP/1.1 200 OK
Header line: Date: Tue, 12 Mar 2002 20:00:42 GMT
Header line: Server: Apache/1.3.20 (Linux/SuSE) PHP/4.0.6
Header line: Last-Modified: Thu, 14 Jun 2001 08:59:02 GMT
Converted Thu, 14 Jun 2001 08:59:02 GMT to Thu, 14 Jun 2001 08:59:02
Header line: ETag: "9813c-71c2-3b287cd6"
Header line: Accept-Ranges: bytes
Header line: Content-Length: 29122
Header line: Connection: close
Header line: Content-Type: application/pdf
Header line:
returnStatus = 0
Read 8192 from document
Read 8192 from document
Read 8192 from document
Read 4546 from document
Read a total of 29122 bytes
PDF::setContents(29122 bytes)
PDF::parse(http://192.168.1.2/pdfs/phoenix.pdf)
PDF::parse: 19272 lines parsed
PDF::parse ends normally
size = 29122
pick: 192.168.1.2, # servers = 1
________________________________________________________________________
This e-mail has been scanned for all viruses by Star Internet. The
service is powered by MessageLabs. For more information on a proactive
anti-virus service working around the clock, around the globe, visit:
http://www.star.net.uk
________________________________________________________________________

