This is pretty obvious, but I will mention it just-in-case.
PDFs can either be text or graphics or both
The graphic only (and sometimes the both) will not index properly, since there is no text for the indexer to find.
The easiest way to test this is to open the pdf, select the text tool, and try to highlight and copy text from the document.
If you can copy text from it, it is a text or text and graphic pdf.
If these turn out to be text-based PDFs, the only other thing I can think is that htey are copy-protected somehow, and the indexer can't open them to read.
Jerry Johnson
>>> [EMAIL PROTECTED] 03/30/04 07:49AM >>>
Hey all,
Does anyone use Ultraseek for their internal search engine? We do, but the
administrator is rather protective of anyone but him seeing anything with
the docs. Of course, we have a site where it's not working correctly, for
some unknown reason (A bunch of pdfs - which get indexed, but fail to show
up when you search for words in the text, show up if you search for the name
of the document. Some are sans description, while others have one. Can't
find a pattern. Server admin says it's problems with the pdf's. Pdf people
say it's problems with the server. I'm stuck in the middle. Google indexes
everything just dandy - so why can't Ultraseek (say the PDF people).)
Any thoughts? I'm just a wee tad frustrated by this one.
[Todays Threads] [This Message] [Subscription] [Fast Unsubscribe] [User Settings]
