Jann,
Unfortunately the client uses Version 4 and now 5 to create his PDF files
and Verity will only index Version 3 and below. However, the actual problem
is not only searching the PDF's but also the database records at the same
time. We ultimately want to get them to the database record with its
associated PDF documents and not to the actual PDF document alone.
Even if we could index them with Verity (We are using CF 4.01) we would need
to perform a second verity search on the database records as well, which
would dilute the relevance of the search since the database records hold
crucial information required for the search. Putting the text of the PDF
into the database would then only require one search and return much more
relevant results.
Best Regards,
Dennis Powers
UXB Internet
(203) 879-2844
http://www.uxbinfo.com/
-----Original Message-----
From: Jann VanOver [mailto:[EMAIL PROTECTED]]
Sent: Friday, July 13, 2001 12:24 PM
To: CF-Talk
Subject: RE: Extracting Text from PDF Documents with CF
I thought Verity could index the text in PDFs automatically! It did with
previous versions of Acrobat. You can create a Verity index that indexes
your database AND other files (pdfs) that you want. Try it and see.
-----Original Message-----
From: Dennis Powers [mailto:[EMAIL PROTECTED]]
Sent: Friday, July 13, 2001 9:00 AM
To: CF-Talk
Subject: Extracting Text from PDF Documents with CF
Hi,
I am wondering if anyone has a method of extracting the raw (unformatted)
text from a PDF file using CF? I have a project were we need to index PDF
files AND associated information in a database. We are currently using
Verity for searching the database with great success but now we need to
index the PDF files that are associated with the data records in the
database.
When a user uploads a new PDF I would like to extract the text from it and
add it to the database with the other information. Then I can use Verity to
search all the data fields AND the PDF text data field.
A CFX or a COM object would be nice so that I can call it from CF. I would
be very appreciative if anyone can steer me to a tag or object that can
accomplish this task.
Best Regards,
Dennis Powers
UXB Internet
(203) 879-2844
http://www.uxbinfo.com/
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Structure your ColdFusion code with Fusebox. Get the official book at
http://www.fusionauthority.com/bkinfo.cfm
Archives: http://www.mail-archive.com/[email protected]/
Unsubscribe: http://www.houseoffusion.com/index.cfm?sidebar=lists