Verity requires a filter for the indexing of PDF
documents. This filter is provided with CF in most
cases.
The version of Verity that is included in ColdFusion
4.5.1 for Linux and HP-UX does not include a filter
for PDF files.

Regards,
Rob
--- Jann VanOver <[EMAIL PROTECTED]> wrote:
> I thought Verity could index the text in PDFs
> automatically!  It did with
> previous versions of Acrobat.  You can create a
> Verity index that indexes
> your database AND other files (pdfs) that you want. 
> Try it and see.
> 
> -----Original Message-----
> From: Dennis Powers [mailto:[EMAIL PROTECTED]]
> Sent: Friday, July 13, 2001 9:00 AM
> To: CF-Talk
> Subject: Extracting Text from PDF Documents with CF
> 
> 
> Hi,
> 
> I am wondering if anyone has a method of extracting
> the raw (unformatted)
> text from a PDF file using CF?  I have a project
> were we need to index PDF
> files AND associated information in a database.  We
> are currently using
> Verity for searching the database with great success
> but now we need to
> index the PDF files that are associated with the
> data records in the
> database.
> 
> When a user uploads a new PDF I would like to
> extract the text from it and
> add it to the database with the other information. 
> Then I can use Verity to
> search all the data fields AND the PDF text data
> field.
> 
> A CFX or a COM object would be nice so that I can
> call it from CF. I would
> be very appreciative if anyone can steer me to a tag
> or object that can
> accomplish this task.
> 
> Best Regards,
> 
> Dennis Powers
> UXB Internet
> (203) 879-2844
> http://www.uxbinfo.com/
>
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Structure your ColdFusion code with Fusebox. Get the official book at 
http://www.fusionauthority.com/bkinfo.cfm

Archives: http://www.mail-archive.com/[email protected]/
Unsubscribe: http://www.houseoffusion.com/index.cfm?sidebar=lists

Reply via email to