This is not a CF solution but it may at least help with what it has to trawl
through  - in any case this will help anything else that has to call or
access the document. 

This is for PDF files but you might consider converting Office files to PDF
at your discretion of course - a properly prepared PDF version of an Office
document can be up to a quarter of the file size of the source document -
that's useful and print and view quality is not compromised.

I'm a big fan of PDF but unfortunately it's a file format that suffers a lot
from bad file preparation -  the result is unnecessarily big files amongst
other things. 

Try optimising all the PDFs to see if this reduces the size of some of the
files - I suspect it might.  You'll need to check that the output settings
(e.g. print resolution, image resolution etc.) are suitable for the end
purpose of the document but from my experience the default settings are
usually quite good.

The good news is you can automate this process over the entire file system
with Acrobat Pro's batch feature.

Hope that helps in some way!


++++++++++
Kevin Parker

++++++++++

-----Original Message-----
From: Les Mizzell [mailto:lesm...@bellsouth.net] 
Sent: Thursday, 9 April 2015 8:23 AM
To: cf-talk
Subject: cfindex is taking forever


I'm working on building a search interface for a "document depo" on a site.
The document folder has files going all the way back to 2005, and includes a
number of 10+ meg pdf files,  a few that are over 20 megs, countless Word
and Excel files, Power Point presentations....

I don't have access to the CFAdministrator, so:

<cfcollection
     action = "create"
     categories = "no"
     collection = "docDEPO"
     engine = "verity"
     language = "English"
     path = "#req.path#\collections\">

<cfindex
     collection="docDEPO"
     action="refresh"
     type="path"
     key="#req.path#\documentdepot\"
     language="English"
     status="info"
     extensions=".pdf,.pptx,.docx,.doc,.xls,.xlsx,.ppsx,.txt, ppt">


The collection was created successfully as far as I can tell. However,
indexing has been running (or at least the wheel on my browser is still
turning) for almost 3 hours now. I'm going to forget about it and go mow my
grass and see what's happening when I finish.

I'm thinking though ... too much stuff to index? Or is amount of time not
out of line for a very large collection of files?
Also, I've not been able to find a list of legally accepted extensions. 
I might have something listed that's just going to cause it to crap out
anyway.

Thoughts? Try something else? What exactly?



~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~|
Order the Adobe Coldfusion Anthology now!
http://www.amazon.com/Adobe-Coldfusion-Anthology/dp/1430272155/?tag=houseoffusion
Archive: 
http://www.houseoffusion.com/groups/cf-talk/message.cfm/messageid:360439
Subscription: http://www.houseoffusion.com/groups/cf-talk/subscribe.cfm
Unsubscribe: http://www.houseoffusion.com/groups/cf-talk/unsubscribe.cfm

Reply via email to