Here is what happens when you don't look at the code first. Our system pulls the PDF files down into a local folder, then runs the index, but...
We cheat. We use a local Key

key="g:\production\opinions\mi\coaunpub\20060523_C256194_50_256194O.ON.PDF"

and an external URL

urlpath="http://courtofappeals.mijud.net/documents/OPINIONS/FINAL/COA/"

So this may not actually help you at all. Exactly the opposite of what you need.

On 5/24/06, Crow T. Robot <[EMAIL PROTECTED]> wrote:
> OK, done that, or I should say this is running in the background right now.
>
> One thing I'm confused about though. In the following snippet of code,
> isn't the collection "veritysearch" just constantly reindexing itself?
> So, it would only reindex the last "spidered" page?
>
> <cfdirectory action="list" directory="#ExpandPath('.')#"
> name="directory_list" filter="*.cfm">
>
> <cfloop query="directory_list">
> <cfindex action="REFRESH"
> collection="veritysearch"
> key="#webaddress##name#"
> type="PATH"
> urlpath="#webaddress#"
> extensions=".cfm"
> recurse="Yes"
> language="English">
> </cfloop>
>
> After this runs, all my searches just come up blanko.
>
> Did I miss your original point?
>
>
>
> Jerry Johnson wrote:
> > Crow,
> >
> > You should be able to "fake it" by building your own spider (just do a
> > directory listing, turn the filename into a url, and pass the url to
> > the indexer, one beautiful page at a time).
> >
> > Yes, I've done this (last summer) to index PDFs stored on Federal
> > Court websites.
> >
> > On 5/24/06, Crow T. Robot <[EMAIL PROTECTED]> wrote:
> >> Hrm...that sucks, looks like this won't work in a shared environment,
> >> eh? Or, I should say that it won't work unless I get my host to make
> >> the changes for me. Which we all know will never happen.
> >>
> >> I can't wait until we get our own server. 2 months and counting!
> >>
> >> Jim Wright wrote:
> >>> On 5/24/06, Ray Champagne <[EMAIL PROTECTED]> wrote:
> >>>> I am creating a very simple site search using Verity. When outputting
> >>>> the search results (using #summary#), the various variable names used in
> >>>> the text of the pages are getting output as their variable names, not as
> >>>> their values. Is there some trick I'm missing?
> >>>> --
> >>> You aren't missing any trick...cfindex does a spider of the
> >>> filesystem, and views your .cfm files as text files. Unfortunately,
> >>> it doesn't include any http based spidering capability (which would
> >>> see the processed pages). There is a way to do this using verity's
> >>> vspider utility....
> >>>
> >>> http://www.adobe.com/devnet/coldfusion/articles/vspider.html
> >>>
> >>
> >

Message: http://www.houseoffusion.com/lists.cfm/link=i:4:241339
Archives: http://www.houseoffusion.com/cf_lists/threads.cfm/4
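
For what it's worth, the "build your own spider" idea quoted above (directory listing, filename to URL, hand each page to the indexer) might look roughly like the sketch below. It is untested and makes several assumptions: siteUrl is a made-up placeholder, the pages are reachable over HTTP from the server itself, and each page is fetched with cfhttp and indexed as type="custom" rather than type="PATH", so Verity sees the processed output instead of the raw .cfm source. It also uses action="update" instead of "REFRESH", since as I read the docs a refresh purges the collection before updating it, which would explain why only the last page survives the loop.

```cfml
<!--- Hypothetical base URL; replace it with the site's real address. --->
<cfset siteUrl = "http://www.example.com/">

<!--- Name of this template, so the spider does not request itself. --->
<cfset thisFile = GetFileFromPath(GetCurrentTemplatePath())>

<!--- List the .cfm templates in the current directory. --->
<cfdirectory action="list"
             directory="#ExpandPath('.')#"
             name="directory_list"
             filter="*.cfm">

<cfloop query="directory_list">
    <cfif directory_list.name NEQ thisFile>
        <!--- Turn the filename into a URL and fetch the processed page. --->
        <cfset pageUrl = siteUrl & directory_list.name>
        <cfhttp url="#pageUrl#" method="get" timeout="30">

        <!--- Index only pages that came back with HTTP status 200. --->
        <cfif Left(cfhttp.statusCode, 3) EQ "200">
            <cfindex action="update"
                     collection="veritysearch"
                     type="custom"
                     key="#pageUrl#"
                     title="#directory_list.name#"
                     body="#cfhttp.fileContent#">
        </cfif>
    </cfif>
</cfloop>
```

The body indexed this way still contains the HTML markup, so #summary# may show tags unless they are stripped before the cfindex call. Using the page URL as the key means the search results can link straight to it.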

