Here is what happens when you don't look at the code first.

Our system pulls the PDF files down into a local folder, then runs the
index, but...

We cheat.

We use a local key:
key="g:\production\opinions\mi\coaunpub\20060523_C256194_50_256194O.ON.PDF"

and an external URL:
urlpath="http://courtofappeals.mijud.net/documents/OPINIONS/FINAL/COA/";
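Put together, the call looks roughly like the sketch below. The collection name "opinions" is made up for illustration, and I am assuming type="file" and action="update"; only the key and urlpath values are the real ones from above. The collection has to exist already.

```cfm
<!--- Hypothetical collection name; substitute your own. --->
<cfset collectionName = "opinions">

<!--- Local copy of the PDF. Verity reads the text from this file. --->
<cfset localFile = "g:\production\opinions\mi\coaunpub\20060523_C256194_50_256194O.ON.PDF">

<!--- Public location of the same documents. Only used to build result links. --->
<cfset publicBase = "http://courtofappeals.mijud.net/documents/OPINIONS/FINAL/COA/">

<!--- Assumed: type="file" indexes the single file named in key. --->
<cfindex action="update"
         collection="#collectionName#"
         type="file"
         key="#localFile#"
         urlpath="#publicBase#"
         language="English">
```

As far as I know, cfsearch then prefixes urlpath to the file name and hands that back in the url column, so the result links point at the court site even though the text came from the local copy.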

So this may not actually help you at all. We index the contents of the
local file and only use the URL for the result link, which is exactly the
opposite of what you need.
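For what it's worth, the "fake it" spider I suggested earlier (quoted further down) could be sketched roughly as follows. This is untested against your setup: the webaddress value is a placeholder, the 30 second timeout is arbitrary, and type="custom" with the fetched page as the body is my assumption about how to get the processed output into the collection.

```cfm
<!--- Placeholder: the public base URL of this directory. --->
<cfset webaddress = "http://www.example.com/">

<!--- File name of this script, so the loop can skip it. --->
<cfset thisTemplate = GetFileFromPath(GetCurrentTemplatePath())>

<cfdirectory action="list" directory="#ExpandPath('.')#"
             name="directory_list" filter="*.cfm">

<cfloop query="directory_list">
    <!--- Skip this script itself, or it would keep requesting itself. --->
    <cfif directory_list.name neq thisTemplate>
        <cfset pageUrl = webaddress & directory_list.name>

        <!--- Fetch the processed page over HTTP, as a browser would see it. --->
        <cfhttp url="#pageUrl#" method="get" timeout="30"></cfhttp>

        <!--- action="update" adds or replaces this one key. action="refresh"
              purges the collection first, so in a loop only the last page
              would survive. --->
        <cfindex action="update"
                 collection="veritysearch"
                 type="custom"
                 key="#pageUrl#"
                 title="#directory_list.name#"
                 body="#cfhttp.FileContent#">
    </cfif>
</cfloop>
```

With type="custom" the key column in the cfsearch results holds the page URL, so you would link to #key# when you output the results.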

On 5/24/06, Crow T. Robot <[EMAIL PROTECTED]> wrote:
> OK, done that, or I should say this is running in the background right now.
>
> One thing I'm confused about though.  In the following snippet of code,
> isn't the collection "veritysearch" just constantly reindexing itself?
> So, it would only reindex the last "spidered" page?
>
> <cfdirectory action="list" directory="#ExpandPath('.')#"
> name="directory_list" filter="*.cfm">
>
> <cfloop query="directory_list">
>         <cfindex action="REFRESH"
>           collection="veritysearch"
>           key="#webaddress##name#"
>           type="PATH"
>           urlpath="#webaddress#"
>           extensions=".cfm"
>           recurse="Yes"
>           language="English">
> </cfloop>
>
> After this runs, all my searches just come up blanko.
>
> Did I miss your original point?
>
>
>
> Jerry Johnson wrote:
> > Crow,
> >
> > You should be able to "fake it" by building your own spider (just do a
> > directory listing, turn the filename into a url, and pass the url to
> > the indexer, one beautiful page at a time).
> >
> > Yes, I've done this (last summer) to index PDFs stored on Federal
> > Court websites.
> >
> > On 5/24/06, Crow T. Robot <[EMAIL PROTECTED]> wrote:
> >> Hrm...that sucks, looks like this won't work in a shared environment,
> >> eh?  Or, I should say that it won't work unless I get my host to make
> >> the changes for me.  Which we all know will never happen.
> >>
> >> I can't wait until we get our own server.  2 months and counting!
> >>
> >> Jim Wright wrote:
> >>> On 5/24/06, Ray Champagne <[EMAIL PROTECTED]> wrote:
> >>>> I am creating a very simple site search using Verity.  When outputting
> >>>> the search results (using #summary#), the various variable names used in
> >>>> the text of the pages are getting output as their variable names, not as
> >>>> their values.  Is there some trick I'm missing?
> >>>> --
> >>> You aren't missing any trick...cfindex does a spider of the
> >>> filesystem, and views your .cfm files as text files.  Unfortunately,
> >>> it doesn't include any http based spidering capability (which would
> >>> see the processed pages).  There is a way to do this using verity's
> >>> vspider utility....
> >>>
> >>> http://www.adobe.com/devnet/coldfusion/articles/vspider.html
> >>>
> >>
> >
> >
>
> 

