On Fri, 2005-07-22 at 01:08 -0300, Manuel Lemos wrote:
> on 07/19/2005 08:10 PM Christopher Murtagh said the following:
> > To do an incremental index:
> >
> > echo URL_list.txt | htdig -m foo -c conf_file.conf -
> >
> > (notice the trailing '-'). Making this work wasn't obvious, but I had a
> > bit of help from the list, and it's all working for me now.
>
> hummm... I had the impression from a message posted in this list that
> when you do incremental indexing, HtDig will still traverse all pages
> but just performs HEAD requests to verify whether other pages were
> updated. Is this what happens or I misunderstood the point of this?
>
> Another thing that confuses me about the example above is the parameter
> that follows the -m switch. If it is supposed to read from STDIN, why
> foo and not just - ?

Yeah, I can't remember exactly why, other than it didn't work if I
didn't do it. Sorry, it was a while ago when I set things up. A smarter
person would have documented what I did, but I was swamped and didn't. :-)

> Other than that, if I want to update existing index database files,
> letting the users search the current databases while htdig finishes,
> adding -a switch to the htdig command line will work ok when just
> updating a few URLs as you suggest?

I use htdig for several things, including indexing results of
PostgreSQL queries and joins. For example, if you go to:

 http://www.mcgill.ca/classified/

The search tool uses htdig, embedded inside PostgreSQL (via stored
procedures that call htdig). Same goes for:

 http://www.mcgill.ca/search/

Just about everything there uses htdig, inside PostgreSQL and with a
PHP wrapper.

Cheers,

Chris

_______________________________________________
ht://Dig Developer mailing list: [email protected]
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-dev
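
For reference, the incremental-index command quoted above can be sketched as a small dry-run helper. This is only an illustration, not the setup actually used: the function name htdig_incremental_cmd and its "alt" argument are made up here, URL_list.txt, conf_file.conf and the "foo" argument to -m are the placeholders from the quoted command, and it assumes URL_list.txt is a file of URLs whose contents should reach htdig on standard input (which would need a redirect or cat rather than echo). The optional -a reflects the documented "alternate work files" switch asked about in the quoted question. Whether it behaves well for a handful of URLs is not confirmed in this thread. The helper only prints the command line, so nothing is indexed.

```shell
#!/bin/sh
# Sketch only. URL_list.txt, conf_file.conf and the "foo" argument to -m are
# the placeholders from the quoted command; this has not been run against htdig.

# Print (do not run) an incremental-index command line.
#   $1  file containing one URL per line
#   $2  htdig configuration file
#   $3  optional: "alt" adds -a so htdig writes to alternate work files
htdig_incremental_cmd() {
    url_list=$1
    conf=$2
    alt_flag=""
    if [ "${3:-}" = "alt" ]; then
        alt_flag="-a "
    fi
    # Assumption: the URL list is fed on stdin, matched by the trailing '-'.
    printf 'htdig %s-m foo -c %s - < %s\n' "$alt_flag" "$conf" "$url_list"
}

# Show both variants for inspection before running anything by hand.
htdig_incremental_cmd URL_list.txt conf_file.conf
htdig_incremental_cmd URL_list.txt conf_file.conf alt
```

Printing the command first makes it easy to check the -m argument and the trailing '-' before touching a live index.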
