On Wed, Mar 05, 2003 at 04:20:39AM -0800, Erick Papadakis wrote: > 1. This time, the first time I ran rundig I started only with a small > subset of my websit (the "start_url" was only a subfolder). Now that all > is working I have set the "start_url" to my main domain name, but the > database does not get refreshed with the new words at all! I read up the > documentation and it says that "htdig -i" should do it but apparently it > does not?!
Sometimes I have to try a couple of times. Make sure there are links from your start_url to all sub-folders you want indexed (links can be on secondary pages and don't have to be on the top page). Read the output from htdig to see where it's getting caught up. I usually do this: htdig -i -c config.conf -v > htdig.01.txt And then I scan through the text file as the program is running (or after it's finished). > 2. Secondly, how to enable phrase searching? I.e., if I look for the > words using the search form "open source" (including the quotes) I still > find results containing just the word "open". I want to be able to have a > feature where if the user puts in double quotes or selects an option that > says "EXACT SEARCH" in my form, then I want to search only for the phrase > between quotes. These might be useful: http://htdig.org/hts_method.html http://htdig.org/hts_form.html You want to look for information on BOOLEAN searching or use the AND method. -- Emma Jane Hogbin [[ 416 417 2868 ][ www.xtrinsic.com ]] ------------------------------------------------------- This sf.net email is sponsored by:ThinkGeek Welcome to geek heaven. http://thinkgeek.com/sf _______________________________________________ htdig-general mailing list <[EMAIL PROTECTED]> To unsubscribe, send a message to <[EMAIL PROTECTED]> with a subject of unsubscribe FAQ: http://htdig.sourceforge.net/FAQ.html

