At 11:34 AM 4/26/2004, Anu Vaidyanathan wrote:
> but google.com isn't the index - it's the search page - not much there for
> htdig to index.

ok.. what does an "index" page look like. For example, the first link that
is returned on the search string "april fool" on google is:

www.2meta.com/april-fools/

can I put this on my start_url and have htdig create an index of all pages
served up by www.2meta.com - i.e, would this be an index page?

> what about rundig -vv - should be less verbose, but maybe more meaningful
> information.

OK.

> So did you run htmerge?

like the next message said - i thought the steps were:
./configure
make
make install
rundig
htsearch -c htdig.conf

while we are at it, I assume that the output of htsearch is what I see on
the screen after it runs.

cheers
a
once you have installed properly and edited htconfig.conf to your liking you need only execute rundig for the indexing to take place.

using google.com as your start url *should* give you something in your database as there are links to follow but you wont get any "april fools" related data. You would need to set your start url to something like http://www.google.com/search?q=april+fools but I don't if that will work or not. Even if it did it would seem pretty useless unless you hacked the source to allow a variable in the start url definition (http://www.google.com/search?q=SEARCH_WORDS)...




------------------------------------------------------- This SF.net email is sponsored by: The Robotic Monkeys at ThinkGeek For a limited time only, get FREE Ground shipping on all orders of $35 or more. Hurry up and shop folks, this offer expires April 30th! http://www.thinkgeek.com/freeshipping/?cpg=12297 _______________________________________________ ht://Dig general mailing list: <[EMAIL PROTECTED]> ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html List information (subscribe/unsubscribe, etc.) https://lists.sourceforge.net/lists/listinfo/htdig-general

Reply via email to